.. |
a3c
|
[RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707)
|
2021-05-19 16:32:29 +02:00 |
ars
|
[RLlib] Fix seeding for ES and ARS. (#16744)
|
2021-07-19 13:13:05 -04:00 |
cql
|
[rllib] Enhancements to Input API for customizing offline datasets (#16957)
|
2021-07-10 15:05:25 -07:00 |
ddpg
|
[RLlib] Redo fix bug normalize vs unsquash actions (original PR made log-likelihood test flakey). (#17014)
|
2021-07-13 14:01:30 -04:00 |
dqn
|
[RLlib] DQN (Rainbow): Fix torch noisy layer support and loss (#16716)
|
2021-07-13 16:48:06 -04:00 |
dreamer
|
[CI] Upgrade flake8 to 3.9.1 (#15527)
|
2021-05-03 14:23:28 -07:00 |
es
|
[RLlib] Fix seeding for ES and ARS. (#16744)
|
2021-07-19 13:13:05 -04:00 |
impala
|
[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031)
|
2021-07-19 13:16:03 -04:00 |
maml
|
[RLlib] CQL TensorFlow support (#15841)
|
2021-05-18 11:10:46 +02:00 |
marwil
|
[RLlib] MARWIL + BC: Various fixes and enhancements. (#16218)
|
2021-06-03 22:29:00 +02:00 |
mbmpo
|
[RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393)
|
2021-03-08 15:41:27 +01:00 |
pg
|
[RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684)
|
2021-04-27 10:44:54 +02:00 |
ppo
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
qmix
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
sac
|
[RLlib] Redo fix bug normalize vs unsquash actions (original PR made log-likelihood test flakey). (#17014)
|
2021-07-13 14:01:30 -04:00 |
slateq
|
[RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393)
|
2021-03-08 15:41:27 +01:00 |
tests
|
[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031)
|
2021-07-19 13:16:03 -04:00 |
__init__.py
|
[RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. (#15815)
|
2021-05-18 13:23:00 +02:00 |
callbacks.py
|
Retry remove gpustat dependency (#17115)
|
2021-07-19 11:14:10 -07:00 |
mock.py
|
Auto report object store memory usage; remove some deprecated code (#14260)
|
2021-03-01 13:19:44 -08:00 |
registry.py
|
[RLlib] R2D2 Implementation. (#13933)
|
2021-02-25 12:18:11 +01:00 |
trainer.py
|
[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031)
|
2021-07-19 13:16:03 -04:00 |
trainer_template.py
|
[RLlib] Issue 15973: Trainer.with_updates(validate_config=...) behaves confusingly. (#16429)
|
2021-06-19 22:42:00 +02:00 |