a3c | [RLlib] Fix PyTorch A3C / A2C loss function using mixed reduced sum / mean (#11449) | 2020-10-22 12:39:34 -07:00 |
ars | [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) | 2020-10-12 13:49:48 -07:00 |
ddpg | [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) | 2020-10-06 20:28:16 +02:00 |
dqn | [RLlib] SAC algo cleanup. (#10825) | 2020-09-20 11:27:02 +02:00 |
dreamer | [RLlib] ConvTranspose2D module (#11231) | 2020-10-12 15:00:42 +02:00 |
es | [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) | 2020-10-12 13:49:48 -07:00 |
impala | [RLlib] SAC algo cleanup. (#10825) | 2020-09-20 11:27:02 +02:00 |
maml | [RLlib] Do not create env on driver iff num_workers > 0. (#11307) | 2020-10-15 18:21:30 +02:00 |
marwil | [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) | 2020-10-06 20:28:16 +02:00 |
mbmpo | [RLlib] Do not create env on driver iff num_workers > 0. (#11307) | 2020-10-15 18:21:30 +02:00 |
pg | [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) | 2020-10-12 13:49:48 -07:00 |
ppo | [rllib] Add torch_distributed_backend flag for DDPPO (#11362) (#11425) | 2020-10-21 18:30:42 -07:00 |
qmix | add large data warning (#10957) | 2020-09-23 15:46:06 -07:00 |
sac | [RLlib] SAC: log_alpha not being learnt when on GPU. (#11298) | 2020-10-12 13:48:44 -07:00 |
__init__.py | [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) | 2020-10-06 20:28:16 +02:00 |
callbacks.py | [RLlib] Issue 10469: Callbacks should receive env idx ... (#10477) | 2020-09-03 17:27:05 +02:00 |
mock.py | [tune] Use public methods for trainable (#9184) | 2020-07-01 11:00:00 -07:00 |
registry.py | [RLlib] Behavioral Cloning (from MARWIL). (#10619) | 2020-09-09 17:33:21 +02:00 |
trainer.py | Remove memory quota enforcement from actors (#11480) | 2020-10-21 14:29:03 -07:00 |
trainer_template.py | [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) | 2020-10-06 20:28:16 +02:00 |