ray/rllib/agents
2020-10-22 12:39:34 -07:00
a3c [RLlib] Fix PyTorch A3C / A2C loss function using mixed reduced sum / mean (#11449) 2020-10-22 12:39:34 -07:00
ars [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) 2020-10-12 13:49:48 -07:00
ddpg [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00
dqn [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
dreamer [RLlib] ConvTranspose2D module (#11231) 2020-10-12 15:00:42 +02:00
es [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) 2020-10-12 13:49:48 -07:00
impala [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
maml [RLlib] Do not create env on driver iff num_workers > 0. (#11307) 2020-10-15 18:21:30 +02:00
marwil [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00
mbmpo [RLlib] Do not create env on driver iff num_workers > 0. (#11307) 2020-10-15 18:21:30 +02:00
pg [RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) 2020-10-12 13:49:48 -07:00
ppo [rllib] Add torch_distributed_backend flag for DDPPO (#11362) (#11425) 2020-10-21 18:30:42 -07:00
qmix add large data warning (#10957) 2020-09-23 15:46:06 -07:00
sac [RLlib] SAC: log_alpha not being learnt when on GPU. (#11298) 2020-10-12 13:48:44 -07:00
__init__.py [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00
callbacks.py [RLlib] Issue 10469: Callbacks should receive env idx ... (#10477) 2020-09-03 17:27:05 +02:00
mock.py [tune] Use public methods for trainable (#9184) 2020-07-01 11:00:00 -07:00
registry.py [RLlib] Behavioral Cloning (from MARWIL). (#10619) 2020-09-09 17:33:21 +02:00
trainer.py Remove memory quota enforcement from actors (#11480) 2020-10-21 14:29:03 -07:00
trainer_template.py [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00