ray/rllib/agents at d1dd5d578e0cbdee651eeb125dcb78e36b9cf322 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Kingsley Kuan d1dd5d578e [RLlib] Fix PyTorch A3C / A2C loss function using mixed reduced sum / mean (#11449 )		2020-10-22 12:39:34 -07:00
..
a3c	[RLlib] Fix PyTorch A3C / A2C loss function using mixed reduced sum / mean (#11449 )	2020-10-22 12:39:34 -07:00
ars	[RLlib] ARS/ES eval workers not working: Issue 9933. (#11308 )	2020-10-12 13:49:48 -07:00
ddpg	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033 )	2020-10-06 20:28:16 +02:00
dqn	[RLlib] SAC algo cleanup. (#10825 )	2020-09-20 11:27:02 +02:00
dreamer	[RLlib] ConvTranspose2D module (#11231 )	2020-10-12 15:00:42 +02:00
es	[RLlib] ARS/ES eval workers not working: Issue 9933. (#11308 )	2020-10-12 13:49:48 -07:00
impala	[RLlib] SAC algo cleanup. (#10825 )	2020-09-20 11:27:02 +02:00
maml	[RLlib] Do not create env on driver iff num_workers > 0. (#11307 )	2020-10-15 18:21:30 +02:00
marwil	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033 )	2020-10-06 20:28:16 +02:00
mbmpo	[RLlib] Do not create env on driver iff num_workers > 0. (#11307 )	2020-10-15 18:21:30 +02:00
pg	[RLlib] ARS/ES eval workers not working: Issue 9933. (#11308 )	2020-10-12 13:49:48 -07:00
ppo	[rllib] Add torch_distributed_backend flag for DDPPO (#11362 ) (#11425 )	2020-10-21 18:30:42 -07:00
qmix	add large data warning (#10957 )	2020-09-23 15:46:06 -07:00
sac	[RLlib] SAC: log_alpha not being learnt when on GPU. (#11298 )	2020-10-12 13:48:44 -07:00
__init__.py	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033 )	2020-10-06 20:28:16 +02:00
callbacks.py	[RLlib] Issue 10469: Callbacks should receive env idx ... (#10477 )	2020-09-03 17:27:05 +02:00
mock.py	[tune] Use public methods for trainable (#9184 )	2020-07-01 11:00:00 -07:00
registry.py	[RLlib] Behavioral Cloning (from MARWIL). (#10619 )	2020-09-09 17:33:21 +02:00
trainer.py	Remove memory quota enforcement from actors (#11480 )	2020-10-21 14:29:03 -07:00
trainer_template.py	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033 )	2020-10-06 20:28:16 +02:00