ray/rllib/agents at d4413299c0ba877f53f0ca6774716bf968008a32 - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-10 13:26:39 -04:00

Sven Mika 0de41e4a6b [RLlib] Trainer sub-class QMIX/MAML/MB-MPO (instead of `build_trainer`). (#20639)	2021-12-02 13:17:10 +01:00
a3c	[RLlib] Trainer sub-class A2C/A3C (instead of `build_trainer`). (#20635)	2021-11-24 22:07:13 +01:00
ars	[RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571)	2021-11-23 23:01:05 +01:00
cql	[RLlib] Use SampleBrach instead of input dict whenever possible (#20746)	2021-12-02 13:11:26 +01:00
ddpg	[RLlib] Use SampleBrach instead of input dict whenever possible (#20746)	2021-12-02 13:11:26 +01:00
dqn	[RLlib] Trainer sub-class QMIX/MAML/MB-MPO (instead of `build_trainer`). (#20639)	2021-12-02 13:17:10 +01:00
dreamer	[RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982)	2021-11-03 10:00:46 +01:00
es	[RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571)	2021-11-23 23:01:05 +01:00
impala	[RLlib] Trainer sub-class IMPALA (instead of using `build_trainer()`). (#20570)	2021-11-30 19:08:36 +01:00
maml	[RLlib] Trainer sub-class QMIX/MAML/MB-MPO (instead of `build_trainer`). (#20639)	2021-12-02 13:17:10 +01:00
marwil	[RLlib] Replay buffer API (cleanups; docstrings; renames; move into `rllib/execution/buffers` dir) (#20552)	2021-11-19 11:57:37 +01:00
mbmpo	[RLlib] Trainer sub-class QMIX/MAML/MB-MPO (instead of `build_trainer`). (#20639)	2021-12-02 13:17:10 +01:00
pg	[RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571)	2021-11-23 23:01:05 +01:00
ppo	[RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571)	2021-11-23 23:01:05 +01:00
qmix	[RLlib] Trainer sub-class QMIX/MAML/MB-MPO (instead of `build_trainer`). (#20639)	2021-12-02 13:17:10 +01:00
sac	[RLlib] Use SampleBrach instead of input dict whenever possible (#20746)	2021-12-02 13:11:26 +01:00
slateq	[RLlib] Replay buffer API (cleanups; docstrings; renames; move into `rllib/execution/buffers` dir) (#20552)	2021-11-19 11:57:37 +01:00
tests	[RLlib] Trainer sub-class for APPO (instead of using `build_trainer()`). (#20424)	2021-11-22 22:14:21 +01:00
__init__.py	[RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. (#15815)	2021-05-18 13:23:00 +02:00
callbacks.py	[RLlib] Add a comment in the doc string of `on_learn_on_batch` callback function. (#20456)	2021-11-19 10:49:07 +01:00
mock.py	[RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571)	2021-11-23 23:01:05 +01:00
registry.py	[RLlib] Trainer sub-class PPO/DDPPO (instead of `build_trainer()`). (#20571)	2021-11-23 23:01:05 +01:00
trainer.py	[RLlib] Trainer sub-class DDPG/TD3/APEX-DDPG (instead of `build_trainer`). (#20636)	2021-12-01 10:52:12 +01:00
trainer_template.py	[RLlib] Trainer sub-class for APPO (instead of using `build_trainer()`). (#20424)	2021-11-22 22:14:21 +01:00