ray/rllib/agents
2021-07-13 09:57:15 -07:00
a3c [RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup (#14707) 2021-05-19 16:32:29 +02:00
ars [RLlib] Multi-GPU support for Torch algorithms. (#14709) 2021-04-16 09:16:24 +02:00
cql [rllib] Enhancements to Input API for customizing offline datasets (#16957) 2021-07-10 15:05:25 -07:00
ddpg Revert "[RLlib] Fix bug in policy.py: normalize_actions=True has to call unsquash_action, not normalize_action." (#17002) 2021-07-12 11:09:14 -07:00
dqn Revert "[RLlib] Fix bug in policy.py: normalize_actions=True has to call unsquash_action, not normalize_action." (#17002) 2021-07-12 11:09:14 -07:00
dreamer [CI] Upgrade flake8 to 3.9.1 (#15527) 2021-05-03 14:23:28 -07:00
es [RLlib] Multi-GPU support for Torch algorithms. (#14709) 2021-04-16 09:16:24 +02:00
impala [RLlib] Issue 15973: Trainer.with_updates(validate_config=...) behaves confusingly. (#16429) 2021-06-19 22:42:00 +02:00
maml [RLlib] CQL TensorFlow support (#15841) 2021-05-18 11:10:46 +02:00
marwil [RLlib] MARWIL + BC: Various fixes and enhancements. (#16218) 2021-06-03 22:29:00 +02:00
mbmpo [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) 2021-03-08 15:41:27 +01:00
pg [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) 2021-04-27 10:44:54 +02:00
ppo Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565)" (#17036) 2021-07-13 09:57:15 -07:00
qmix [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 2021-06-30 12:32:11 +02:00
sac Revert "[RLlib] Fix bug in policy.py: normalize_actions=True has to call unsquash_action, not normalize_action." (#17002) 2021-07-12 11:09:14 -07:00
slateq [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) 2021-03-08 15:41:27 +01:00
tests [RLlib] Trainer.add_policy() not working for tf, if added policy is trained afterwards. (#16927) 2021-07-11 23:41:38 +02:00
__init__.py [RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. (#15815) 2021-05-18 13:23:00 +02:00
callbacks.py [RLlib] Memory leak docs (#15908) 2021-06-10 18:10:21 +02:00
mock.py Auto report object store memory usage; remove some deprecated code (#14260) 2021-03-01 13:19:44 -08:00
registry.py [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
trainer.py Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565)" (#17036) 2021-07-13 09:57:15 -07:00
trainer_template.py [RLlib] Issue 15973: Trainer.with_updates(validate_config=...) behaves confusingly. (#16429) 2021-06-19 22:42:00 +02:00