ray/rllib/agents
2021-02-25 14:27:35 -08:00
..
a3c [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
ars [RLLib] Readme.md Documentation for Almost All Algorithms in rllib/agents (#13035) 2020-12-29 18:45:55 -05:00
cql [RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243) 2021-02-22 17:30:18 +01:00
ddpg [RLlib] DDPG: Support simplex action space. (#14011) 2021-02-10 15:10:01 +01:00
dqn [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
dreamer [RLlib] Dreamer: Fix broken import and add compilation test case. (#13553) 2021-01-21 16:30:26 +01:00
es [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00
impala Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289)" (#14360) 2021-02-25 14:27:35 -08:00
maml [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
marwil [RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584) 2021-02-08 15:02:19 +01:00
mbmpo [RLlib] Issue #13507: Fix MB-MPO CartPole Env's reward function as well as MB-MPO running into a traj. view API related issue. (#14037) 2021-02-11 18:58:46 +01:00
pg [RLlib] JAXPolicy prep. PR #1. (#13077) 2020-12-26 20:14:18 -05:00
ppo [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
qmix [RLlib] Env directory cleanup and tests. (#13082) 2021-01-19 10:09:39 +01:00
sac Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289)" (#14360) 2021-02-25 14:27:35 -08:00
slateq [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
__init__.py [RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033) 2020-10-06 20:28:16 +02:00
callbacks.py [RLlib] Extend on_learn_on_batch callback to allow for custom metrics to be added. (#13584) 2021-02-08 15:02:19 +01:00
mock.py [RLlib] Allow rllib rollout to run distributed via evaluation workers. (#13718) 2021-02-08 12:05:16 +01:00
registry.py [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
trainer.py Revert "[RLlib + Tune] Add placement group support to RLlib. (#14289)" (#14360) 2021-02-25 14:27:35 -08:00
trainer_template.py [RLlib] move evaluation to trainer.step() such that the result is properly logged (#12708) 2021-01-25 12:56:00 +01:00