ray/rllib/agents
2020-05-26 11:10:27 +02:00
..
a3c [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
ars [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
ddpg [RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414) 2020-05-26 11:10:27 +02:00
dqn [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
es [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
impala [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
marwil [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
pg [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
ppo Fix missing learning rate and entropy coeff schedule for torch PPO (#8572) 2020-05-23 10:54:18 -07:00
qmix [RLlib] Add QMIX support for complex obs spaces (Issue 8523). (#8533) 2020-05-22 10:17:51 +02:00
sac [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
__init__.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
agent.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
callbacks.py [rllib] observation function api for multi-agent (#8236) 2020-05-04 22:13:49 -07:00
mock.py [RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304) 2020-05-04 23:53:38 +02:00
registry.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
trainer.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
trainer_template.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00