ray/rllib/agents
2020-06-16 09:01:20 +02:00
..
a3c [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
ars [RLlib] Add testing Policy.compute_single_action() for all agents. (#8903) 2020-06-13 17:51:50 +02:00
ddpg [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
dqn [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
dyna [RLlib] Prototype of a DynaTrainer (for env dynamics learning in upcoming MBMPO algo). (#8860) 2020-06-16 09:01:20 +02:00
es [RLlib] Add testing Policy.compute_single_action() for all agents. (#8903) 2020-06-13 17:51:50 +02:00
impala [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
marwil [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
pg [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
ppo [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
qmix [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
sac [RLlib] Minor rllib.utils cleanup. (#8932) 2020-06-16 08:52:20 +02:00
__init__.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
agent.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
callbacks.py [rllib] observation function api for multi-agent (#8236) 2020-05-04 22:13:49 -07:00
mock.py [RLlib] Fix broken tune tests in master due to framework=auto errors. (#8672) 2020-05-29 11:55:47 +02:00
registry.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
trainer.py [rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893) 2020-06-12 20:17:27 -07:00
trainer_template.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00