ray/rllib/agents
2020-10-02 23:07:44 +02:00
..
a3c [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) 2020-10-01 16:57:10 +02:00
ars [RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544) 2020-09-06 10:58:00 +02:00
ddpg [RLlib] Fix all example scripts to run on GPUs. (#11105) 2020-10-02 23:07:44 +02:00
dqn [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
dreamer [RLlib] Dreamer (#10172) 2020-08-26 13:24:05 +02:00
es [rllib] Fix torch TD error, IMPALA LR updates (#9477) 2020-07-23 12:50:25 -07:00
impala [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
maml Cartpole MAML + Discrete (#11028) 2020-10-02 12:56:34 +02:00
marwil add large data warning (#10957) 2020-09-23 15:46:06 -07:00
mbmpo [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
pg [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) 2020-10-01 16:57:10 +02:00
ppo [RLlib] Fix all example scripts to run on GPUs. (#11105) 2020-10-02 23:07:44 +02:00
qmix add large data warning (#10957) 2020-09-23 15:46:06 -07:00
sac [RLlib] Fix all example scripts to run on GPUs. (#11105) 2020-10-02 23:07:44 +02:00
__init__.py [RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544) 2020-09-06 10:58:00 +02:00
callbacks.py [RLlib] Issue 10469: Callbacks should receive env idx ... (#10477) 2020-09-03 17:27:05 +02:00
mock.py [tune] Use public methods for trainable (#9184) 2020-07-01 11:00:00 -07:00
registry.py [RLlib] Behavioral Cloning (from MARWIL). (#10619) 2020-09-09 17:33:21 +02:00
trainer.py [RLlib] Trajectory view API: Simple List Collector (on by default for PPO); LSTM-agnostic (#11056) 2020-10-01 16:57:10 +02:00
trainer_template.py [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00