ray/rllib/agents
2020-08-06 10:54:20 +02:00
..
a3c [LINT] Except RLlib from checking for flake8 error F821 (#9946) 2020-08-06 10:44:37 +02:00
ars [RLlib] tf-eager support for ES and ARS (tf2.x preparation). (#9207) 2020-07-02 13:03:10 +02:00
ddpg [RLlib] Issue 9667 DDPG Torch bugs and enhancements. (#9680) 2020-07-28 14:15:03 +02:00
dqn [RLlib] Implement DQN PyTorch distributional head. (#9589) 2020-07-25 09:29:24 +02:00
dyna [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) 2020-06-25 19:01:32 +02:00
es [rllib] Fix torch TD error, IMPALA LR updates (#9477) 2020-07-23 12:50:25 -07:00
impala [rllib] Fix torch TD error, IMPALA LR updates (#9477) 2020-07-23 12:50:25 -07:00
maml [RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409) 2020-08-02 18:12:09 +02:00
marwil [RLlib] Issue 9402 MARWIL producing nan rewards. (#9429) 2020-07-14 05:07:16 +02:00
mbmpo [RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409) 2020-08-02 18:12:09 +02:00
pg [RLlib] Tf2.x native. (#8752) 2020-07-11 22:06:35 +02:00
ppo [RLlib] Trajectory view API - 02 actual API scaffold (#9753) 2020-08-06 10:54:20 +02:00
qmix [RLlib] Issue 8384: QMIX doesn't learn anything. (#9527) 2020-07-17 12:14:34 +02:00
sac [RLlib] Tf2.x native. (#8752) 2020-07-11 22:06:35 +02:00
__init__.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
agent.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
callbacks.py [rllib] Add type annotations for evaluation/, env/ packages (#9003) 2020-06-19 13:09:05 -07:00
mock.py [tune] Use public methods for trainable (#9184) 2020-07-01 11:00:00 -07:00
registry.py [RLlib] Implementation of "Model-based Meta Policy Optimization" (MB MPO) (#9409) 2020-08-02 18:12:09 +02:00
trainer.py [RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678) 2020-07-29 21:15:09 +02:00
trainer_template.py [rllib] Remove deprecated policy optimizer package. (#9262) 2020-07-02 14:39:40 -07:00