ray/rllib/agents
Eric Liang 5acd3e66dd
[rllib] Fix torch TD error, IMPALA LR updates (#9477)
* update

* add test

* lint

* fix super call

* speed es test up
2020-07-23 12:50:25 -07:00
a3c [RLlib] Tf2.x native. (#8752) 2020-07-11 22:06:35 +02:00
ars [RLlib] tf-eager support for ES and ARS (tf2.x preparation). (#9207) 2020-07-02 13:03:10 +02:00
ddpg [RLlib] Tf2.x native. (#8752) 2020-07-11 22:06:35 +02:00
dqn [rllib] Fix torch TD error, IMPALA LR updates (#9477) 2020-07-23 12:50:25 -07:00
dyna [RLlib] Minor cleanup in preparation to tf2.x support. (#9130) 2020-06-25 19:01:32 +02:00
es [rllib] Fix torch TD error, IMPALA LR updates (#9477) 2020-07-23 12:50:25 -07:00
impala [rllib] Fix torch TD error, IMPALA LR updates (#9477) 2020-07-23 12:50:25 -07:00
maml [rllib] MAML Transform (#9463) 2020-07-16 11:11:33 -07:00
marwil [RLlib] Issue 9402 MARWIL producing nan rewards. (#9429) 2020-07-14 05:07:16 +02:00
pg [RLlib] Tf2.x native. (#8752) 2020-07-11 22:06:35 +02:00
ppo [RLlib] Issue #9437 (PyTorch converts to CPU tensor, even if on GPU). (#9497) 2020-07-16 14:55:50 +02:00
qmix [RLlib] Issue 8384: QMIX doesn't learn anything. (#9527) 2020-07-17 12:14:34 +02:00
sac [RLlib] Tf2.x native. (#8752) 2020-07-11 22:06:35 +02:00
__init__.py [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
agent.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
callbacks.py [rllib] Add type annotations for evaluation/, env/ packages (#9003) 2020-06-19 13:09:05 -07:00
mock.py [tune] Use public methods for trainable (#9184) 2020-07-01 11:00:00 -07:00
registry.py [rllib] MAML Agent (#8862) 2020-06-23 09:48:23 -07:00
trainer.py [RLlib] Layout of Trajectory View API (new class: Trajectory; not used yet). (#9269) 2020-07-14 04:27:49 +02:00
trainer_template.py [rllib] Remove deprecated policy optimizer package. (#9262) 2020-07-02 14:39:40 -07:00