ray/rllib/agents
2021-07-22 10:59:13 -04:00
..
a3c [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 2021-07-20 14:58:13 -04:00
ars [RLlib] Fix seeding for ES and ARS. (#16744) 2021-07-19 13:13:05 -04:00
cql [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 2021-07-20 14:58:13 -04:00
ddpg [RLlib] Redo fix bug normalize vs unsquash actions (original PR made log-likelihood test flakey). (#17014) 2021-07-13 14:01:30 -04:00
dqn [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 2021-07-20 14:58:13 -04:00
dreamer [CI] Upgrade flake8 to 3.9.1 (#15527) 2021-05-03 14:23:28 -07:00
es [RLlib] Fix seeding for ES and ARS. (#16744) 2021-07-19 13:13:05 -04:00
impala [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 2021-07-20 14:58:13 -04:00
maml [RLlib] CQL TensorFlow support (#15841) 2021-05-18 11:10:46 +02:00
marwil [RLlib] MARWIL + BC: Various fixes and enhancements. (#16218) 2021-06-03 22:29:00 +02:00
mbmpo [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) 2021-03-08 15:41:27 +01:00
pg [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 2021-07-20 14:58:13 -04:00
ppo [RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 2021-07-20 14:58:13 -04:00
qmix [RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 2021-06-30 12:32:11 +02:00
sac [rllib] Add merge_trainer_config arguments to trainer template (#17160) 2021-07-21 15:43:06 -07:00
slateq [RLlib] Multi-GPU for tf-DQN/PG/A2C. (#13393) 2021-03-08 15:41:27 +01:00
tests [RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031) 2021-07-19 13:16:03 -04:00
__init__.py [RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. (#15815) 2021-05-18 13:23:00 +02:00
callbacks.py Retry remove gpustat dependency (#17115) 2021-07-19 11:14:10 -07:00
mock.py Auto report object store memory usage; remove some deprecated code (#14260) 2021-03-01 13:19:44 -08:00
registry.py [RLlib] R2D2 Implementation. (#13933) 2021-02-25 12:18:11 +01:00
trainer.py [RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077) 2021-07-22 10:59:13 -04:00
trainer_template.py [rllib] Add merge_trainer_config arguments to trainer template (#17160) 2021-07-21 15:43:06 -07:00