ray/rllib/agents/qmix
gjoliver d81885c1f1
[RLlib] Fix all the CI tests that were broken by is_training and replay buffer changes; re-comment-in the failing RLlib tests (#19809)
* Fix DDPG, since it is based on GenericOffPolicyTrainer.

* Fix QMix, SAC, and MADDPA too.

* Undo QMix change.

* Fix DQN input batch type. Always use SampleBatch.

* apex ddpg should not use replay_buffer_config yet.

* Make eager tf policy to use SampleBatch.

* lint

* LINT.

* Re-enable RLlib broken tests to make sure things work ok now.

* fixes.

Co-authored-by: sven1977 <svenmika1977@gmail.com>
2021-10-28 18:06:47 +02:00
..
tests [RLlib] Issue 15973: Trainer.with_updates(validate_config=...) behaves confusingly. (#16429) 2021-06-19 22:42:00 +02:00
__init__.py [rllib] Deprecate policy optimizers (#8345) 2020-05-21 10:16:18 -07:00
mixers.py [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
model.py [RLlib] Attention Net prep PR #3. (#12450) 2020-12-07 13:08:17 +01:00
qmix.py [RLlib] Fix all the CI tests that were broken by is_training and replay buffer changes; re-comment-in the failing RLlib tests (#19809) 2021-10-28 18:06:47 +02:00
qmix_policy.py [RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879) 2021-09-30 16:39:05 +02:00
README.md [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00

Code in this package is adapted from https://github.com/oxwhirl/pymarl.