mirror of
https://github.com/vale981/ray
synced 2025-03-06 18:41:40 -05:00
![]() * Fix DDPG, since it is based on GenericOffPolicyTrainer. * Fix QMix, SAC, and MADDPA too. * Undo QMix change. * Fix DQN input batch type. Always use SampleBatch. * apex ddpg should not use replay_buffer_config yet. * Make eager tf policy to use SampleBatch. * lint * LINT. * Re-enable RLlib broken tests to make sure things work ok now. * fixes. Co-authored-by: sven1977 <svenmika1977@gmail.com> |
||
---|---|---|
.. | ||
tests | ||
__init__.py | ||
mixers.py | ||
model.py | ||
qmix.py | ||
qmix_policy.py | ||
README.md |
Code in this package is adapted from https://github.com/oxwhirl/pymarl.