mirror of
https://github.com/vale981/ray
synced 2025-03-06 10:31:39 -05:00
Latest commit:

* Fix DDPG, since it is based on GenericOffPolicyTrainer.
* Fix QMix, SAC, and MADDPG too.
* Undo QMix change.
* Fix DQN input batch type. Always use SampleBatch.
* apex ddpg should not use replay_buffer_config yet.
* Make eager tf policy to use SampleBatch.
* lint
* LINT.
* Re-enable RLlib broken tests to make sure things work ok now.
* fixes.

Co-authored-by: sven1977 <svenmika1977@gmail.com>

Files:

* `__init__.py`
* `maddpg.py`
* `maddpg_policy.py`
* `README.md`

Implementation of MADDPG in RLlib
See the justinkterry/maddpg-rllib repository for more information.
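
As general background, and not taken from the code in this directory: MADDPG gives each agent a decentralized actor that sees only its own observation, and a centralized critic that sees every agent's observation and action during training. The sketch below only illustrates how those two inputs differ. The function names and the dictionary layout are hypothetical and do not correspond to RLlib's API.

```python
from typing import Dict, List

Vector = List[float]


def actor_input(agent_id: str, observations: Dict[str, Vector]) -> Vector:
    """Decentralized actor: only the agent's own observation is used."""
    return list(observations[agent_id])


def critic_input(observations: Dict[str, Vector],
                 actions: Dict[str, Vector]) -> Vector:
    """Centralized critic: all observations, then all actions, concatenated.

    Agents are visited in sorted order so the layout is deterministic.
    """
    joint: Vector = []
    for agent_id in sorted(observations):
        joint.extend(observations[agent_id])
    for agent_id in sorted(actions):
        joint.extend(actions[agent_id])
    return joint


# Hypothetical two-agent example.
observations = {"agent_0": [0.1, 0.2], "agent_1": [0.3, 0.4]}
actions = {"agent_0": [1.0], "agent_1": [-1.0]}

print(actor_input("agent_1", observations))
print(critic_input(observations, actions))
```

In this sketch the critic input grows with the number of agents, while each actor input stays the size of a single agent's observation.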