ray/rllib/contrib/maddpg
Files in this directory:

__init__.py
maddpg.py
maddpg_policy.py
README.md

Implementation of Multi-Agent Deep Deterministic Policy Gradient (MADDPG) in RLlib.

Please see justinkterry/maddpg-rllib for more information.
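For quick reference, below is a minimal, hypothetical sketch of launching the contrib MADDPG trainer through Ray Tune. The environment name "my_multiagent_env", the agent IDs, and the observation/action spaces are placeholders that are not part of this repository; the registered trainer name "contrib/MADDPG" and the per-policy "agent_id" setting are assumptions based on how the contrib trainers are typically configured, so adapt them to your own setup.

```python
# Minimal sketch (assumed config, not an official example from this repo).
import ray
from ray import tune
from gym.spaces import Box, Discrete

if __name__ == "__main__":
    ray.init()

    # Per-agent spaces; placeholders for whatever your env actually exposes.
    obs_space = Box(-1.0, 1.0, (4,))
    act_space = Discrete(2)

    tune.run(
        "contrib/MADDPG",  # assumed registered name of the contrib trainer
        stop={"timesteps_total": 100_000},
        config={
            "env": "my_multiagent_env",  # hypothetical pre-registered env
            "framework": "tf",
            "multiagent": {
                # One policy per agent; the fourth tuple element is the
                # per-policy config, which carries the agent's index as
                # "agent_id" (assumed requirement of the contrib trainer).
                "policies": {
                    "agent_0": (None, obs_space, act_space, {"agent_id": 0}),
                    "agent_1": (None, obs_space, act_space, {"agent_id": 1}),
                },
                # Map each env agent ID to the policy of the same name.
                "policy_mapping_fn": lambda agent_id, *args, **kwargs: agent_id,
            },
        },
    )
```

The justinkterry/maddpg-rllib repository linked above contains maintained, end-to-end examples and is the better starting point for real experiments.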