mirror of
https://github.com/vale981/ray
synced 2025-03-08 19:41:38 -05:00
![]() * WIP. * WIP. * WIP. * WIP. * WIP. * Fix * WIP. * Add TD3 quick Pendulum regresison. * Cleanup. * Fix. * LINT. * Fix. * Sort quick_learning test cases, add TD3. * Sort quick_learning test cases, add TD3. * Revert test_checkpoint_restore.py (debugging) changes. * Fix old soft_q settings in documentation and test configs. * More doc fixes. * Fix test case. * Fix test case. * Lower test load. * WIP. |
||
---|---|---|
.. | ||
common | ||
tests | ||
__init__.py | ||
apex.py | ||
ddpg.py | ||
ddpg_policy.py | ||
noop_model.py | ||
README.md | ||
td3.py |
Implementation of deep deterministic policy gradients (https://arxiv.org/abs/1509.02971), including an Ape-X variant.