ray/rllib/agents/ddpg at 83e06cd30a45245c2cb0e9f4bd924224b1581554 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

Sven Mika 83e06cd30a [RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* Fix
* WIP.
* Add TD3 quick Pendulum regression.
* Cleanup.
* Fix.
* LINT.
* Fix.
* Sort quick_learning test cases, add TD3.
* Sort quick_learning test cases, add TD3.
* Revert test_checkpoint_restore.py (debugging) changes.
* Fix old soft_q settings in documentation and test configs.
* More doc fixes.
* Fix test case.
* Fix test case.
* Lower test load.
* WIP.
Name	Last commit	Date
common	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
tests	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
__init__.py	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
apex.py	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
ddpg.py	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
ddpg_policy.py	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
noop_model.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
README.md	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
td3.py	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00

README.md

Implementation of deep deterministic policy gradients (https://arxiv.org/abs/1509.02971), including an Ape-X variant.
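
For orientation, the two update rules at the core of DDPG as described in the linked paper can be sketched in a few lines of plain Python. This is a minimal illustration and not the code in `ddpg.py` or `ddpg_policy.py`: the function names `td_target` and `soft_update`, the flat-list weight representation, and the default values of `gamma` and `tau` are all assumptions made for the example.

```python
from typing import List


def td_target(reward: float, done: bool, next_q: float, gamma: float = 0.99) -> float:
    """Critic regression target: y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    `next_q` stands in for the target critic evaluated at the next state and
    the target actor's action (hypothetical input, computed elsewhere).
    """
    not_done = 0.0 if done else 1.0
    return reward + gamma * not_done * next_q


def soft_update(target: List[float], online: List[float], tau: float = 0.001) -> List[float]:
    """Polyak averaging of target weights: theta' <- tau * theta + (1 - tau) * theta'."""
    return [tau * w + (1.0 - tau) * w_t for w, w_t in zip(online, target)]


if __name__ == "__main__":
    # Non-terminal transition: the bootstrap term is included.
    print(td_target(reward=1.0, done=False, next_q=2.0, gamma=0.5))
    # Terminal transition: the target is just the reward.
    print(td_target(reward=1.0, done=True, next_q=2.0))
    # Target weights move halfway toward the online weights when tau = 0.5.
    print(soft_update(target=[0.0, 1.0], online=[1.0, 1.0], tau=0.5))
```

In a full agent the critic is trained to regress onto `td_target`, the actor is updated to maximize the critic's value of its own actions, and `soft_update` is applied to both target networks after each training step.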