ray/rllib/agents/ddpg
2020-04-09 14:04:21 -07:00
| Name | Last commit | Date |
| --- | --- | --- |
| `common` | [rllib] Try moving RLlib to top level dir (#5324) | 2019-08-05 23:25:49 -07:00 |
| `tests` | [RLlib] DDPG re-factor to fit into RLlib's functional algorithm builder API. (#7934) | 2020-04-09 14:04:21 -07:00 |
| `__init__.py` | [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) | 2020-03-01 11:53:35 -08:00 |
| `apex.py` | [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) | 2020-03-14 12:05:04 -07:00 |
| `ddpg.py` | [RLlib] DDPG re-factor to fit into RLlib's functional algorithm builder API. (#7934) | 2020-04-09 14:04:21 -07:00 |
| `ddpg_model.py` | [RLlib] DDPG re-factor to fit into RLlib's functional algorithm builder API. (#7934) | 2020-04-09 14:04:21 -07:00 |
| `ddpg_policy.py` | [RLlib] DDPG re-factor to fit into RLlib's functional algorithm builder API. (#7934) | 2020-04-09 14:04:21 -07:00 |
| `noop_model.py` | Remove future imports (#6724) | 2020-01-09 00:15:48 -08:00 |
| `README.md` | [rllib] Try moving RLlib to top level dir (#5324) | 2019-08-05 23:25:49 -07:00 |
| `td3.py` | [RLlib] DQN torch version. (#7597) | 2020-04-06 11:56:16 -07:00 |

Implementation of Deep Deterministic Policy Gradient (DDPG, https://arxiv.org/abs/1509.02971), including an Ape-X variant (see `apex.py`).
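
For readers unfamiliar with the algorithm, the two update rules at the core of DDPG, as described in the linked paper, can be sketched in a few lines of plain Python. This is an illustration only and is not taken from this directory's code: the function names `td_target` and `soft_update` are hypothetical, weights are represented as flat lists of floats for simplicity, and the default values of `gamma` and `tau` are illustrative rather than RLlib's configured defaults.

```python
from typing import List


def td_target(reward: float, done: bool, next_q: float, gamma: float = 0.99) -> float:
    """One-step critic target: y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    `next_q` stands for the target critic's value of the target actor's
    action in the next state (hypothetical input, computed elsewhere).
    """
    not_done = 0.0 if done else 1.0
    return reward + gamma * not_done * next_q


def soft_update(target: List[float], online: List[float], tau: float = 0.001) -> List[float]:
    """Polyak averaging: theta_target <- tau * theta + (1 - tau) * theta_target."""
    if len(target) != len(online):
        raise ValueError("target and online weights must have the same length")
    return [tau * w + (1.0 - tau) * w_t for w, w_t in zip(online, target)]


if __name__ == "__main__":
    # Non-terminal transition: the bootstrap term is included.
    print(td_target(reward=1.0, done=False, next_q=2.0, gamma=0.5))
    # Terminal transition: the target is just the reward.
    print(td_target(reward=1.0, done=True, next_q=2.0, gamma=0.5))
    # Target weights move halfway toward the online weights when tau = 0.5.
    print(soft_update([0.0, 1.0], [1.0, 1.0], tau=0.5))
```

A small `tau` makes the target networks track the online networks slowly, which keeps the critic's regression target from shifting too quickly between updates.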