ray/rllib/agents/ddpg
| Name | Last commit | Date |
| --- | --- | --- |
| `common` | [rllib] Try moving RLlib to top level dir (#5324) | 2019-08-05 23:25:49 -07:00 |
| `tests` | [RLlib] Cleanup/unify all test cases. (#7533) | 2020-03-11 20:39:47 -07:00 |
| `__init__.py` | [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) | 2020-03-01 11:53:35 -08:00 |
| `apex.py` | [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) | 2020-03-14 12:05:04 -07:00 |
| `ddpg.py` | [RLlib] Exploration API: ParamNoise Integration into DQN; working example/test cases. (#7814) | 2020-04-03 10:44:25 -07:00 |
| `ddpg_policy.py` | [RLlib] Remove all instances of tf.contrib.layers. ... from RLlib code (deprecated). (#7851) | 2020-04-01 18:03:14 -07:00 |
| `noop_model.py` | Remove future imports (#6724) | 2020-01-09 00:15:48 -08:00 |
| `README.md` | [rllib] Try moving RLlib to top level dir (#5324) | 2019-08-05 23:25:49 -07:00 |
| `td3.py` | [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) | 2020-03-01 11:53:35 -08:00 |

Implementation of Deep Deterministic Policy Gradient (DDPG, https://arxiv.org/abs/1509.02971), including an Ape-X variant (`apex.py`) and TD3 (`td3.py`).
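
For orientation, the two updates at the core of DDPG, as described in the linked paper, can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the code in `ddpg_policy.py`: the function names `td_target` and `soft_update` are hypothetical, and the `gamma` and `tau` values are chosen only for the example.

```python
import numpy as np


def td_target(reward, done, next_q, gamma=0.99):
    """Critic regression target: y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    `next_q` stands in for the target critic evaluated at the target
    actor's action in the next state (hypothetical input for this sketch).
    """
    return reward + gamma * (1.0 - done) * next_q


def soft_update(target_params, online_params, tau=0.005):
    """Polyak averaging: theta_target <- tau * theta + (1 - tau) * theta_target."""
    return [
        tau * w + (1.0 - tau) * w_t
        for w, w_t in zip(online_params, target_params)
    ]


# Toy batch of two transitions; the second one is terminal.
rewards = np.array([1.0, 0.0])
dones = np.array([0.0, 1.0])
next_q = np.array([2.0, 5.0])
targets = td_target(rewards, dones, next_q, gamma=0.9)

# Target weights move a fraction `tau` of the way toward the online weights.
new_target = soft_update([np.zeros(2)], [np.ones(2)], tau=0.1)
```

The terminal transition contributes only its reward to the target, because the `(1 - done)` factor masks the bootstrapped value. The soft update keeps the target networks slowly tracking the online networks, which the paper uses to stabilize learning.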