ray/rllib/agents/ddpg
mvindiola1 2b893d1bb5
fix incorrect critic loss in TD3 (#10775)
Co-authored-by: Manny Vindiola <manuel.m.vindiola.civ@mail.mil>
2020-09-20 20:01:51 -07:00
tests fix incorrect critic loss in TD3 (#10775) 2020-09-20 20:01:51 -07:00
__init__.py [RLlib] DDPG PyTorch version. (#7953) 2020-04-16 10:20:01 +02:00
apex.py [RLlib] DDPG and SAC eager support (preparation for tf2.x) (#9204) 2020-07-08 16:12:20 +02:00
ddpg.py [RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544) 2020-09-06 10:58:00 +02:00
ddpg_tf_model.py [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
ddpg_tf_policy.py fix incorrect critic loss in TD3 (#10775) 2020-09-20 20:01:51 -07:00
ddpg_torch_model.py [RLlib] SAC algo cleanup. (#10825) 2020-09-20 11:27:02 +02:00
ddpg_torch_policy.py fix incorrect critic loss in TD3 (#10775) 2020-09-20 20:01:51 -07:00
noop_model.py [RLlib] Tf2x preparation; part 2 (upgrading try_import_tf()). (#9136) 2020-06-30 10:13:20 +02:00
README.md [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
td3.py [RLlib] DDPG PyTorch version. (#7953) 2020-04-16 10:20:01 +02:00

Implementation of Deep Deterministic Policy Gradient (DDPG, https://arxiv.org/abs/1509.02971), including an Ape-X variant.
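
For orientation, the two update rules at the core of DDPG as described in the paper can be sketched in a few lines of plain Python. This is a minimal illustration, not the code in `ddpg_tf_policy.py` or `ddpg_torch_policy.py`: the function names `td_target` and `soft_update`, the default values of `gamma` and `tau`, and the use of flat lists of floats in place of network weights are all assumptions made for the example.

```python
from typing import List, Sequence


def td_target(reward: float, done: bool, next_q: float, gamma: float = 0.99) -> float:
    """Critic regression target: y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    `next_q` stands in for the target critic evaluated at the next state and
    the target actor's action (hypothetical input; no networks are built here).
    """
    return reward + gamma * (1.0 - float(done)) * next_q


def soft_update(target: Sequence[float], online: Sequence[float], tau: float = 0.005) -> List[float]:
    """Polyak averaging of target parameters: theta' <- tau * theta + (1 - tau) * theta'."""
    return [(1.0 - tau) * t + tau * o for t, o in zip(target, online)]


if __name__ == "__main__":
    # Non-terminal transition: the bootstrapped value is discounted and added.
    print(td_target(reward=1.0, done=False, next_q=2.0, gamma=0.5))
    # Terminal transition: the target is the reward alone.
    print(td_target(reward=1.0, done=True, next_q=2.0, gamma=0.5))
    # Target parameters move a fraction tau toward the online parameters.
    print(soft_update([0.0, 1.0], [1.0, 1.0], tau=0.5))
```

The critic is trained to regress onto `td_target`, while the target networks trail the online networks through `soft_update`; a small `tau` keeps the regression targets slowly moving.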