ray/rllib/agents/ddpg at 2b893d1bb5be8f8db87f732bb73c3f7cab425395

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00


mvindiola1 2b893d1bb5 fix incorrect critic loss in TD3 (#10775) Co-authored-by: Manny Vindiola <manuel.m.vindiola.civ@mail.mil>	2020-09-20 20:01:51 -07:00
Name	Last commit	Date
tests	fix incorrect critic loss in TD3 (#10775)	2020-09-20 20:01:51 -07:00
__init__.py	[RLlib] DDPG PyTorch version. (#7953)	2020-04-16 10:20:01 +02:00
apex.py	[RLlib] DDPG and SAC eager support (preparation for tf2.x) (#9204)	2020-07-08 16:12:20 +02:00
ddpg.py	[RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544)	2020-09-06 10:58:00 +02:00
ddpg_tf_model.py	[RLlib] SAC algo cleanup. (#10825)	2020-09-20 11:27:02 +02:00
ddpg_tf_policy.py	fix incorrect critic loss in TD3 (#10775)	2020-09-20 20:01:51 -07:00
ddpg_torch_model.py	[RLlib] SAC algo cleanup. (#10825)	2020-09-20 11:27:02 +02:00
ddpg_torch_policy.py	fix incorrect critic loss in TD3 (#10775)	2020-09-20 20:01:51 -07:00
noop_model.py	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136)	2020-06-30 10:13:20 +02:00
README.md	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
td3.py	[RLlib] DDPG PyTorch version. (#7953)	2020-04-16 10:20:01 +02:00

README.md

Implementation of deep deterministic policy gradients (https://arxiv.org/abs/1509.02971), including an Ape-X variant.
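
For orientation, the two core computations of DDPG as described in the linked paper can be sketched in plain Python. This is an illustrative sketch only, not the RLlib code in this directory: the function names `td_target` and `soft_update`, their signatures, and the default values of `gamma` and `tau` are assumptions chosen for the example.

```python
def td_target(reward, done, next_q, gamma=0.99):
    """One-step TD target for the critic: y = r + gamma * (1 - done) * Q'(s', mu'(s')).

    `next_q` stands for the target critic's value at the next state, evaluated
    at the target actor's action (hypothetical scalar input for this sketch).
    """
    return reward + gamma * (1.0 - float(done)) * next_q


def soft_update(target_params, online_params, tau=0.005):
    """Polyak averaging of target weights: theta' <- tau * theta + (1 - tau) * theta'."""
    return [
        tau * w + (1.0 - tau) * w_target
        for w, w_target in zip(online_params, target_params)
    ]


if __name__ == "__main__":
    # Terminal transitions drop the bootstrapped term.
    print(td_target(reward=1.0, done=True, next_q=2.0, gamma=0.5))
    # Target weights move a fraction tau toward the online weights.
    print(soft_update([0.0, 1.0], [1.0, 1.0], tau=0.5))
```

The critic is trained to regress its Q-value toward `td_target`, while the slowly moving target networks produced by `soft_update` keep that regression target stable.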