ray/rllib/agents/ddpg at 407a3523f367a1e2f124b4bdfb1aef3a2d4340a7 - hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Sven Mika 19c8033df2 [RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366 ) * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * LINT and fixes. MB-MPO and MAML not working yet. * wip * update * update * rmeove * remove dep * higher * Update requirements_rllib.txt * Update requirements_rllib.txt * relpos * no mbmpo Co-authored-by: Eric Liang <ekhliang@gmail.com>		2020-12-01 17:41:10 -08:00
..
tests	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033 )	2020-10-06 20:28:16 +02:00
__init__.py	[RLlib] DDPG PyTorch version. (#7953 )	2020-04-16 10:20:01 +02:00
apex.py	[RLlib] Fix all example scripts to run on GPUs. (#11105 )	2020-10-02 23:07:44 +02:00
ddpg.py	[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366 )	2020-12-01 17:41:10 -08:00
ddpg_tf_model.py	[RLlib] SAC algo cleanup. (#10825 )	2020-09-20 11:27:02 +02:00
ddpg_tf_policy.py	[RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945 )	2020-11-29 12:31:24 +01:00
ddpg_torch_model.py	[RLlib] MB-MPO cleanup (comments, docstrings, type annotations). (#11033 )	2020-10-06 20:28:16 +02:00
ddpg_torch_policy.py	[RLlib] Attention Net prep PR #1 : Smaller cleanups. (#12447 )	2020-11-27 16:25:47 -08:00
noop_model.py	[RLlib] Tf2x preparation; part 2 (upgrading `try_import_tf()`). (#9136 )	2020-06-30 10:13:20 +02:00
README.md	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
td3.py	[RLlib] DDPG PyTorch version. (#7953 )	2020-04-16 10:20:01 +02:00

README.md

Implementation of deep deterministic policy gradients (https://arxiv.org/abs/1509.02971), including an Ape-X variant.