mirror of
https://github.com/vale981/ray
synced 2025-03-06 10:31:39 -05:00
![]() * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * LINT and fixes. MB-MPO and MAML not working yet. * wip * update * update * rmeove * remove dep * higher * Update requirements_rllib.txt * Update requirements_rllib.txt * relpos * no mbmpo Co-authored-by: Eric Liang <ekhliang@gmail.com> |
||
---|---|---|
.. | ||
tests | ||
__init__.py | ||
apex.py | ||
ddpg.py | ||
ddpg_tf_model.py | ||
ddpg_tf_policy.py | ||
ddpg_torch_model.py | ||
ddpg_torch_policy.py | ||
noop_model.py | ||
README.md | ||
td3.py |
Implementation of deep deterministic policy gradients (https://arxiv.org/abs/1509.02971), including an Ape-X variant.