ray/rllib/agents/dqn
Sven Mika d0fab84e4d
[RLlib] DDPG PyTorch version. (#7953)
The DDPG/TD3 algorithms currently do not have a PyTorch implementation. This PR adds PyTorch support for DDPG/TD3 to RLlib.
This PR:
- Depends on the refactor PR for DDPG (Functional Algorithm API).
- Adds learning regression tests for the PyTorch version of DDPG and a DDPG (torch)
- Updates the documentation to reflect that DDPG and TD3 now support PyTorch.

* Learning Pendulum-v0 on torch version (same config as tf). Wall time is a little slower (~20%) than tf.
* Fix GPU target model problem.
2020-04-16 10:20:01 +02:00
common [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
tests [RLlib] SAC Torch (incl. Atari learning) (#7984) 2020-04-15 13:25:16 +02:00
__init__.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
apex.py [rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958) 2020-04-10 00:56:08 -07:00
distributional_q_tf_model.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
dqn.py [RLlib] SAC Torch (incl. Atari learning) (#7984) 2020-04-15 13:25:16 +02:00
dqn_tf_policy.py [RLlib] SAC Torch (incl. Atari learning) (#7984) 2020-04-15 13:25:16 +02:00
dqn_torch_model.py [RLlib] DDPG PyTorch version. (#7953) 2020-04-16 10:20:01 +02:00
dqn_torch_policy.py [RLlib] DDPG PyTorch version. (#7953) 2020-04-16 10:20:01 +02:00
README.md [rllib] Try moving RLlib to top level dir (#5324) 2019-08-05 23:25:49 -07:00
simple_q.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
simple_q_model.py Remove future imports (#6724) 2020-01-09 00:15:48 -08:00
simple_q_tf_policy.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00
simple_q_torch_policy.py [RLlib] DQN torch version. (#7597) 2020-04-06 11:56:16 -07:00