ray/rllib/agents/dqn at dbcad35022f68e3e1ac6b7e8551df115fc1c2b81 - hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Sven Mika d0fab84e4d [RLlib] DDPG PyTorch version. (#7953 ) The DDPG/TD3 algorithms currently do not have a PyTorch implementation. This PR adds PyTorch support for DDPG/TD3 to RLlib. This PR: - Depends on the re-factor PR for DDPG (Functional Algorithm API). - Adds learning regression tests for the PyTorch version of DDPG and a DDPG (torch) - Updates the documentation to reflect that DDPG and TD3 now support PyTorch. * Learning Pendulum-v0 on torch version (same config as tf). Wall time a little slower (~20% than tf). * Fix GPU target model problem.		2020-04-16 10:20:01 +02:00
..
common	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
tests	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
__init__.py	[RLlib] DQN torch version. (#7597 )	2020-04-06 11:56:16 -07:00
apex.py	[rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958 )	2020-04-10 00:56:08 -07:00
distributional_q_tf_model.py	[RLlib] DQN torch version. (#7597 )	2020-04-06 11:56:16 -07:00
dqn.py	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
dqn_tf_policy.py	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
dqn_torch_model.py	[RLlib] DDPG PyTorch version. (#7953 )	2020-04-16 10:20:01 +02:00
dqn_torch_policy.py	[RLlib] DDPG PyTorch version. (#7953 )	2020-04-16 10:20:01 +02:00
README.md	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
simple_q.py	[RLlib] DQN torch version. (#7597 )	2020-04-06 11:56:16 -07:00
simple_q_model.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
simple_q_tf_policy.py	[RLlib] DQN torch version. (#7597 )	2020-04-06 11:56:16 -07:00
simple_q_torch_policy.py	[RLlib] DQN torch version. (#7597 )	2020-04-06 11:56:16 -07:00

README.md

Code in this package is adapted from https://github.com/openai/baselines/tree/master/baselines/deepq.