ray/rllib/utils/exploration at 3812bfedda7c10bd8e5ead343e02577bfe159728 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

History

Sven Mika d0fab84e4d [RLlib] DDPG PyTorch version. (#7953 ) The DDPG/TD3 algorithms currently do not have a PyTorch implementation. This PR adds PyTorch support for DDPG/TD3 to RLlib. This PR: - Depends on the re-factor PR for DDPG (Functional Algorithm API). - Adds learning regression tests for the PyTorch version of DDPG and a DDPG (torch) - Updates the documentation to reflect that DDPG and TD3 now support PyTorch. * Learning Pendulum-v0 on torch version (same config as tf). Wall time a little slower (~20% than tf). * Fix GPU target model problem.		2020-04-16 10:20:01 +02:00
..
tests	[RLlib] DDPG PyTorch version. (#7953 )	2020-04-16 10:20:01 +02:00
__init__.py	[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798 )	2020-04-01 00:43:21 -07:00
epsilon_greedy.py	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
exploration.py	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
gaussian_noise.py	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
ornstein_uhlenbeck_noise.py	[RLlib] DDPG PyTorch version. (#7953 )	2020-04-16 10:20:01 +02:00
parameter_noise.py	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
per_worker_epsilon_greedy.py	[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798 )	2020-04-01 00:43:21 -07:00
per_worker_gaussian_noise.py	[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798 )	2020-04-01 00:43:21 -07:00
per_worker_ornstein_uhlenbeck_noise.py	[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798 )	2020-04-01 00:43:21 -07:00
random.py	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
soft_q.py	[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798 )	2020-04-01 00:43:21 -07:00
stochastic_sampling.py	[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798 )	2020-04-01 00:43:21 -07:00