ray/rllib/agents/ddpg at 5537fe13b097097668f9c08a00051e8b7a2d1980 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-08 19:41:38 -05:00

Sven Mika 5537fe13b0 [RLlib] Exploration API: ParamNoise Integration into DQN; working example/test cases. (#7814)		2020-04-03 10:44:25 -07:00
common	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
tests	[RLlib] Cleanup/unify all test cases. (#7533)	2020-03-11 20:39:47 -07:00
__init__.py	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
apex.py	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)	2020-03-14 12:05:04 -07:00
ddpg.py	[RLlib] Exploration API: ParamNoise Integration into DQN; working example/test cases. (#7814)	2020-04-03 10:44:25 -07:00
ddpg_policy.py	[RLlib] Remove all instances of tf.contrib.layers. ... from RLlib code (deprecated). (#7851)	2020-04-01 18:03:14 -07:00
noop_model.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
README.md	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
td3.py	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00

README.md

Implementation of Deep Deterministic Policy Gradient (DDPG, https://arxiv.org/abs/1509.02971), including an Ape-X variant.
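
For orientation, here is a minimal sketch of the two update rules at the core of DDPG as described in the linked paper: the critic's bootstrapped target and the soft (Polyak) update of the target networks. It is not taken from the code in this directory. The function names `td_target` and `soft_update`, the flat-list representation of weights, the default values of `gamma` and `tau`, and the `done` masking of terminal transitions are all assumptions made for illustration.

```python
from typing import List


def td_target(reward: float, next_q: float, done: bool, gamma: float = 0.99) -> float:
    """Critic regression target y = r + gamma * Q'(s', mu'(s')).

    `next_q` stands for the target critic's value of the target actor's
    action in the next state. Masking with `done` is a common convention
    (assumed here): no bootstrapping past a terminal transition.
    """
    return reward + (0.0 if done else gamma * next_q)


def soft_update(target: List[float], online: List[float], tau: float = 0.001) -> List[float]:
    """Polyak averaging: theta_target <- tau * theta + (1 - tau) * theta_target.

    Weights are modeled as flat lists of floats (hypothetical layout).
    """
    if len(target) != len(online):
        raise ValueError("target and online weights must have the same length")
    return [tau * w + (1.0 - tau) * w_t for w, w_t in zip(online, target)]


if __name__ == "__main__":
    # One non-terminal transition with reward 1.0 and next-state value 2.0.
    print(td_target(1.0, 2.0, done=False, gamma=0.5))
    # Move the target weights halfway toward the online weights.
    print(soft_update([0.0, 1.0], [1.0, 1.0], tau=0.5))
```

A small `tau` makes the target networks track the online networks slowly, which is what keeps the bootstrapped target from shifting too quickly during training.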