ray/rllib/policy at 584645cc7da2bfd7d341d52b59c9c8561dbd119b - hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Sven Mika 83e06cd30a [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) * WIP. * WIP. * WIP. * WIP. * WIP. * Fix * WIP. * Add TD3 quick Pendulum regression. * Cleanup. * Fix. * LINT. * Fix. * Sort quick_learning test cases, add TD3. * Sort quick_learning test cases, add TD3. * Revert test_checkpoint_restore.py (debugging) changes. * Fix old soft_q settings in documentation and test configs. * More doc fixes. * Fix test case. * Fix test case. * Lower test load. * WIP.		2020-03-01 11:53:35 -08:00
tests	[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)	2020-02-22 14:19:49 -08:00
__init__.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
dynamic_tf_policy.py	[Core/RLlib] Move `log_once` from rllib to ray.util. (#7273)	2020-02-27 10:40:44 -08:00
eager_tf_policy.py	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
policy.py	[rllib] Fix multiagent example crash due to undefined abstract method (#7329)	2020-02-26 22:54:40 -08:00
rnn_sequencing.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
sample_batch.py	Remove future imports (#6724)	2020-01-09 00:15:48 -08:00
tf_policy.py	[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)	2020-03-01 11:53:35 -08:00
tf_policy_template.py	[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)	2020-02-22 14:19:49 -08:00
torch_policy.py	[RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124)	2020-02-22 14:19:49 -08:00
torch_policy_template.py	[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238)	2020-02-22 11:02:31 -08:00