ray/rllib/policy
Sven Mika 83e06cd30a
[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)
* WIP.

* WIP.

* WIP.

* WIP.

* WIP.

* Fix

* WIP.

* Add TD3 quick Pendulum regression.

* Cleanup.

* Fix.

* LINT.

* Fix.

* Sort quick_learning test cases, add TD3.

* Sort quick_learning test cases, add TD3.

* Revert test_checkpoint_restore.py (debugging) changes.

* Fix old soft_q settings in documentation and test configs.

* More doc fixes.

* Fix test case.

* Fix test case.

* Lower test load.

* WIP.
2020-03-01 11:53:35 -08:00
Name | Last commit | Date
tests | [RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124) | 2020-02-22 14:19:49 -08:00
__init__.py | Remove future imports (#6724) | 2020-01-09 00:15:48 -08:00
dynamic_tf_policy.py | [Core/RLlib] Move log_once from rllib to ray.util. (#7273) | 2020-02-27 10:40:44 -08:00
eager_tf_policy.py | [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) | 2020-03-01 11:53:35 -08:00
policy.py | [rllib] Fix multiagent example crash due to undefined abstract method (#7329) | 2020-02-26 22:54:40 -08:00
rnn_sequencing.py | Remove future imports (#6724) | 2020-01-09 00:15:48 -08:00
sample_batch.py | Remove future imports (#6724) | 2020-01-09 00:15:48 -08:00
tf_policy.py | [RLlib] DDPG refactor and Exploration API action noise classes. (#7314) | 2020-03-01 11:53:35 -08:00
tf_policy_template.py | [RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124) | 2020-02-22 14:19:49 -08:00
torch_policy.py | [RLlib] Policy.compute_log_likelihoods() and SAC refactor. (issue #7107) (#7124) | 2020-02-22 14:19:49 -08:00
torch_policy_template.py | [RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238) | 2020-02-22 11:02:31 -08:00