ray/rllib/agents/sac
Sven Mika 1d4823c0ec
[RLlib] Add testing framework_iterator. (#7852)
* Add testing framework_iterator.

* LINT.

* WIP.

* Fix and LINT.

* LINT fix.
2020-04-03 12:24:25 -07:00
..
tests [RLlib] Add testing framework_iterator. (#7852) 2020-04-03 12:24:25 -07:00
__init__.py [RLlib] SAC refactor with new SquashedGaussian distribution class. (#7272) 2020-02-23 16:10:20 -08:00
README.md [RLlib] SAC add discrete action support. (#7320) 2020-03-06 10:37:12 -08:00
sac.py [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
sac_model.py [rllib] Add back get_policy_output method for SAC model (#7604) 2020-03-20 12:44:04 -07:00
sac_policy.py [RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798) 2020-04-01 00:43:21 -07:00

Implementation of the Soft Actor-Critic algorithm:

[1] Soft Actor-Critic Algorithms and Applications - T. Haarnoja, A. Zhou, K. Hartikainen, et. al https://arxiv.org/abs/1812.05905.pdf

For supporting discrete action spaces, we implemented this patch on top of the original algorithm: [2] Soft Actor-Critic for Discrete Action Settings - Petros Christodoulou https://arxiv.org/pdf/1910.07207v2.pdf