mirror of
https://github.com/vale981/ray
synced 2025-03-06 02:21:39 -05:00
Implementation of the Soft Actor-Critic algorithm:
[1] Soft Actor-Critic Algorithms and Applications - T. Haarnoja, A. Zhou, K. Hartikainen, et al. https://arxiv.org/abs/1812.05905.pdf
To support discrete action spaces, we implemented the following patch on top of the original algorithm:

[2] Soft Actor-Critic for Discrete Action Settings - Petros Christodoulou https://arxiv.org/pdf/1910.07207v2.pdf
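
As a rough illustration of what the discrete-action patch in [2] changes: with a finite action set, the expectation over actions can be computed exactly from the policy's action probabilities instead of being estimated from a sampled action. The sketch below shows this for a single state in plain Python. The function names (`log_softmax`, `discrete_sac_terms`) and the argument layout are hypothetical and chosen for illustration only; this is not the code in `sac_tf_policy.py` or `sac_torch_policy.py`.

```python
import math


def log_softmax(logits):
    # Numerically stable log-softmax over a list of action logits.
    m = max(logits)
    log_total = math.log(sum(math.exp(x - m) for x in logits))
    return [(x - m) - log_total for x in logits]


def discrete_sac_terms(logits, q1, q2, alpha):
    """Soft state value and actor loss for ONE state (illustrative sketch).

    logits: policy logits, one per discrete action (hypothetical input).
    q1, q2: per-action Q-value estimates from two critics.
    alpha:  entropy temperature.
    """
    log_probs = log_softmax(logits)
    probs = [math.exp(lp) for lp in log_probs]
    # Take the smaller of the two critic estimates for each action.
    q_min = [min(a, b) for a, b in zip(q1, q2)]
    # V(s) = sum_a pi(a|s) * (Q(s, a) - alpha * log pi(a|s))
    soft_value = sum(
        p * (q - alpha * lp) for p, lp, q in zip(probs, log_probs, q_min)
    )
    # Actor loss: sum_a pi(a|s) * (alpha * log pi(a|s) - Q(s, a))
    actor_loss = sum(
        p * (alpha * lp - q) for p, lp, q in zip(probs, log_probs, q_min)
    )
    return soft_value, actor_loss


# Example: two actions, uniform policy, critics that disagree.
value, loss = discrete_sac_terms([0.0, 0.0], [1.0, 3.0], [2.0, 2.0], alpha=1.0)
```

In this sketch the per-state actor loss is simply the negative of the soft state value. A real implementation would average over a batch of states and stop gradients through the critics.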