mirror of
https://github.com/vale981/ray
synced 2025-03-08 11:31:40 -05:00
![]() * WIP. * Fixes. * LINT. * WIP. * WIP. * Fixes. * Fixes. * Fixes. * Fixes. * WIP. * Fixes. * Test * Fix. * Fixes and LINT. * Fixes and LINT. * LINT. |
||
---|---|---|
.. | ||
tests | ||
__init__.py | ||
README.md | ||
sac.py | ||
sac_tf_model.py | ||
sac_tf_policy.py | ||
sac_torch_model.py | ||
sac_torch_policy.py |
Implementation of the Soft Actor-Critic algorithm:
[1] Soft Actor-Critic Algorithms and Applications - T. Haarnoja, A. Zhou, K. Hartikainen, et al. https://arxiv.org/abs/1812.05905.pdf
For supporting discrete action spaces, we implemented this patch on top of the original algorithm: [2] Soft Actor-Critic for Discrete Action Settings - Petros Christodoulou https://arxiv.org/pdf/1910.07207v2.pdf