ray/rllib/agents/sac/README.md

8 lines
399 B
Markdown

Implementation of the Soft Actor-Critic algorithm:
[1] Soft Actor-Critic Algorithms and Applications - T. Haarnoja, A. Zhou, K. Hartikainen, et al.
https://arxiv.org/abs/1812.05905.pdf
For supporting discrete action spaces, we implemented this patch on top of the original algorithm:
[2] Soft Actor-Critic for Discrete Action Settings - Petros Christodoulou
https://arxiv.org/pdf/1910.07207v2.pdf