Implementation of the Soft Actor-Critic (SAC) algorithm:

[1] Soft Actor-Critic Algorithms and Applications - T. Haarnoja, A. Zhou, K. Hartikainen, et al.
https://arxiv.org/abs/1812.05905.pdf

To support discrete action spaces, we implemented the following patch on top of the original algorithm:

[2] Soft Actor-Critic for Discrete Action Settings - Petros Christodoulou
https://arxiv.org/pdf/1910.07207v2.pdf
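As a rough illustration of the idea behind [2]: with a finite action set, the policy can output a full probability vector, so the soft state value can be computed as an exact expectation over actions instead of being estimated from a sampled action. The sketch below is a minimal, framework-free example of that computation. It is not the code in this directory, and the names `log_softmax`, `discrete_soft_value`, `q_values`, `logits`, and `alpha` are hypothetical, chosen only for illustration.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of action logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def discrete_soft_value(q_values, logits, alpha):
    """Soft state value for a discrete action space (illustrative sketch).

    Computes V(s) = sum_a pi(a|s) * (Q(s, a) - alpha * log pi(a|s)),
    where pi is the softmax of `logits`. All argument names are
    assumptions made for this example, not an existing API.
    """
    log_probs = log_softmax(logits)
    return sum(
        math.exp(lp) * (q - alpha * lp)
        for q, lp in zip(q_values, log_probs)
    )


if __name__ == "__main__":
    # Two actions, uniform policy, entropy temperature alpha = 0.5.
    print(discrete_soft_value([1.0, 3.0], [0.0, 0.0], alpha=0.5))
```

With `alpha = 0` the function reduces to the ordinary expected Q-value under the policy; a larger `alpha` adds a bonus proportional to the policy's entropy.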