ray/rllib/agents/sac

Implementation of the Soft Actor-Critic algorithm:

[1] Soft Actor-Critic Algorithms and Applications - T. Haarnoja, A. Zhou, K. Hartikainen, et al. https://arxiv.org/abs/1812.05905.pdf

To support discrete action spaces, we implemented the following modification on top of the original algorithm:

[2] Soft Actor-Critic for Discrete Action Settings - Petros Christodoulou https://arxiv.org/pdf/1910.07207v2.pdf
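
As a rough illustration of the idea behind the discrete-action variant in [2]: with a finite action set, the policy can output a probability for every action, so the soft state value V(s) = sum_a pi(a|s) * (Q(s,a) - alpha * log pi(a|s)) can be computed as an exact expectation instead of being estimated from sampled actions. The sketch below is a minimal standalone example in plain Python and is not taken from the code in this directory. The function names `log_softmax` and `soft_state_value` and the sample numbers are hypothetical and chosen only for illustration.

```python
import math


def log_softmax(logits):
    """Numerically stable log-probabilities for a categorical policy."""
    max_logit = max(logits)
    log_sum_exp = max_logit + math.log(
        sum(math.exp(x - max_logit) for x in logits))
    return [x - log_sum_exp for x in logits]


def soft_state_value(logits, q_values, alpha):
    """Exact soft value: sum_a pi(a|s) * (Q(s,a) - alpha * log pi(a|s)).

    logits:   unnormalized policy outputs, one per discrete action
    q_values: Q(s, a) estimates, one per discrete action
    alpha:    entropy temperature (assumed fixed here for simplicity)
    """
    log_probs = log_softmax(logits)
    return sum(
        math.exp(log_p) * (q - alpha * log_p)
        for log_p, q in zip(log_probs, q_values))


# Hypothetical example: a uniform policy over two actions.
value = soft_state_value(logits=[0.0, 0.0], q_values=[1.0, 3.0], alpha=0.5)
```

With a uniform policy over two actions, the result is the mean Q-value (2.0) plus alpha times the policy entropy (0.5 * ln 2), which shows how the entropy bonus enters the value estimate.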