ray/rllib/agents/sac at 1b31c118063db1902e78dea0cf0d98ec85cdfb06 - hiro/ray

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 02:21:39 -05:00

History

Sven Mika 1b31c11806 [RLlib] DDPG re-factor to fit into RLlib's functional algorithm builder API. (#7934 )		2020-04-09 14:04:21 -07:00
..
tests	[RLlib] Add testing framework_iterator. (#7852 )	2020-04-03 12:24:25 -07:00
__init__.py	[RLlib] SAC refactor with new SquashedGaussian distribution class. (#7272 )	2020-02-23 16:10:20 -08:00
README.md	[RLlib] SAC add discrete action support. (#7320 )	2020-03-06 10:37:12 -08:00
sac.py	[RLlib] DQN torch version. (#7597 )	2020-04-06 11:56:16 -07:00
sac_model.py	[rllib] Add back get_policy_output method for SAC model (#7604 )	2020-03-20 12:44:04 -07:00
sac_policy.py	[RLlib] DDPG re-factor to fit into RLlib's functional algorithm builder API. (#7934 )	2020-04-09 14:04:21 -07:00

README.md

Implementation of the Soft Actor-Critic algorithm:

[1] Soft Actor-Critic Algorithms and Applications - T. Haarnoja, A. Zhou, K. Hartikainen, et. al https://arxiv.org/abs/1812.05905.pdf

For supporting discrete action spaces, we implemented this patch on top of the original algorithm: [2] Soft Actor-Critic for Discrete Action Settings - Petros Christodoulou https://arxiv.org/pdf/1910.07207v2.pdf