ray/rllib/agents/sac
Eric Liang dd70720578
[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)
* bulk rename

* deprecation warn

* update doc

* update fig

* line length

* rename

* make pytest comptaible

* fix test

* fi sys

* rename

* wip

* fix more

* lint

* update svg

* comments

* lint

* fix use of batch steps
2020-03-14 12:05:04 -07:00
..
tests [RLlib] SAC add discrete action support. (#7320) 2020-03-06 10:37:12 -08:00
__init__.py [RLlib] SAC refactor with new SquashedGaussian distribution class. (#7272) 2020-02-23 16:10:20 -08:00
README.md [RLlib] SAC add discrete action support. (#7320) 2020-03-06 10:37:12 -08:00
sac.py [rllib] Rename sample_batch_size => rollout_fragment_length (#7503) 2020-03-14 12:05:04 -07:00
sac_model.py [RLlib] SAC add discrete action support. (#7320) 2020-03-06 10:37:12 -08:00
sac_policy.py [RLlib] SAC add discrete action support. (#7320) 2020-03-06 10:37:12 -08:00

Implementation of the Soft Actor-Critic algorithm:

[1] Soft Actor-Critic Algorithms and Applications - T. Haarnoja, A. Zhou, K. Hartikainen, et. al https://arxiv.org/abs/1812.05905.pdf

For supporting discrete action spaces, we implemented this patch on top of the original algorithm: [2] Soft Actor-Critic for Discrete Action Settings - Petros Christodoulou https://arxiv.org/pdf/1910.07207v2.pdf