ray/docker
Kristian Hartikainen 13fb9fe3db [rllib] Feature/soft actor critic v2 (#5328)
* Add base for Soft Actor-Critic

* Pick changes from old SAC branch

* Update sac.py

* First implementation of sac model

* Remove unnecessary SAC imports

* Prune unnecessary noise and exploration code

* Implement SAC model and use that in SAC policy

* runs but doesn't learn

* clear state

* fix batch size

* Add missing alpha grads and vars

* -200 by 2k timesteps

* doc

* lazy squash

* one file

* ignore tfp

* revert done
2019-08-01 23:37:36 -07:00
Name         Last commit                                    Date
base-deps    [ci] Change Jenkins to py3 (#5022)             2019-06-24 21:50:37 -07:00
deploy       Remove CMake files (#4493)                     2019-04-02 22:17:33 -07:00
examples     [rllib] Feature/soft actor critic v2 (#5328)   2019-08-01 23:37:36 -07:00
stress_test  Update wheels to 0.8.0dev2 (#5186)             2019-07-12 17:27:03 -07:00
tune_test    Update wheels to 0.8.0dev2 (#5186)             2019-07-12 17:27:03 -07:00