* custom action dist wip
* Test case for custom action dist
* ActionDistribution.get_parameter_shape_for_action_space pattern
* Edit exception message to also suggest using a custom action distribution
* Clean up ModelCatalog.get_action_dist
* Pass model config to ActionDistribution constructors
* Update custom action distribution test case
* Name fix
* Autoformatter
* parameter shape static methods for torch distributions
* Fix docstring
* Generalize fake array for graph initialization
* Fix action dist constructors
* Correct parameter shape static methods for multicategorical and gaussian
* Make suggested changes to custom action dist's
* Correct instances of not passing model config to action dist
* Autoformatter
* fix tuple distribution constructor
* bugfix
* [tune] Fix small bug in experiment_analysis
_experiment_state was not set as an attribute of ExperimentAnalysis
* Update python/ray/tune/analysis/experiment_analysis.py
Co-Authored-By: Richard Liaw <rliaw@berkeley.edu>
* Add base for Soft Actor-Critic
* Pick changes from old SAC branch
* Update sac.py
* First implementation of sac model
* Remove unnecessary SAC imports
* Prune unnecessary noise and exploration code
* Implement SAC model and use that in SAC policy
* runs but doesn't learn
* clear state
* fix batch size
* Add missing alpha grads and vars
* -200 by 2k timesteps
* doc
* lazy squash
* one file
* ignore tfp
* revert done
* Regression test
* Split TaskDependencyManager::SubscribeDependencies into ray.get and ray.wait dependencies
- Some initial implementation
* unit test
* Improve unit tests for TaskDependencyManager
* Implement SubscribeWaitDependencies and UnsubscribeWaitDependencies, unit tests passing
* Add ray.wait python test for drivers that exit early
* Add WorkerID to Worker
* Update test to use two nodes
* Regression test for ray.wait passes
* Extend regression test to include ray.wait from an actor
* Fix ClientID and WorkerIDs
* lint
* lint
* Remove unnecessary ray_get argument
* fix build