* [rllib] Separate optimisers for DDPG actor & crit.
* [rllib] Better names for DDPG variables & options
Config changes:
- noise_scale -> exploration_ou_noise_scale
- exploration_theta -> exploration_ou_theta
- exploration_sigma -> exploration_ou_sigma
- act_noise -> exploration_gaussian_sigma
- noise_clip -> target_noise_clip
* [rllib] Make DDPG less class-y
Used functions to replace three classes with only an __init__ method & a
handful of unrelated attributes.
* [rllib] Refactor DDPG noise
* [rllib] Unify DDPG exploration annealing
Added option "exploration_should_anneal" to enable linear annealing of
exploration noise. By default this is off, for consistency with DDPG &
TD3 papers. Also renamed "exploration_final_eps" to
"exploration_final_scale" (that name seems to have been carried over
from DQN, and doesn't really make sense here). Finally, tried to rename
"eps" to "noise_scale" wherever possible.
## What do these changes do?
Sets ASHA defaults to paper defaults.
## Related issue number
## Linter
- [ ] I've run `scripts/format.sh` to lint the changes in this PR.
* ssh_control_path added as an auth option.
* revamped default ssh options to take in control path, nodeupdater checks auth config to see if a custom SSH sockets path was specified, otherwise the original hardcoded path is used. control path is now a nodeupdater instance variable
* revert socketdir in auth config and change method for determining dir
* new ssh dir method
* Lint
* ' -> " lint
* changed using USER env to getpass.getuser()
* Cleaner and more correct treatment of agent states in rollout.py
* support lstm_use_prev_action_reward in rollout.py
* Linter.
* appease flake8
* Use _DUMMY_AGENT_ID instead of 0.
* All agents have a policy_agent_mapping.
Reset the mapping cache at the start of each episode.
* Update rollout.py
* Fix rollout.py for single-agent envs.
* Use agent_id, not policy_id.
* Make full use of node
implement local node
fix bugs mentioned in comments
* Add more tests
* Use more specific exception handling
* fix, lint
* fix for py2.x