ray/rllib/contrib/bandits
Sven Mika e153e3179f
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
* Rollback.

* WIP.

* WIP.

* LINT.

* WIP.

* Fix.

* Fix.

* Fix.

* LINT.

* Fix (SAC does not currently support eager).

* Fix.

* WIP.

* LINT.

* Update rllib/evaluation/sampler.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/evaluation/sampler.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/utils/exploration/exploration.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/utils/exploration/exploration.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* WIP.

* WIP.

* Fix.

* LINT.

* LINT.

* Fix and LINT.

* WIP.

* WIP.

* WIP.

* WIP.

* Fix.

* LINT.

* Fix.

* Fix and LINT.

* Update rllib/utils/exploration/exploration.py

* Update rllib/policy/dynamic_tf_policy.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/policy/dynamic_tf_policy.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Update rllib/policy/dynamic_tf_policy.py

Co-Authored-By: Eric Liang <ekhliang@gmail.com>

* Fixes.

* LINT.

* WIP.

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-04-01 00:43:21 -07:00
agents          [rllib] Add scaling guide to documentation, improve bandit docs (#7780)                  2020-03-27 22:05:43 -07:00
envs            Contextual Bandit algorithms (WIP) (#7642)                                               2020-03-26 13:41:16 -07:00
examples        Contextual Bandit algorithms (WIP) (#7642)                                               2020-03-26 13:41:16 -07:00
models          Contextual Bandit algorithms (WIP) (#7642)                                               2020-03-26 13:41:16 -07:00
__init__.py     Contextual Bandit algorithms (WIP) (#7642)                                               2020-03-26 13:41:16 -07:00
exploration.py  [RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)  2020-04-01 00:43:21 -07:00