Sven Mika
|
c593fb09b7
|
[RLlib] Remove all f-strings to keep py3.5 compatibility.
|
2020-04-30 11:10:16 -07:00 |
|
Sven Mika
|
bf25aee392
|
[RLlib] Deprecate all Model(v1) usage. (#8146)
Deprecate all Model(v1) usage.
|
2020-04-29 12:12:59 +02:00 |
|
Sven Mika
|
d2b5c171cb
|
[RLlib] Add pytorch sigils to toc and add links to algo overview table. (#7950)
* Add torch sigils to toc-tree for DQN/APEX.
* WIP.
|
2020-04-09 10:40:18 -07:00 |
|
Sven Mika
|
e153e3179f
|
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
* Rollback.
* WIP.
* WIP.
* LINT.
* WIP.
* Fix.
* Fix.
* Fix.
* LINT.
* Fix (SAC does not currently support eager).
* Fix.
* WIP.
* LINT.
* Update rllib/evaluation/sampler.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/evaluation/sampler.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/utils/exploration/exploration.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/utils/exploration/exploration.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* WIP.
* WIP.
* Fix.
* LINT.
* LINT.
* Fix and LINT.
* WIP.
* WIP.
* WIP.
* WIP.
* Fix.
* LINT.
* Fix.
* Fix and LINT.
* Update rllib/utils/exploration/exploration.py
* Update rllib/policy/dynamic_tf_policy.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/policy/dynamic_tf_policy.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/policy/dynamic_tf_policy.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Fixes.
* LINT.
* WIP.
Co-authored-by: Eric Liang <ekhliang@gmail.com>
|
2020-04-01 00:43:21 -07:00 |
|
Eric Liang
|
5cebee68d6
|
[rllib] Add scaling guide to documentation, improve bandit docs (#7780)
* update
* reword
* update
* ms
* multi node sgd
* reorder
* improve bandit docs
* contrib
* update
* ref
* improve refs
* fix build
* add pillow dep
* add pil
* update pil
* pillow
* remove false
|
2020-03-27 22:05:43 -07:00 |
|
Saurabh Gupta
|
6ddf84b019
|
Contextual Bandit algorithms (WIP) (#7642)
|
2020-03-26 13:41:16 -07:00 |
|