Sven Mika
|
b95e28faea
|
[RLlib] APEX_DDPG (PyTorch) test case and docs. (#8288)
APEX_DDPG (PyTorch) test case and docs.
|
2020-05-04 09:36:27 +02:00 |
|
Eric Liang
|
2298f6fb40
|
[rllib] Port DQN/Ape-X to training workflow api (#8077)
|
2020-04-23 12:39:19 -07:00 |
|
Sven Mika
|
e153e3179f
|
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
* Rollback.
* WIP.
* WIP.
* LINT.
* WIP.
* Fix.
* Fix.
* Fix.
* LINT.
* Fix (SAC does currently not support eager).
* Fix.
* WIP.
* LINT.
* Update rllib/evaluation/sampler.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/evaluation/sampler.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/utils/exploration/exploration.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/utils/exploration/exploration.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* WIP.
* WIP.
* Fix.
* LINT.
* LINT.
* Fix and LINT.
* WIP.
* WIP.
* WIP.
* WIP.
* Fix.
* LINT.
* Fix.
* Fix and LINT.
* Update rllib/utils/exploration/exploration.py
* Update rllib/policy/dynamic_tf_policy.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/policy/dynamic_tf_policy.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/policy/dynamic_tf_policy.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Fixes.
* LINT.
* WIP.
Co-authored-by: Eric Liang <ekhliang@gmail.com>
|
2020-04-01 00:43:21 -07:00 |
|
Eric Liang
|
be48e1964b
|
[rllib] Fix per-worker exploration in Ape-X; make more kwargs required for future safety (#7504)
* fix sched
* lintc
* lint
* fix
* add unit test
* fix
* format
* fix test
* fix test
|
2020-03-10 11:14:14 -07:00 |
|
Eric Liang
|
596b39e36a
|
[rllib] Make timestep a required arg for exploration classes (#7380)
|
2020-03-04 13:00:37 -08:00 |
|
Sven Mika
|
83e06cd30a
|
[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* Fix
* WIP.
* Add TD3 quick Pendulum regresison.
* Cleanup.
* Fix.
* LINT.
* Fix.
* Sort quick_learning test cases, add TD3.
* Sort quick_learning test cases, add TD3.
* Revert test_checkpoint_restore.py (debugging) changes.
* Fix old soft_q settings in documentation and test configs.
* More doc fixes.
* Fix test case.
* Fix test case.
* Lower test load.
* WIP.
|
2020-03-01 11:53:35 -08:00 |
|
Sven Mika
|
6e1c3ea824
|
[RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974)
|
2020-02-10 15:22:07 -08:00 |
|