Avnish Narayan
|
b6077a36d4
|
[RLlib; Pre-checks/better failure behavior]: Env Checker for Gym Environments (#20481)
|
2021-11-19 09:41:03 +01:00 |
|
Sven Mika
|
3f89f35e52
|
[RLlib] Better error messages and hints; + failure-mode tests; (#18466)
|
2021-09-10 16:52:47 +02:00 |
|
Hao Chen
|
d49dadf891
|
Change Python's ObjectID to ObjectRef (#9353)
|
2020-07-10 17:49:04 +08:00 |
|
Sven Mika
|
43043ee4d5
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
|
2020-06-30 10:13:20 +02:00 |
|
Sven Mika
|
b37a162076
|
[RLlib] Make envs specifiable in configs by their class path. (#8750)
|
2020-06-03 08:14:29 +02:00 |
|
Sven Mika
|
2746fc0476
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
|
Sven Mika
|
1d4823c0ec
|
[RLlib] Add testing framework_iterator. (#7852)
* Add testing framework_iterator.
* LINT.
* WIP.
* Fix and LINT.
* LINT fix.
|
2020-04-03 12:24:25 -07:00 |
|
Sven Mika
|
e153e3179f
|
[RLlib] Exploration API: Policy changes needed for forward pass noisifications. (#7798)
* Rollback.
* WIP.
* WIP.
* LINT.
* WIP.
* Fix.
* Fix.
* Fix.
* LINT.
* Fix (SAC does currently not support eager).
* Fix.
* WIP.
* LINT.
* Update rllib/evaluation/sampler.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/evaluation/sampler.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/utils/exploration/exploration.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/utils/exploration/exploration.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* WIP.
* WIP.
* Fix.
* LINT.
* LINT.
* Fix and LINT.
* WIP.
* WIP.
* WIP.
* WIP.
* Fix.
* LINT.
* Fix.
* Fix and LINT.
* Update rllib/utils/exploration/exploration.py
* Update rllib/policy/dynamic_tf_policy.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/policy/dynamic_tf_policy.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Update rllib/policy/dynamic_tf_policy.py
Co-Authored-By: Eric Liang <ekhliang@gmail.com>
* Fixes.
* LINT.
* WIP.
Co-authored-by: Eric Liang <ekhliang@gmail.com>
|
2020-04-01 00:43:21 -07:00 |
|
Sven Mika
|
20ef4a8603
|
[RLlib] Cleanup/unify all test cases. (#7533)
|
2020-03-11 20:39:47 -07:00 |
|
Eric Liang
|
be48e1964b
|
[rllib] Fix per-worker exploration in Ape-X; make more kwargs required for future safety (#7504)
* fix sched
* lintc
* lint
* fix
* add unit test
* fix
* format
* fix test
* fix test
|
2020-03-10 11:14:14 -07:00 |
|
Sven Mika
|
d537e9f0d8
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
|
Adrian O'Grady
|
fe6ce714a0
|
[rllib] - TaskPool.completed_prefetch() no longer returns stale object ids after an error (#7139)
|
2020-02-13 22:30:44 -08:00 |
|
Sven Mika
|
6e1c3ea824
|
[RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974)
|
2020-02-10 15:22:07 -08:00 |
|
Sven Mika
|
ae9a3a2237
|
[RLlib] from_config util method for framework agnostic components; start moving RLlib tests into Bazel. (#6865)
|
2020-01-22 17:02:58 -08:00 |
|