Eric Liang
|
4963dfaae0
|
[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)
|
2022-05-24 22:14:25 -07:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
e485aa846a
|
[RLlib; Docs overhaul] Overhaul of auto-API reference pages (via sphinx autoclass/automodule). (#19786)
|
2021-12-15 22:32:52 +01:00 |
|
Sven Mika
|
8ea1bc5ff9
|
[RLlib] Allow for more than 2^31 policy timesteps. (#11301)
|
2020-10-12 13:49:11 -07:00 |
|
Sven Mika
|
f43d934817
|
[RLlib] Type annotations for policy. (#9248)
|
2020-07-05 13:09:51 +02:00 |
|
Sven Mika
|
43043ee4d5
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
|
2020-06-30 10:13:20 +02:00 |
|
Sven Mika
|
baa053496a
|
[RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414)
|
2020-05-26 11:10:27 +02:00 |
|
Sven Mika
|
6c2b9a4cfa
|
[RLlib] Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304)
Remove tf.py_function from all Schedule classes (not differentiable and causes other bugs in MA setups). (#8304)
|
2020-05-04 23:53:38 +02:00 |
|
Eric Liang
|
be48e1964b
|
[rllib] Fix per-worker exploration in Ape-X; make more kwargs required for future safety (#7504)
* fix sched
* lintc
* lint
* fix
* add unit test
* fix
* format
* fix test
* fix test
|
2020-03-10 11:14:14 -07:00 |
|
Sven Mika
|
83e06cd30a
|
[RLlib] DDPG refactor and Exploration API action noise classes. (#7314)
* WIP.
* WIP.
* WIP.
* WIP.
* WIP.
* Fix
* WIP.
* Add TD3 quick Pendulum regresison.
* Cleanup.
* Fix.
* LINT.
* Fix.
* Sort quick_learning test cases, add TD3.
* Sort quick_learning test cases, add TD3.
* Revert test_checkpoint_restore.py (debugging) changes.
* Fix old soft_q settings in documentation and test configs.
* More doc fixes.
* Fix test case.
* Fix test case.
* Lower test load.
* WIP.
|
2020-03-01 11:53:35 -08:00 |
|
Sven Mika
|
6e1c3ea824
|
[RLlib] Exploration API (+EpsilonGreedy sub-class). (#6974)
|
2020-02-10 15:22:07 -08:00 |
|
Sven Mika
|
136ada5fb9
|
[RLlib] Experiment with py_func as a means to further unify tf and torch (Schedule classes). (#6951)
|
2020-01-30 11:27:57 -08:00 |
|
Sven Mika
|
4c97348cb6
|
[RLlib] Schedule-classes multi-framework support. (#6926)
|
2020-01-28 11:07:55 -08:00 |
|