Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
60b2219d72
|
[RLlib] Allow for evaluation to run by timesteps (alternative to episodes ) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757)
|
2021-12-04 13:26:33 +01:00 |
|
Sven Mika
|
ed85f59194
|
[RLlib] Unify all RLlib Trainer.train() -> results[info][learner][policy ID][learner_stats] and add structure tests. (#18879)
|
2021-09-30 16:39:05 +02:00 |
|
Sven Mika
|
e2be41b407
|
[RLlib] MARWIL + BC: Various fixes and enhancements. (#16218)
|
2021-06-03 22:29:00 +02:00 |
|
Sven Mika
|
c524f86785
|
[RLlib] BC/MARWIL/recurrent nets minor cleanups and bug fixes. (#13064)
|
2020-12-27 09:46:03 -05:00 |
|
Sven Mika
|
bfc4f95e01
|
[RLlib] Fix test_bc.py test case. (#11722)
* Fix large json test file.
* Fix large json test file.
* WIP.
|
2020-10-31 00:16:09 -07:00 |
|
Sven Mika
|
4b278c36fc
|
[RLlib] Behavioral Cloning (from MARWIL). (#10619)
|
2020-09-09 17:33:21 +02:00 |
|