Sven Mika | 130b7eeaba | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00 |
Sven Mika | 09886d7ab8 | [RLlib] Upgrade gym 0.23 (#24171) | 2022-05-23 08:18:44 +02:00 |
Eric Liang | 1ff874e8e8 | [spelling] Add linter rule for mis-capitalizations of RLLib -> RLlib (#23817) | 2022-04-10 16:12:53 -07:00 |
Siyuan (Ryans) Zhuang | 0c74ecad12 | [Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). (#23128) | 2022-03-15 17:34:21 +01:00 |
Balaji Veeramani | 7f1bacc7dc | [CI] Format Python code with Black (#21975) See #21316 and #21311 for the motivation behind these changes. | 2022-01-29 18:41:57 -08:00 |
Sven Mika | 60b2219d72 | [RLlib] Allow for evaluation to run by timesteps (alternative to episodes ) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757) | 2021-12-04 13:26:33 +01:00 |
Sven Mika | e6aae61487 | [RLlib; testing] Fix bug in stress tests not handling >1 trials per experiment (due to grid-search in IMPALA stress tests). (#18705) | 2021-09-20 15:31:57 +02:00 |
Sven Mika | c5d20849ae | [RLlib] Rename rllib rollout into rllib evaluate (backward compatible) to match Trainer API. (#18467) | 2021-09-15 08:45:17 +02:00 |