Sven Mika
|
60b2219d72
|
[RLlib] Allow for evaluation to run by timesteps (alternative to episodes) and add auto-setting to make sure train doesn't ever have to wait for eval (e.g. long episodes) to finish. (#20757)
|
2021-12-04 13:26:33 +01:00 |
|
Sven Mika
|
e6aae61487
|
[RLlib; testing] Fix bug in stress tests not handling >1 trials per experiment (due to grid-search in IMPALA stress tests). (#18705)
|
2021-09-20 15:31:57 +02:00 |
|
Sven Mika
|
c5d20849ae
|
[RLlib] Rename rllib rollout into rllib evaluate (backward compatible) to match Trainer API. (#18467)
|
2021-09-15 08:45:17 +02:00 |
|