Avnish Narayan
|
0d2ba41e41
|
[RLlib] [CI] Deflake longer running RLlib learning tests for off policy algorithms. Fix seeding issue in TransformedAction Environments (#21685)
|
2022-02-04 14:59:56 +01:00 |
|
Sven Mika
|
63db0e3a7c
|
[RLlib] Fix SAC learning test flakiness introduced in PR: "Sub-class Trainer (instead of build_trainer() ): All remaining classes; soft-deprecate build_trainer ." (#20985)
|
2021-12-09 14:24:27 +01:00 |
|
Sven Mika
|
8a72824c63
|
[RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591)
|
2021-09-15 22:16:48 +02:00 |
|
Sven Mika
|
53206dd440
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
|