Jun Gong
|
e1cf0cc982
|
[RLlib] Deflake cartpole crashing tests. (#27097)
Make sure cartpole crashing tests are not flaky.
|
2022-07-27 12:50:34 -07:00 |
|
Jun Gong
|
a22457b548
|
[RLlib] Small bug fix (#27003)
|
2022-07-27 00:02:18 -07:00 |
|
Sven Mika
|
4aea24c8a8
|
[RLlib] restart_failed_sub_environments now works for MA cases and crashes during reset() ; +more tests and logging; add eval worker sub-env fault tolerance test. (#26276)
|
2022-07-15 08:55:14 +02:00 |
|
Sven Mika
|
1499af945b
|
[RLlib] Algorithm step() fixes: evaluation should NOT be part of timed training_step loop. (#25924)
|
2022-06-20 19:53:47 +02:00 |
|
Sven Mika
|
d95009a3ac
|
[RLlib] Vectorized envs: Gracefully handle sub-environments failing by restarting them (if configured so). (#24967)
|
2022-05-28 10:50:03 +02:00 |
|
Sven Mika
|
853d10871c
|
[RLlib] Issue 18499: PGTrainer with training_iteration fn does not support multi-GPU. (#21376)
|
2022-01-05 18:22:33 +01:00 |
|
Sven Mika
|
62dbf26394
|
[RLlib] POC: Run PGTrainer w/o the distr. exec API (Trainer's new training_iteration method). (#20984)
|
2021-12-21 08:39:05 +01:00 |
|
Sven Mika
|
599e589481
|
[RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065)
|
2021-08-31 14:56:53 +02:00 |
|
Sven Mika
|
2746fc0476
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
|
Sven Mika
|
baa053496a
|
[RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414)
|
2020-05-26 11:10:27 +02:00 |
|