Sven Mika
|
90b21ce27e
|
[RLlib] De-flake 3 test cases; Fix config.simple_optimizer and SampleBatch.is_training warnings. (#17321)
|
2021-07-27 14:39:06 -04:00 |
|
Sven Mika
|
53206dd440
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
|
Sven Mika
|
e98808ce11
|
[RLlib] Fix 2 flakey test cases. (#14892)
|
2021-03-29 17:20:29 +02:00 |
|
Eric Liang
|
af8a93f2a4
|
Deflake some RLlib tests (#14947)
* fix
* update
* 100
* flake
|
2021-03-26 11:45:17 -07:00 |
|
Sven Mika
|
4e17f95927
|
[RLlib] Unflake 2 test cases (SAC cont. cartpole). (#14620)
|
2021-03-15 14:03:54 +01:00 |
|
Sven Mika
|
52c94b7ee9
|
[RLlib] Allow SAC to use custom models as Q- or policy nets and deprecate "state-preprocessor" for image spaces. (#13522)
|
2021-02-02 13:05:58 +01:00 |
|
Sven Mika
|
2e3655e8a9
|
[RLlib] Issue 9071 A3C w/ RNN not working due to VF assuming no RNN. (#13238)
|
2021-01-19 14:22:36 +01:00 |
|
Sven Mika
|
bb03e2499b
|
[RLlib] PyBullet Env native support via env str-specifier (if installed). (#12209)
|
2020-11-30 12:41:24 +01:00 |
|
Sven Mika
|
4afaa46028
|
[RLlib] Increase the scope of RLlib's regression tests. (#12200)
|
2020-11-24 22:18:31 +01:00 |
|
Sven Mika
|
8a891b3c30
|
[RLlib] SAC n_step > 1. (#10567)
|
2020-09-05 22:26:42 +02:00 |
|
Sven Mika
|
2746fc0476
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
|
Sven Mika
|
baa053496a
|
[RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414)
|
2020-05-26 11:10:27 +02:00 |
|