Author | Commit | Message | Date
Sven Mika | 2d34216660 | [RLlib] APEX-DQN: Bug fix for torch and add learning test. (#15762) | 2021-05-20 09:27:03 +02:00
Sven Mika | 6f4d988713 | [RLlib] Issue 15556: Fix R2D2 using chunks from previous episodes in the "burn-in" window. (#15737) | 2021-05-18 11:05:42 +02:00
Eric Liang | af8a93f2a4 | Deflake some RLlib tests (#14947) [fix; update; 100; flake] | 2021-03-26 11:45:17 -07:00
Sven Mika | 8000258333 | [RLlib] R2D2 Implementation. (#13933) | 2021-02-25 12:18:11 +01:00
Sven Mika | deb33bce84 | [RLlib] Add DQN SoftQ learning test case. (#12712) | 2020-12-10 14:55:19 +01:00
Sven Mika | b6b54f1c81 | [RLlib] Trajectory view API: enable by default for SAC, DDPG, DQN, SimpleQ (#11827) | 2020-11-16 10:54:35 -08:00
Sven Mika | 5b788ccb13 | [RLlib] Trajectory view API (prep PR for switching on by default across all RLlib; plumbing only) (#11717) | 2020-11-03 12:53:34 -08:00
Sven Mika | 2746fc0476 | [RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520) | 2020-05-27 16:19:13 +02:00
Sven Mika | baa053496a | [RLlib] Benchmark and regression test yaml cleanup and restructuring. (#8414) | 2020-05-26 11:10:27 +02:00