Sven Mika
|
22c9c4aa39
|
[RLlib] Slate-Q +GPU torch bug fix. (#23464)
|
2022-03-24 17:39:33 +01:00 |
|
Jun Gong
|
e765915ded
|
[RLlib] Make sure SlateQ works with GPU. (#22738)
|
2022-03-04 17:49:51 +01:00 |
|
Sven Mika
|
7b687e6cd8
|
[RLlib] SlateQ: Add a hard-task learning test to weekly regression suite. (#22544)
|
2022-02-25 21:58:16 +01:00 |
|
Sven Mika
|
6522935291
|
[RLlib] Slate-Q tf implementation and tests/benchmarks. (#22389)
|
2022-02-22 09:36:44 +01:00 |
|
Sven Mika
|
38d75ce058
|
[RLlib] Cleanup SlateQ algo; add test + add target Q-net (#21827)
|
2022-02-04 17:01:12 +01:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
99ae7bae05
|
[RLlib] JAXPolicy prep. PR #1. (#13077)
|
2020-12-26 20:14:18 -05:00 |
|
desktable
|
5af745c90d
|
[RLlib] Implement the SlateQ algorithm (#11450)
|
2020-11-03 09:52:04 +01:00 |
|