Jun Gong | a385c9b127 | [RLlib] Update bandit_envs_recommender_system (#22421) | 2022-02-24 22:43:41 +01:00 |
Sven Mika | 6522935291 | [RLlib] Slate-Q tf implementation and tests/benchmarks. (#22389) | 2022-02-22 09:36:44 +01:00 |
Sven Mika | 38d75ce058 | [RLlib] Cleanup SlateQ algo; add test + add target Q-net (#21827) | 2022-02-04 17:01:12 +01:00 |
Jun Gong | 9c95b9a5fa | [RLlib] Add an env wrapper so RecSim works with our Bandits agent. (#22028) | 2022-02-02 12:15:38 +01:00 |
Balaji Veeramani | 7f1bacc7dc | [CI] Format Python code with Black (#21975) See #21316 and #21311 for the motivation behind these changes. | 2022-01-29 18:41:57 -08:00 |
Sven Mika | 893536ebd9 | [RLlib] Move bandits into main agents folder; Make RecSim adapter more accessible; (#21773) | 2022-01-27 13:58:12 +01:00 |