Rohan Potdar | 38c9e1d52a | [RLlib]: Fix OPE trainables (#26279) Co-authored-by: Kourosh Hakhamaneshi <kourosh@anyscale.com> | 2022-07-17 14:25:53 -07:00 |
Rohan Potdar | 09ce4711fd | [RLlib]: Move OPE to evaluation config (#25911) | 2022-07-12 11:04:34 -07:00 |
Rohan Potdar | 28df3f34f5 | [RLlib]: Off-Policy Evaluation fixes. (#25899) | 2022-06-21 13:24:24 +02:00 |
Rohan Potdar | a9d8da0100 | [RLlib]: Doubly Robust Off-Policy Evaluation. (#25056) | 2022-06-07 12:52:19 +02:00 |
Eric Liang | 4963dfaae0 | [api] Add API stability annotations for all RLlib symbols and add to LINT (#25060) | 2022-05-24 22:14:25 -07:00 |
Sven Mika | 7cca7782f1 | [RLlib] OPE (off policy estimator) API. (#24384) | 2022-05-02 21:15:50 +02:00 |