kourosh hakhamaneshi | 4607e788c1 | [RLlib] Fix test_ope flakiness (#27676) | 2022-08-09 16:12:30 -07:00
Rohan Potdar | 5b6a58ed28 | [RLlib] Add OPE Learning Tests (#27154) | 2022-08-02 17:51:38 -07:00
Rohan Potdar | deccf33912 | [RLlib]: Add Off-Policy Estimation docs (#26809) Co-authored-by: Kourosh Hakhamaneshi <kourosh@anyscale.com> | 2022-07-26 13:57:56 -07:00
Rohan Potdar | 4fded80813 | [RLlib]: Fix FQE Policy call (#26671) | 2022-07-19 00:58:31 -07:00
Rohan Potdar | 38c9e1d52a | [RLlib]: Fix OPE trainables (#26279) Co-authored-by: Kourosh Hakhamaneshi <kourosh@anyscale.com> | 2022-07-17 14:25:53 -07:00
Rohan Potdar | 09ce4711fd | [RLlib]: Move OPE to evaluation config (#25911) | 2022-07-12 11:04:34 -07:00
kourosh hakhamaneshi | be6e4c644f | [RLlib] Feature importance evaluation for offline RL (#26412) | 2022-07-11 18:12:50 -07:00
Rohan Potdar | 28df3f34f5 | [RLlib]: Off-Policy Evaluation fixes. (#25899) | 2022-06-21 13:24:24 +02:00
Sven Mika | 96693055bd | [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) | 2022-06-20 15:54:00 +02:00
Rohan Potdar | a9d8da0100 | [RLlib]: Doubly Robust Off-Policy Evaluation. (#25056) | 2022-06-07 12:52:19 +02:00
Eric Liang | 4963dfaae0 | [api] Add API stability annotations for all RLlib symbols and add to LINT (#25060) | 2022-05-24 22:14:25 -07:00
Rohan Potdar | 5a70b732e8 | [RLlib] MARWIL and BC Config. (#24853) | 2022-05-21 12:50:20 +02:00
Sven Mika | 7cca7782f1 | [RLlib] OPE (off policy estimator) API. (#24384) | 2022-05-02 21:15:50 +02:00