Avnish Narayan
|
f5a9a44b9c
|
[RLlib] Revert Revert Fix apex long running test (#26928)
|
2022-07-26 15:10:25 -07:00 |
|
Avnish Narayan
|
a50a81a13a
|
Revert "[RLlib] Fix apex breakout release test performance. (#26867)" (#26927)
|
2022-07-23 17:27:50 +02:00 |
|
Avnish Narayan
|
2cfd6c2e97
|
[RLlib] Fix apex breakout release test performance. (#26867)
|
2022-07-23 13:53:03 +02:00 |
|
Avnish Narayan
|
82395c4646
|
[RLlib] Put learning test into own folders (#26862)
Co-authored-by: Artur Niederfahrenhorst <artur@anyscale.com>
|
2022-07-22 11:20:47 -07:00 |
|
Artur Niederfahrenhorst
|
4ce9686d94
|
[RLlib] Fixes MARWIL release tests (#26586)
|
2022-07-15 11:13:15 -07:00 |
|
Jun Gong
|
c026374acb
|
[RLlib] Fix the 2 failing RLlib release tests. (#25603)
|
2022-06-14 14:51:08 +02:00 |
|
Sven Mika
|
7c39aa5fac
|
[RLlib] Trainer.training_iteration -> Trainer.training_step; Iterations vs reportings: Clarification of terms. (#25076)
|
2022-06-10 17:09:18 +02:00 |
|
Rohan Potdar
|
a9d8da0100
|
[RLlib]: Doubly Robust Off-Policy Evaluation. (#25056)
|
2022-06-07 12:52:19 +02:00 |
|
Rohan Potdar
|
ab81c8e9ca
|
[RLlib]: Rename input_evaluation to off_policy_estimation_methods . (#25107)
|
2022-05-27 13:14:54 +02:00 |
|
Steven Morad
|
501d932449
|
[RLlib] SAC, RNNSAC, and CQL TrainerConfig objects (#25059)
|
2022-05-22 19:58:47 +02:00 |
|
Artur Niederfahrenhorst
|
fb2915d26a
|
[RLlib] Replay Buffer API and Ape-X. (#24506)
|
2022-05-17 13:43:49 +02:00 |
|
Sven Mika
|
0cd7bc4054
|
[RLlib] Re-establish dashboard performance tests. (#24728)
|
2022-05-16 13:13:49 +02:00 |
|