Sven Mika
|
7c39aa5fac
|
[RLlib] Trainer.training_iteration -> Trainer.training_step; Iterations vs reportings: Clarification of terms. (#25076)
|
2022-06-10 17:09:18 +02:00 |
|
Artur Niederfahrenhorst
|
fb2915d26a
|
[RLlib] Replay Buffer API and Ape-X. (#24506)
|
2022-05-17 13:43:49 +02:00 |
|
Sven Mika
|
f066180ed5
|
[RLlib] Deprecate timesteps_per_iteration config key (in favor of min_[sample|train]_timesteps_per_reporting . (#24372)
|
2022-05-02 12:51:14 +02:00 |
|
Sven Mika
|
e98808ce11
|
[RLlib] Fix 2 flakey test cases. (#14892)
|
2021-03-29 17:20:29 +02:00 |
|
Eric Liang
|
af8a93f2a4
|
Deflake some RLlib tests (#14947)
* fix
* update
* 100
* flake
|
2021-03-26 11:45:17 -07:00 |
|
Sven Mika
|
4e17f95927
|
[RLlib] Unflake 2 test cases (SAC cont. cartpole). (#14620)
|
2021-03-15 14:03:54 +01:00 |
|
Sven Mika
|
bb03e2499b
|
[RLlib] PyBullet Env native support via env str-specifier (if installed). (#12209)
|
2020-11-30 12:41:24 +01:00 |
|