Sven Mika
|
f066180ed5
|
[RLlib] Deprecate timesteps_per_iteration config key (in favor of min_[sample|train]_timesteps_per_reporting . (#24372)
|
2022-05-02 12:51:14 +02:00 |
|
Sven Mika
|
b2b1c95aa5
|
[RLlib] A2/3C Config objects (A2CConfig and A3CConfig). (#24332)
|
2022-04-30 09:51:09 +02:00 |
|
Sven Mika
|
ba14f0a41b
|
[RLlib] PGTrainer config object class (PGConfig ). (#24295)
|
2022-04-28 22:25:16 +02:00 |
|
Sven Mika
|
c82f6c62c8
|
[RLlib] Make RolloutWorkers (optionally) recoverable after failure. (#23739)
|
2022-04-08 15:33:28 +02:00 |
|
Sven Mika
|
2eaa54bd76
|
[RLlib] POC: Config objects instead of dicts (PPO only). (#23491)
|
2022-03-31 18:26:12 +02:00 |
|