Artur Niederfahrenhorst
|
0dceddb912
|
[RLlib] Move learning_starts logic from buffers into training_step() . (#26032)
|
2022-08-11 13:07:30 +02:00 |
|
Sven Mika
|
96693055bd
|
[RLlib] More Trainer -> Algorithm renaming cleanups. (#25869)
|
2022-06-20 15:54:00 +02:00 |
|
Sven Mika
|
130b7eeaba
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
|
Artur Niederfahrenhorst
|
94d6c212df
|
[RLlib] Replay Buffer API documentation. (#24683)
|
2022-06-10 16:47:51 +02:00 |
|
Artur Niederfahrenhorst
|
35bd397181
|
[RLlib] Better default values for training_intensity and target_network_update_freq for R2D2. (#25510)
|
2022-06-07 10:29:56 +02:00 |
|
Sven Mika
|
b5bc2b93c3
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
|