Jun Gong
|
b383d987d1
|
[RLlib] Fix a bunch of issues related to connectors. (#26510)
|
2022-07-13 18:55:20 +02:00 |
|
Artur Niederfahrenhorst
|
efea87f0cb
|
[RLlib] SimpleQ PyTorch Multi GPU fix (#26109)
|
2022-06-28 12:12:56 +02:00 |
|
Sven Mika
|
d90c6cfbd6
|
[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)
|
2022-06-17 20:12:16 +02:00 |
|
Sven Mika
|
130b7eeaba
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
|
Sven Mika
|
7c39aa5fac
|
[RLlib] Trainer.training_iteration -> Trainer.training_step; Iterations vs reportings: Clarification of terms. (#25076)
|
2022-06-10 17:09:18 +02:00 |
|
Sven Mika
|
b5bc2b93c3
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
|