Avnish Narayan | 804719876b | [RLlib] Remove execution plan code no longer used by RLlib. (#25624) | 2022-06-14 10:57:27 +02:00
Sven Mika | 130b7eeaba | [RLlib] Trainer to Algorithm renaming. (#25539) | 2022-06-11 15:10:39 +02:00
Sven Mika | 7c39aa5fac | [RLlib] Trainer.training_iteration -> Trainer.training_step; Iterations vs reportings: Clarification of terms. (#25076) | 2022-06-10 17:09:18 +02:00
Sven Mika | b5bc2b93c3 | [RLlib] Move all remaining algos into algorithms directory. (#25366) | 2022-06-04 07:35:24 +02:00
Sven Mika | 30f6fc340b | [RLlib] AlphaZero TrainerConfig objects. (#25256) | 2022-05-30 15:37:58 +02:00
Sven Mika | 09886d7ab8 | [RLlib] Upgrade gym 0.23 (#24171) | 2022-05-23 08:18:44 +02:00
Sven Mika | 8f50087908 | [RLlib] AlphaZero uses training_iteration API. (#24507) | 2022-05-18 09:58:25 +02:00