Charles Sun
|
70f94e6d63
|
[RLlib] Migrating DDPG to PolicyV2. (#26054)
|
2022-06-28 15:52:56 +02:00 |
|
Artur Niederfahrenhorst
|
dcbc225728
|
[RLlib] Fix DDPG test ignoring framework_iterator -modified config. (#25913)
|
2022-06-21 16:17:42 +02:00 |
|
Sven Mika
|
130b7eeaba
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
|
Sven Mika
|
7c39aa5fac
|
[RLlib] Trainer.training_iteration -> Trainer.training_step; Iterations vs reportings: Clarification of terms. (#25076)
|
2022-06-10 17:09:18 +02:00 |
|
Sven Mika
|
b5bc2b93c3
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
|
Sven Mika
|
94557e3095
|
[RLlib] Apex-DDPG TrainerConfig objects. (#25279)
|
2022-05-30 19:45:38 +02:00 |
|
Sven Mika
|
f75ede1b81
|
[RLlib] MA-DDPG TrainerConfig objects. (#25255)
|
2022-05-30 15:38:24 +02:00 |
|
Sven Mika
|
baf8c2fa1e
|
[RLlib] TD3 config objects. (#25065)
|
2022-05-23 10:07:13 +02:00 |
|
kourosh hakhamaneshi
|
3815e52a61
|
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896)
|
2022-05-19 18:30:42 +02:00 |
|