Sven Mika
|
b5bc2b93c3
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
|
Sven Mika
|
e73c37cc17
|
[RLlib] MADDPG: Move into main algorithms folder and add proper unit and learning tests. (#24579)
|
2022-05-24 12:53:53 +02:00 |
|
kourosh hakhamaneshi
|
3815e52a61
|
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896)
|
2022-05-19 18:30:42 +02:00 |
|
Artur Niederfahrenhorst
|
fb2915d26a
|
[RLlib] Replay Buffer API and Ape-X. (#24506)
|
2022-05-17 13:43:49 +02:00 |
|
Sven Mika
|
25001f6d8d
|
[RLlib] APPO Training iteration fn. (#24545)
|
2022-05-17 10:31:07 +02:00 |
|
Sven Mika
|
7ab19ddc32
|
[RLlib] MADDPG: Move into agents folder (from contrib) and use training_iteration method. (#24502)
|
2022-05-06 12:35:21 +02:00 |
|