Artur Niederfahrenhorst
|
0dceddb912
|
[RLlib] Move learning_starts logic from buffers into training_step() . (#26032)
|
2022-08-11 13:07:30 +02:00 |
|
kourosh hakhamaneshi
|
3b3c20209b
|
[RLlib] Fix dqn reproducibility (#27459)
|
2022-08-09 15:56:44 -07:00 |
|
Avnish Narayan
|
9063cc9d5e
|
[RLlib] Fix memory leak in APEX_DQN (#26691)
|
2022-07-19 16:16:24 -07:00 |
|
Sven Mika
|
59a967a3a0
|
[RLlib] Cleanup some deprecated metric keys and classes. (#26036)
|
2022-06-23 21:30:01 +02:00 |
|
Sven Mika
|
d90c6cfbd6
|
[RLlib] SimpleQ PolicyV2 (sub-classing). (#25871)
|
2022-06-17 20:12:16 +02:00 |
|
Sven Mika
|
130b7eeaba
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
|
Sven Mika
|
7c39aa5fac
|
[RLlib] Trainer.training_iteration -> Trainer.training_step; Iterations vs reportings: Clarification of terms. (#25076)
|
2022-06-10 17:09:18 +02:00 |
|
Avnish Narayan
|
730df43656
|
[RLlib] Issue 25503: Replace torch.range with torch.arange. (#25640)
|
2022-06-10 13:21:54 +02:00 |
|
Sven Mika
|
b5bc2b93c3
|
[RLlib] Move all remaining algos into algorithms directory. (#25366)
|
2022-06-04 07:35:24 +02:00 |
|
Eric Liang
|
905258dbc1
|
Clean up docstyle in python modules and add LINT rule (#25272)
|
2022-06-01 11:27:54 -07:00 |
|
Jun Gong
|
eaf9c941ae
|
[RLlib] Migrate PPO Impala and APPO policies to use sub-classing implementation. (#25117)
|
2022-05-25 14:38:03 +02:00 |
|
Artur Niederfahrenhorst
|
d76ef9add5
|
[RLLib] Fix RNNSAC example failing on CI + fixes for recurrent models for other Q Learning Algos. (#24923)
|
2022-05-24 14:39:43 +02:00 |
|
Sven Mika
|
ec89fe5203
|
[RLlib] APEX-DQN and R2D2 config objects. (#25067)
|
2022-05-23 12:15:45 +02:00 |
|
Sven Mika
|
baf8c2fa1e
|
[RLlib] TD3 config objects. (#25065)
|
2022-05-23 10:07:13 +02:00 |
|
Steven Morad
|
501d932449
|
[RLlib] SAC, RNNSAC, and CQL TrainerConfig objects (#25059)
|
2022-05-22 19:58:47 +02:00 |
|
Rohan Potdar
|
5a70b732e8
|
[RLlib] MARWIL and BC Config. (#24853)
|
2022-05-21 12:50:20 +02:00 |
|
Kai Fricke
|
3e053c85ee
|
[RLlib] Fix broken links from agent -> algo conversion. (#25014)
|
2022-05-20 11:37:11 +02:00 |
|
kourosh hakhamaneshi
|
3815e52a61
|
[RLlib] Agents to algos: DQN w/o Apex and R2D2, DDPG/TD3, SAC, SlateQ, QMIX, PG, Bandits (#24896)
|
2022-05-19 18:30:42 +02:00 |
|