Artur Niederfahrenhorst
|
8d906f9bf8
|
[RLlib] SAC with new Replay Buffer API. (#24156)
|
2022-05-09 14:33:02 +02:00 |
|
Artur Niederfahrenhorst
|
306853b5b8
|
[RLlib] Issue 22693: RNN-SAC fixes. (#23814)
|
2022-04-25 09:19:24 +02:00 |
|
Avnish Narayan
|
5134e0dc12
|
[RLlib] Change type to tensortype for cql policies. (#23438)
|
2022-03-24 12:32:29 +01:00 |
|
Fabian Witter
|
2547055f38
|
[RLlib] Add support for complex observations in CQL (#23332)
|
2022-03-22 17:04:07 +01:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Jun Gong
|
2317c693cf
|
[RLlib] Use SampleBatch instead of input dict whenever possible (#20746)
|
2021-12-02 13:11:26 +01:00 |
|
Sven Mika
|
08c09737fa
|
[RLlib] Fix R2D2 (torch) multi-GPU issue. (#18550)
|
2021-09-14 19:58:10 +02:00 |
|
Sven Mika
|
811d71b368
|
[RLlib] Issue 17653: Torch multi-GPU (>1) broken for LSTMs. (#17657)
|
2021-08-11 12:44:35 +02:00 |
|
Sven Mika
|
53206dd440
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
|
Sven Mika
|
839fc59224
|
[RLlib] CQL TensorFlow support (#15841)
|
2021-05-18 11:10:46 +02:00 |
|