Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
ee41800c16
|
[RLlib] Preparatory PR for multi-agent, multi-GPU learning agent (alpha-star style) #02. (#21649)
|
2022-01-27 22:07:05 +01:00 |
|
Sven Mika
|
c4636c7c05
|
[RLlib] Issue 21633: SimpleQ should not use a prio. replay buffer. (#21665)
|
2022-01-20 11:46:25 +01:00 |
|
Sven Mika
|
92f030331e
|
[RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420)
|
2022-01-10 11:22:55 +01:00 |
|
Sven Mika
|
853d10871c
|
[RLlib] Issue 18499: PGTrainer with training_iteration fn does not support multi-GPU. (#21376)
|
2022-01-05 18:22:33 +01:00 |
|
Artur Niederfahrenhorst
|
d07e50e957
|
[RLlib] Replay buffer API (cleanups; docstrings; renames; move into rllib/execution/buffers dir) (#20552)
|
2021-11-19 11:57:37 +01:00 |
|