Artur Niederfahrenhorst | e57ce7efd6 | [RLlib] Replay Buffer API and Training Iteration Fn for DQN. (#23420) | 2022-04-18 12:20:12 +02:00
Balaji Veeramani | 7f1bacc7dc | [CI] Format Python code with Black (#21975). See #21316 and #21311 for the motivation behind these changes. | 2022-01-29 18:41:57 -08:00
Sven Mika | b10d5533be | [RLlib] Issue 20920 (partial solution): contrib/MADDPG + pettingzoo coop-pong-v4 not working. (#21452) | 2022-01-10 11:19:40 +01:00
Sven Mika | b4790900f5 | [RLlib] Sub-class Trainer (instead of build_trainer()): All remaining classes; soft-deprecate build_trainer. (#20725) | 2021-12-04 22:05:26 +01:00
Sven Mika | 9c9b482661 | [RLlib] Allow n-step > 1 and prio. replay for R2D2 and RNNSAC. (#18939) | 2021-09-29 21:31:34 +02:00
ddworak94 | fba8461663 | [RLlib] Add RNN-SAC agent (#16577). Shoutout to @ddworak94 :) | 2021-07-25 10:04:52 -04:00