Max Pumperla
|
6a6c58b5b4
|
[RLlib] Config objects for DDPG and SimpleQ. (#24339)
|
2022-05-12 16:12:42 +02:00 |
|
Artur Niederfahrenhorst
|
95d4a83a87
|
[RLlib] R2D2 Replay Buffer API integration. (#24473)
|
2022-05-10 20:36:14 +02:00 |
|
Sven Mika
|
1bc6419e0e
|
[RLlib] R2D2 training iteration fn AND switch off execution_plan API by default. (#24165)
|
2022-05-03 07:59:26 +02:00 |
|
Artur Niederfahrenhorst
|
e57ce7efd6
|
[RLlib] Replay Buffer API and Training Iteration Fn for DQN. (#23420)
|
2022-04-18 12:20:12 +02:00 |
|
Artur Niederfahrenhorst
|
9a64bd4e9b
|
[RLlib] Simple-Q uses training iteration fn (instead of execution_plan); ReplayBuffer API for Simple-Q (#22842)
|
2022-03-29 14:44:40 +02:00 |
|
Balaji Veeramani
|
31ed9e5d02
|
[CI] Replace YAPF disables with Black disables (#21982)
|
2022-02-08 16:29:25 -08:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
b10d5533be
|
[RLlib] Issue 20920 (partial solution): contrib/MADDPG + pettingzoo coop-pong-v4 not working. (#21452)
|
2022-01-10 11:19:40 +01:00 |
|
Sven Mika
|
3d2e27485b
|
[RLlib] Trainer sub-class DQN/SimpleQ/APEX-DQN/R2D2 (instead of using build_trainer ). (#20633)
|
2021-11-30 18:05:44 +01:00 |
|
Sven Mika
|
73f5c4039b
|
[RLlib] Fix flakey test_a3c, test_maml, test_apex_dqn. (#19035)
|
2021-10-04 13:23:51 +02:00 |
|
Sven Mika
|
9c9b482661
|
[RLlib] Allow n-step > 1 and prio. replay for R2D2 and RNNSAC. (#18939)
|
2021-09-29 21:31:34 +02:00 |
|
Sven Mika
|
4888d7c9af
|
[RLlib] Replay buffers: Add config option to store contents in checkpoints. (#17999)
|
2021-08-31 12:21:49 +02:00 |
|
Sven Mika
|
8000258333
|
[RLlib] R2D2 Implementation. (#13933)
|
2021-02-25 12:18:11 +01:00 |
|