Author | Commit | Message | Date
Sven Mika | 762cfbdff1 | [RLlib] IMPALA and APPO metrics fixes; remove deprecated async_parallel_requests utility. (#26117) | 2022-06-28 15:14:37 +02:00
Avnish Narayan | eaed256d68 | [RLlib] Async parallel execution manager. (#24423) | 2022-05-25 17:54:08 +02:00
Artur Niederfahrenhorst | fb2915d26a | [RLlib] Replay Buffer API and Ape-X. (#24506) | 2022-05-17 13:43:49 +02:00
Balaji Veeramani | 7f1bacc7dc | [CI] Format Python code with Black (#21975). See #21316 and #21311 for the motivation behind these changes. | 2022-01-29 18:41:57 -08:00
Sven Mika | ee41800c16 | [RLlib] Preparatory PR for multi-agent, multi-GPU learning agent (alpha-star style) #02. (#21649) | 2022-01-27 22:07:05 +01:00
Artur Niederfahrenhorst | d07e50e957 | [RLlib] Replay buffer API (cleanups; docstrings; renames; move into rllib/execution/buffers dir) (#20552) | 2021-11-19 11:57:37 +01:00
Sven Mika | 4888d7c9af | [RLlib] Replay buffers: Add config option to store contents in checkpoints. (#17999) | 2021-08-31 12:21:49 +02:00
Sven Mika | 7718ec70fb | [RLlib] Remove old SegmentTree from tests dir and unflake respective segment tree test. (#14450) | 2021-03-03 14:31:30 +01:00
Eric Liang | 8f79b4e45e | [rllib] Replay buffer size inaccurate with replay_seq_len option (#10988): support replay seq len; update; fix warn; add test; test | 2020-09-25 13:47:23 -07:00
Eric Liang | 34bae27ac7 | [rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893) | 2020-06-12 20:17:27 -07:00
Sven Mika | 2746fc0476 | [RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520) | 2020-05-27 16:19:13 +02:00