Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Jun Gong
|
2317c693cf
|
[RLlib] Use SampleBrach instead of input dict whenever possible (#20746)
|
2021-12-02 13:11:26 +01:00 |
|
Sven Mika
|
cf21c634a3
|
[RLlib] Fix deprecated warning for torch_ops.py (soft-replaced by torch_utils.py). (#19982)
|
2021-11-03 10:00:46 +01:00 |
|
Sven Mika
|
b4300dd532
|
[RLlib] Issue 18812: Torch multi-GPU stats not protected against race conditions. (#18937)
|
2021-10-04 13:29:00 +02:00 |
|
Sven Mika
|
9c9b482661
|
[RLlib] Allow n-step > 1 and prio. replay for R2D2 and RNNSAC. (#18939)
|
2021-09-29 21:31:34 +02:00 |
|
Sven Mika
|
494ddd98c1
|
[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928)
|
2021-08-21 17:05:48 +02:00 |
|
Sven Mika
|
a428f10ebe
|
[RLlib] Add multi-GPU learning tests to nightly. (#17778)
|
2021-08-18 17:21:01 +02:00 |
|
ddworak94
|
fba8461663
|
[RLlib] Add RNN-SAC agent (#16577)
Shoutout to @ddworak94 :)
|
2021-07-25 10:04:52 -04:00 |
|