Sven Mika | de9e143938 | [RLlib] Issue 23907: SampleBatch.shuffle does not flush intercepted_values dict (which it should). (#24005) | 2022-04-19 17:55:59 +02:00
Artur Niederfahrenhorst | c0ade5f0b7 | [RLlib] Issue 22625: MultiAgentBatch.timeslices() does not behave as expected. (#22657) | 2022-03-08 14:25:48 +01:00
Balaji Veeramani | 7f1bacc7dc | [CI] Format Python code with Black (#21975) See #21316 and #21311 for the motivation behind these changes. | 2022-01-29 18:41:57 -08:00
mvindiola1 | eadc7669c5 | [RLlib] SampleBatch.concat_samples fix incorrect max_seq_len calculation (#20704) | 2021-11-29 12:01:40 +01:00
Sven Mika | 08c09737fa | [RLlib] Fix R2D2 (torch) multi-GPU issue. (#18550) | 2021-09-14 19:58:10 +02:00
Sven Mika | 494ddd98c1 | [RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928) | 2021-08-21 17:05:48 +02:00
Sven Mika | 2bd2ee7a73 | [RLlib] SampleBatch: Docstring- and API cleanups; Add support for nested data. (#17485) | 2021-08-16 06:08:14 +02:00
Sven Mika | e973b726c2 | [RLlib] Support native tf.keras.Models (part 2) - Default keras models for Vision/RNN/Attention. (#15273) | 2021-04-30 19:26:30 +02:00
Sven Mika | bb8a286cbc | [RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) | 2021-04-27 10:44:54 +02:00
Sven Mika | 69202c6a7d | [RLlib] Obsolete usage tracking dict via sample batch. (#13065) | 2021-03-17 08:18:15 +01:00