Sven Mika
|
130b7eeaba
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
|
Jun Gong
|
eaf9c941ae
|
[RLlib] Migrate PPO Impala and APPO policies to use sub-classing implementation. (#25117)
|
2022-05-25 14:38:03 +02:00 |
|
Eric Liang
|
4963dfaae0
|
[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)
|
2022-05-24 22:14:25 -07:00 |
|
Sven Mika
|
6551922c21
|
[RLlib] Fix AlphaStar for tf2+tracing; smaller cleanups around avoiding to wrap a TFPolicy as_eager() or with_tracing more than once. (#24271)
|
2022-04-28 13:43:21 +02:00 |
|
kourosh hakhamaneshi
|
c38a29573f
|
[RLlib] Removed deprecated code with error=True (#23916)
|
2022-04-15 13:51:12 +02:00 |
|
Max Pumperla
|
60054995e6
|
[docs] fix doctests and activate CI (#23418)
|
2022-03-24 17:04:02 -07:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Sven Mika
|
92f030331e
|
[RLlib] Initial code/comment cleanups in preparation for decentralized multi-agent learner. (#21420)
|
2022-01-10 11:22:55 +01:00 |
|
Sven Mika
|
9e6b871739
|
[RLlib] Better utils for flattening complex inputs and enable prev-actions for LSTM/attention for complex action spaces. (#21330)
|
2022-01-05 11:29:44 +01:00 |
|
Sven Mika
|
0b308719f8
|
[RLlib; Docs overhaul] Docstring cleanup: rllib/utils (#19829)
|
2021-11-01 21:46:02 +01:00 |
|