Sven Mika
|
6551922c21
|
[RLlib] Fix AlphaStar for tf2+tracing; smaller cleanups around avoiding to wrap a TFPolicy as_eager() or with_tracing more than once. (#24271)
|
2022-04-28 13:43:21 +02:00 |
|
Sven Mika
|
29388fb25b
|
[RLlib] Reinstate flakey AlphaStar learning CI test (flakey due to 2 changed, bad config default values). (#24256)
|
2022-04-27 14:01:52 +02:00 |
|
Avnish Narayan
|
477b9d22d2
|
[RLlib][Training iteration fn] APEX conversion (#22937)
|
2022-04-20 17:56:18 +02:00 |
|
Jiajun Yao
|
5f37231842
|
Remove yapf dependency (#23656)
Yapf has been replaced by black.
|
2022-04-04 21:50:04 -07:00 |
|
Sven Mika
|
0bb82f29b6
|
[RLlib] AlphaStar polishing (fix logger.info bug). (#22281)
|
2022-04-01 09:49:41 +02:00 |
|
Sven Mika
|
c17a44cdfa
|
Revert "Revert "[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learni…" (#22153)
|
2022-02-08 16:43:00 +01:00 |
|
SangBin Cho
|
a887763b38
|
Revert "[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learni… (#22105)
This reverts commit 3f03ef8ba8 .
|
2022-02-04 00:54:50 -08:00 |
|
Sven Mika
|
3f03ef8ba8
|
[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learning via league-based self-play. (#21356)
|
2022-02-03 09:32:09 +01:00 |
|