Simon Mo
|
9f23affdc0
|
[Hotfix] Unbreak lint in master (#24794)
|
2022-05-13 15:05:05 -07:00 |
|
kourosh hakhamaneshi
|
ffcbb30552
|
[RLlib] Move from agents to algorithms - CQL, MARWIL, AlphaStar, MAML, Dreamer, MBMPO. (#24739)
|
2022-05-13 18:43:36 +02:00 |
|
Sven Mika
|
6d94b2acbe
|
[RLlib] AlphaStar config objects. (#24576)
|
2022-05-10 14:01:00 +02:00 |
|
Sven Mika
|
1bc6419e0e
|
[RLlib] R2D2 training iteration fn AND switch off execution_plan API by default. (#24165)
|
2022-05-03 07:59:26 +02:00 |
|
Sven Mika
|
6551922c21
|
[RLlib] Fix AlphaStar for tf2+tracing; smaller cleanups around avoiding to wrap a TFPolicy as_eager() or with_tracing more than once. (#24271)
|
2022-04-28 13:43:21 +02:00 |
|
Sven Mika
|
29388fb25b
|
[RLlib] Reinstate flakey AlphaStar learning CI test (flakey due to 2 changed, bad config default values). (#24256)
|
2022-04-27 14:01:52 +02:00 |
|
Avnish Narayan
|
477b9d22d2
|
[RLlib][Training iteration fn] APEX conversion (#22937)
|
2022-04-20 17:56:18 +02:00 |
|
Jiajun Yao
|
5f37231842
|
Remove yapf dependency (#23656)
Yapf has been replaced by black.
|
2022-04-04 21:50:04 -07:00 |
|
Sven Mika
|
0bb82f29b6
|
[RLlib] AlphaStar polishing (fix logger.info bug). (#22281)
|
2022-04-01 09:49:41 +02:00 |
|
Sven Mika
|
c17a44cdfa
|
Revert "Revert "[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learni…" (#22153)
|
2022-02-08 16:43:00 +01:00 |
|
SangBin Cho
|
a887763b38
|
Revert "[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learni… (#22105)
This reverts commit 3f03ef8ba8 .
|
2022-02-04 00:54:50 -08:00 |
|
Sven Mika
|
3f03ef8ba8
|
[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learning via league-based self-play. (#21356)
|
2022-02-03 09:32:09 +01:00 |
|