Sven Mika
|
4aea24c8a8
|
[RLlib] restart_failed_sub_environments now works for MA cases and crashes during reset() ; +more tests and logging; add eval worker sub-env fault tolerance test. (#26276)
|
2022-07-15 08:55:14 +02:00 |
|
Jun Gong
|
d83bbda281
|
[RLlib] Save serialized PolicySpec. Extract num_gpus related logics into a util function. (#25954)
|
2022-06-30 11:38:21 +02:00 |
|
Sven Mika
|
130b7eeaba
|
[RLlib] Trainer to Algorithm renaming. (#25539)
|
2022-06-11 15:10:39 +02:00 |
|
Artur Niederfahrenhorst
|
429d0f0eee
|
[RLlib] Fix multi agent environment checks for observations that contain only some agents' obs each step. (#25506)
|
2022-06-07 10:33:35 +02:00 |
|
Artur Niederfahrenhorst
|
5133978adc
|
[RLlib] PG policy subclassing conversion. (#25288)
|
2022-06-06 13:07:47 +02:00 |
|
Sven Mika
|
d95009a3ac
|
[RLlib] Vectorized envs: Gracefully handle sub-environments failing by restarting them (if configured so). (#24967)
|
2022-05-28 10:50:03 +02:00 |
|
Eric Liang
|
4963dfaae0
|
[api] Add API stability annotations for all RLlib symbols and add to LINT (#25060)
|
2022-05-24 22:14:25 -07:00 |
|
Noon van der Silk
|
38a028de2d
|
[RLlib] Don't add elements to _agent_ids during env pre-checking. (#24136)
|
2022-04-26 15:55:15 +02:00 |
|
Noon van der Silk
|
3589c21924
|
[RLlib] Fix some missing f-strings and a f-string related bug in tf eager policy. (#24148)
|
2022-04-25 11:25:28 +02:00 |
|
Avnish Narayan
|
3bf907bcf8
|
[RLlib] Don't modify environments via the env checker utilities. (#24083)
|
2022-04-22 18:39:47 +02:00 |
|
Sven Mika
|
a8494742a3
|
[RLlib] Memory leak finding toolset using tracemalloc + CI memory leak tests. (#15412)
|
2022-04-12 07:50:09 +02:00 |
|
Siyuan (Ryans) Zhuang
|
0c74ecad12
|
[Lint] Cleanup incorrectly formatted strings (Part 1: RLLib). (#23128)
|
2022-03-15 17:34:21 +01:00 |
|
Avnish Narayan
|
740def0a13
|
[RLlib] Put env-checker on critical path. (#22191)
|
2022-02-17 14:06:14 +01:00 |
|
Sven Mika
|
c17a44cdfa
|
Revert "Revert "[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learni…" (#22153)
|
2022-02-08 16:43:00 +01:00 |
|
SangBin Cho
|
a887763b38
|
Revert "[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learni… (#22105)
This reverts commit 3f03ef8ba8 .
|
2022-02-04 00:54:50 -08:00 |
|
Sven Mika
|
3f03ef8ba8
|
[RLlib] AlphaStar: Parallelized, multi-agent/multi-GPU learning via league-based self-play. (#21356)
|
2022-02-03 09:32:09 +01:00 |
|
Balaji Veeramani
|
7f1bacc7dc
|
[CI] Format Python code with Black (#21975)
See #21316 and #21311 for the motivation behind these changes.
|
2022-01-29 18:41:57 -08:00 |
|
Avnish Narayan
|
12b087acb8
|
[RLlib] Base env pre-checker. (#21569)
|
2022-01-18 16:34:06 +01:00 |
|
Avnish Narayan
|
c0f1202278
|
[RLlib] MultiAgentEnv pre-checker (#21476)
|
2022-01-13 11:31:22 +01:00 |
|
Sven Mika
|
f94bd99ce4
|
[RLlib] Issue 21044: Improve error message for "multiagent" dict checks. (#21448)
|
2022-01-11 19:50:03 +01:00 |
|