Avnish Narayan
|
ad87ddf93e
|
[rllib] Add deterministic test to gpu (#19306)
Co-authored-by: sven1977 <svenmika1977@gmail.com>
|
2021-10-26 10:11:39 -07:00 |
|
Sven Mika
|
fd438d5630
|
[RLlib] Issue 18104: Cannot set remote_worker_envs=True for non local-mode and MultiAgentEnv. (#19133)
|
2021-10-07 22:39:21 +02:00 |
|
Sven Mika
|
ac3371a148
|
[RLlib] Discussion 3644: Fix bug for complex obs spaces containing Box([2D shape]) and discrete component. (#18917)
|
2021-09-30 16:39:38 +02:00 |
|
Sven Mika
|
05a55a9335
|
[RLlib] Issue 18668: Unity3D env client/server example not working (fix + add to test cases). (#18942)
|
2021-09-30 08:30:20 +02:00 |
|
Sven Mika
|
9c9b482661
|
[RLlib] Allow n-step > 1 and prio. replay for R2D2 and RNNSAC. (#18939)
|
2021-09-29 21:31:34 +02:00 |
|
mvindiola1
|
62f5da0b65
|
[RLlib] Add unit tests for updating episode data in base_env (#17137)
|
2021-09-24 16:08:11 +02:00 |
|
Sven Mika
|
61a1274619
|
[RLlib] No Preprocessors (part 2). (#18468)
|
2021-09-23 12:56:45 +02:00 |
|
Sven Mika
|
a96dbd885b
|
[RLlib] Reinstate trajectory view API tests. (#18809)
|
2021-09-23 08:31:51 +02:00 |
|
Sven Mika
|
93208bb087
|
[RLlib] Increase size of (very flaky) action_masking example script test. (#18816)
|
2021-09-22 21:48:01 +02:00 |
|
Sven Mika
|
8a72824c63
|
[RLlib Testing] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591)
|
2021-09-15 22:16:48 +02:00 |
|
Sven Mika
|
c5d20849ae
|
[RLlib] Rename rllib rollout into rllib evaluate (backward compatible) to match Trainer API. (#18467)
|
2021-09-15 08:45:17 +02:00 |
|
Sven Mika
|
08c09737fa
|
[RLlib] Fix R2D2 (torch) multi-GPU issue. (#18550)
|
2021-09-14 19:58:10 +02:00 |
|
Ameer Haj Ali
|
e6807ecb43
|
Change tests owners for ml tests (#18417)
|
2021-09-14 01:04:52 -07:00 |
|
Sven Mika
|
ea4a22249c
|
[RLlib] Add simple action-masking example script/env/model (tf and torch). (#18494)
|
2021-09-11 23:08:09 +02:00 |
|
Sven Mika
|
8a066474d4
|
[RLlib] No Preprocessors; preparatory PR #1 (#18367)
|
2021-09-09 08:10:42 +02:00 |
|
Sven Mika
|
56f142cac1
|
[RLlib] Add support for evaluation_num_episodes=auto (run eval for as long as the parallel train step takes). (#18380)
|
2021-09-07 08:08:37 +02:00 |
|
Sven Mika
|
e3e6ed7aaa
|
[RLlib] Issues 17844, 18034: Fix n-step > 1 bug. (#18358)
|
2021-09-06 12:14:20 +02:00 |
|
Sven Mika
|
ba58f5edb1
|
[RLlib] Strictly run evaluation_num_episodes episodes each evaluation run (no matter the other eval config settings). (#18335)
|
2021-09-05 15:37:05 +02:00 |
|
Sven Mika
|
9a8ca6a69d
|
[RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306)
|
2021-09-03 13:29:57 +02:00 |
|
gjoliver
|
336e79956a
|
[RLlib] Make MultiAgentEnv inherit gym.Env to avoid direct class type manipulation (#18156)
|
2021-09-03 08:02:05 +02:00 |
|
Sven Mika
|
599e589481
|
[RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065)
|
2021-08-31 14:56:53 +02:00 |
|
simonsays1980
|
60aee4a330
|
[RLlib] Add example script for bare metal Policy with custom view_requirements. (#17896)
|
2021-08-20 12:17:13 +02:00 |
|
Sven Mika
|
8248ba531b
|
[RLlib] Redo #17410: Example script: Remote worker envs with inference done on main node. (#17960)
|
2021-08-20 08:02:18 +02:00 |
|
Alex Wu
|
318ba6fae0
|
Revert "[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410)" (#17951)
This reverts commit 8fc16b9a18.
|
2021-08-19 07:55:10 -07:00 |
|
Sven Mika
|
8fc16b9a18
|
[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410)
|
2021-08-19 12:14:50 +02:00 |
|
Simon Mo
|
b573864928
|
[CI] Add test owners (#17893)
|
2021-08-18 18:38:31 -07:00 |
|
Kai Fricke
|
bf3eaa9264
|
[RLlib] Dreamer fixes and reinstate Dreamer test. (#17821)
Co-authored-by: sven1977 <svenmika1977@gmail.com>
|
2021-08-18 18:47:08 +02:00 |
|
Sven Mika
|
a428f10ebe
|
[RLlib] Add multi-GPU learning tests to nightly. (#17778)
|
2021-08-18 17:21:01 +02:00 |
|
Sven Mika
|
f18213712f
|
[RLlib] Redo: "fix self play example scripts" PR (17566) (#17895)
* wip.
|
2021-08-17 09:13:35 -07:00 |
|
Sven Mika
|
f3bbe4ea44
|
[RLlib] Test cases/BUILD cleanup; split "everything else" (longest running one rn) tests in 2. (#17640)
|
2021-08-16 22:01:01 +02:00 |
|
Sven Mika
|
2bd2ee7a73
|
[RLlib] SampleBatch: Docstring- and API cleanups; Add support for nested data. (#17485)
|
2021-08-16 06:08:14 +02:00 |
|
Sven Mika
|
8a844ff840
|
[RLlib] Issues: 17397, 17425, 16715, 17174. When on driver, Torch|TFPolicy should not use ray.get_gpu_ids() (b/c no GPUs assigned by ray). (#17444)
|
2021-08-02 17:29:59 -04:00 |
|
kk-55
|
a7f8dc9d77
|
[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664)
|
2021-07-28 15:25:09 -04:00 |
|
Sven Mika
|
0d8fce8fd8
|
[RLlib] Discussion 2294: Custom vector env example and fix. (#16083)
|
2021-07-28 10:40:04 -04:00 |
|
Sven Mika
|
90b21ce27e
|
[RLlib] De-flake 3 test cases; Fix config.simple_optimizer and SampleBatch.is_training warnings. (#17321)
|
2021-07-27 14:39:06 -04:00 |
|
Sven Mika
|
0c5c70b584
|
[RLlib] Discussion 247: Allow remote sub-envs (within vectorized) to be used with custom APIs. (#17118)
|
2021-07-25 16:55:51 -04:00 |
|
ddworak94
|
fba8461663
|
[RLlib] Add RNN-SAC agent (#16577)
Shoutout to @ddworak94 :)
|
2021-07-25 10:04:52 -04:00 |
|
Sven Mika
|
7bc4376466
|
[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077)
|
2021-07-22 10:59:13 -04:00 |
|
Sven Mika
|
5a313ba3d6
|
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169)
|
2021-07-20 14:58:13 -04:00 |
|
Sven Mika
|
e0640ad0dc
|
[RLlib] Fix seeding for ES and ARS. (#16744)
|
2021-07-19 13:13:05 -04:00 |
|
Sven Mika
|
649580d735
|
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046)
|
2021-07-15 05:51:24 -04:00 |
|
Amog Kamsetty
|
38b5b6d24c
|
Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565)" (#17036)
This reverts commit e4123fff27.
|
2021-07-13 09:57:15 -07:00 |
|
Kai Fricke
|
27d80c4c88
|
[RLlib] ONNX export for tensorflow (1.x) and torch (#16805)
|
2021-07-13 12:38:11 -04:00 |
|
Sven Mika
|
e4123fff27
|
[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565)
|
2021-07-13 06:38:14 -04:00 |
|
Amog Kamsetty
|
df3dd81348
|
[rllib] skip highly flaky tests (#17010)
|
2021-07-12 11:18:28 -07:00 |
|
Kai Fricke
|
10fd7111b3
|
[rllib] Improve test learning check, fix flaky two step qmix (#16843)
|
2021-07-06 19:39:12 +01:00 |
|
Sven Mika
|
7eb1a29426
|
[RLlib] Fix ModelV2 custom metrics for torch. (#16734)
|
2021-07-01 13:01:40 +02:00 |
|
Sven Mika
|
53206dd440
|
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531)
|
2021-06-30 12:32:11 +02:00 |
|
Amog Kamsetty
|
abd16a8438
|
[RLlib] Skip two_step_game_qmix test (#16758)
|
2021-06-29 14:27:48 -07:00 |
|
Amog Kamsetty
|
be1f6d59fa
|
[CI] Re-try Tag rllib flaky tests (#16680)
|
2021-06-28 18:42:54 +02:00 |
|