hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 02:21:39 -05:00

Author	SHA1	Message	Date
Rohan138	b9c9cc5946	[RLlib] Updated PettingZoo+RLlib tutorial; Removed pettingzoo example script (#19069 ) * Updated PettingZoo+RLlib tutorial Updated the tutorial and added link to the blog post by the PettingZoo team. * Ran linting * Converted link to tinyurl for linting * fixed line lengths * Decrease num_workers to 1 * Added comments * Decreased num_workers * Decreased timesteps * Increased num_workers * Update links and remove pettingzoo_env.py * remove pettingzoo.py script from tests Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-10-29 10:57:10 +02:00
Sven Mika	902e854af2	[RLlib; Docs overhaul] Docstring cleanup: Environments. (#19784 ) * wip. * Test: Make a change in tune to trigger tune tests, which are not run otherwise, but seem to fail nevertheless with this PR's changes. * remove bare_metal_policy_with_custom_view_reqs from tests	2021-10-29 10:46:52 +02:00
gjoliver	d81885c1f1	[RLlib] Fix all the CI tests that were broken by is_training and replay buffer changes; re-comment-in the failing RLlib tests (#19809 ) * Fix DDPG, since it is based on GenericOffPolicyTrainer. * Fix QMix, SAC, and MADDPA too. * Undo QMix change. * Fix DQN input batch type. Always use SampleBatch. * apex ddpg should not use replay_buffer_config yet. * Make eager tf policy to use SampleBatch. * lint * LINT. * Re-enable RLlib broken tests to make sure things work ok now. * fixes. Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-10-28 18:06:47 +02:00
Simon Mo	5e927b01ad	Revert "[CI] Remove config that disables Bazel test result cache" (#19818 ) * Revert "[CI] Remove config that disables Bazel test result cache (#18701)" This reverts commit `098ff36faa`. * Remove all RLlib tests from BUILD that currently fail. Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-10-28 15:54:53 +02:00
Avnish Narayan	ad87ddf93e	[rllib] Add deterministic test to gpu (#19306 ) Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-10-26 10:11:39 -07:00
Sven Mika	fd438d5630	[RLlib] Issue 18104: Cannot set remote_worker_envs=True for non local-mode and MultiAgentEnv. (#19133 )	2021-10-07 22:39:21 +02:00
Sven Mika	ac3371a148	[RLlib] Discussion 3644: Fix bug for complex obs spaces containing `Box([2D shape])` and discrete component. (#18917 )	2021-09-30 16:39:38 +02:00
Sven Mika	05a55a9335	[RLlib] Issue 18668: Unity3D env client/server example not working (fix + add to test cases). (#18942 )	2021-09-30 08:30:20 +02:00
Sven Mika	9c9b482661	[RLlib] Allow n-step > 1 and prio. replay for R2D2 and RNNSAC. (#18939 )	2021-09-29 21:31:34 +02:00
mvindiola1	62f5da0b65	[RLlib] Add unit tests for updating episode data in base_env (#17137 )	2021-09-24 16:08:11 +02:00
Sven Mika	61a1274619	[RLlib] No Preprocessors (part 2). (#18468 )	2021-09-23 12:56:45 +02:00
Sven Mika	a96dbd885b	[RLlib] Reinstate trajectory view API tests. (#18809 )	2021-09-23 08:31:51 +02:00
Sven Mika	93208bb087	[RLlib] Increase size of (very flakey) action_masking example script test. (#18816 )	2021-09-22 21:48:01 +02:00
Sven Mika	8a72824c63	[RLlib Testig] Split and unflake more CI tests (make sure all jobs are < 30min). (#18591 )	2021-09-15 22:16:48 +02:00
Sven Mika	c5d20849ae	[RLlib] Rename `rllib rollout` into `rllib evaluate` (backward compatible) to match Trainer API. (#18467 )	2021-09-15 08:45:17 +02:00
Sven Mika	08c09737fa	[RLlib] Fix R2D2 (torch) multi-GPU issue. (#18550 )	2021-09-14 19:58:10 +02:00
Ameer Haj Ali	e6807ecb43	Change tests owners for ml tests (#18417 )	2021-09-14 01:04:52 -07:00
Sven Mika	ea4a22249c	[RLlib] Add simple action-masking example script/env/model (tf and torch). (#18494 )	2021-09-11 23:08:09 +02:00
Sven Mika	8a066474d4	[RLlib] No Preprocessors; preparatory PR #1 (#18367 )	2021-09-09 08:10:42 +02:00
Sven Mika	56f142cac1	[RLlib] Add support for evaluation_num_episodes=auto (run eval for as long as the parallel train step takes). (#18380 )	2021-09-07 08:08:37 +02:00
Sven Mika	e3e6ed7aaa	[RLlib] Issues 17844, 18034: Fix n-step > 1 bug. (#18358 )	2021-09-06 12:14:20 +02:00
Sven Mika	ba58f5edb1	[RLlib] Strictly run `evaluation_num_episodes` episodes each evaluation run (no matter the other eval config settings). (#18335 )	2021-09-05 15:37:05 +02:00
Sven Mika	9a8ca6a69d	[RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306 )	2021-09-03 13:29:57 +02:00
gjoliver	336e79956a	[RLlib] Make MultiAgentEnv inherit gym.Env to avoid direct class type manipulation (#18156 )	2021-09-03 08:02:05 +02:00
Sven Mika	599e589481	[RLlib] Move existing fake multi-GPU learning tests into separate buildkite job. (#18065 )	2021-08-31 14:56:53 +02:00
simonsays1980	60aee4a330	[RLlib] Add example script for bare metal Policy with custom `view_requirements`. (#17896 )	2021-08-20 12:17:13 +02:00
Sven Mika	8248ba531b	[RLlib] Redo #17410 : Example script: Remote worker envs with inference done on main node. (#17960 )	2021-08-20 08:02:18 +02:00
Alex Wu	318ba6fae0	Revert "[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410 )" (#17951 ) This reverts commit `8fc16b9a18`.	2021-08-19 07:55:10 -07:00
Sven Mika	8fc16b9a18	[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. (#17410 )	2021-08-19 12:14:50 +02:00
Simon Mo	b573864928	[CI] Add test owners (#17893 )	2021-08-18 18:38:31 -07:00
Kai Fricke	bf3eaa9264	[RLlib] Dreamer fixes and reinstate Dreamer test. (#17821 ) Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-08-18 18:47:08 +02:00
Sven Mika	a428f10ebe	[RLlib] Add multi-GPU learning tests to nightly. (#17778 )	2021-08-18 17:21:01 +02:00
Sven Mika	f18213712f	[RLlib] Redo: "fix self play example scripts" PR (17566) (#17895 ) * wip. * wip. * wip. * wip. * wip. * wip. * wip. * wip. * wip.	2021-08-17 09:13:35 -07:00
Sven Mika	f3bbe4ea44	[RLlib] Test cases/BUILD cleanup; split "everything else" (longest running one rn) tests in 2. (#17640 )	2021-08-16 22:01:01 +02:00
Sven Mika	2bd2ee7a73	[RLlib] SampleBatch: Docstring- and API cleanups; Add support for nested data. (#17485 )	2021-08-16 06:08:14 +02:00
Sven Mika	8a844ff840	[RLlib] Issues: 17397, 17425, 16715, 17174. When on driver, Torch\|TFPolicy should not use `ray.get_gpu_ids()` (b/c no GPUs assigned by ray). (#17444 )	2021-08-02 17:29:59 -04:00
kk-55	a7f8dc9d77	[RLlib] New and changed version of parametric actions cartpole example + small suggested update in policy_client.py (#15664 )	2021-07-28 15:25:09 -04:00
Sven Mika	0d8fce8fd8	[RLlib] Discussion 2294: Custom vector env example and fix. (#16083 )	2021-07-28 10:40:04 -04:00
Sven Mika	90b21ce27e	[RLlib] De-flake 3 test cases; Fix `config.simple_optimizer` and `SampleBatch.is_training` warnings. (#17321 )	2021-07-27 14:39:06 -04:00
Sven Mika	0c5c70b584	[RLlib] Discussion 247: Allow remote sub-envs (within vectorized) to be used with custom APIs. (#17118 )	2021-07-25 16:55:51 -04:00
ddworak94	fba8461663	[RLlib] Add RNN-SAC agent (#16577 ) Shoutout to @ddworak94 :)	2021-07-25 10:04:52 -04:00
Sven Mika	7bc4376466	[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077 )	2021-07-22 10:59:13 -04:00
Sven Mika	5a313ba3d6	[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169 )	2021-07-20 14:58:13 -04:00
Sven Mika	e0640ad0dc	[RLlib] Fix seeding for ES and ARS. (#16744 )	2021-07-19 13:13:05 -04:00
Sven Mika	649580d735	[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046 )	2021-07-15 05:51:24 -04:00
Amog Kamsetty	38b5b6d24c	Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565 )" (#17036 ) This reverts commit `e4123fff27`.	2021-07-13 09:57:15 -07:00
Kai Fricke	27d80c4c88	[RLlib] ONNX export for tensorflow (1.x) and torch (#16805 )	2021-07-13 12:38:11 -04:00
Sven Mika	e4123fff27	[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565 )	2021-07-13 06:38:14 -04:00
Amog Kamsetty	df3dd81348	[rllib] skip highly flaky tests (#17010 )	2021-07-12 11:18:28 -07:00
Kai Fricke	10fd7111b3	[rllib] Improve test learning check, fix flaky two step qmix (#16843 )	2021-07-06 19:39:12 +01:00

221 commits