While working on https://github.com/ray-project/ray/pull/20577 we noticed that the `requests` module is not blacklisted in the minimal install test, though it's not clear why. As a result we missed coverage of a P0 issue like https://github.com/ray-project/ray/issues/20574.
This is an attempt to see what would happen if we blacklist it and if we're able to get any signals from CI.
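For illustration, here is a minimal sketch of what such a blacklist check could look like (the test below is hypothetical, not Ray's actual minimal-install harness):

```python
# Hypothetical sketch of a minimal-install blacklist check.
import importlib.util

import pytest

# Modules that must NOT be importable in a minimal Ray install.
BLACKLIST = ["requests"]


@pytest.mark.parametrize("module_name", BLACKLIST)
def test_blacklisted_module_absent(module_name):
    # importlib.util.find_spec returns None when a module is not installed.
    assert importlib.util.find_spec(module_name) is None, (
        f"{module_name} should not be importable in a minimal install"
    )
```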
Co-authored-by: Jiao Dong <jiaodong@anyscale.com>
Co-authored-by: Kai Fricke <kai@anyscale.com>
Linkcheck is inherently flaky, so separate it from the normal LINT build, which is never flaky. This also separates out the verbose linkcheck logs, making the LINT output easier to read.
In https://github.com/ray-project/ray/blob/ray-1.11.0/docker/ray-ml/Dockerfile, the order of pip install commands currently matters (potentially a lot). It would be good to run one big pip install command to avoid ending up with a broken env.
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
We sometimes end up with stale wheel uploads from previous runs of a Buildkite agent. The result is that commit wheels get overwritten by artifacts from old build jobs, effectively breaking the wheel build logic.
Example:
This Agent: https://buildkite.com/organizations/ray-project/agents/4b955117-2f6c-4849-b703-3457daf69f89
- builds wheels (in post-wheels tests) for a35ebc945b
- and then runs both the Ray CPP worker and the Train + Tune tests in 6746e9f
- Usually these two test jobs shouldn't upload artifacts at all, but they do - and the artifacts are the wheels from a35ebc945b, i.e. uncleaned leftovers from the first build task.
- See here for proof of artifact upload: https://buildkite.com/ray-project/ray-builders-pr/builds/27622#d11bc514-ebd8-4e0c-a2ce-826b9bad27de
The solution is thus to always clean up the artifacts directory on the worker, i.e. `rm -rf /artifact-mount/*`.
This PR adds two such cleanup instructions: one before commands are run and one after artifacts are uploaded. Either alone would probably suffice, but it doesn't hurt to have both.
Parameter/argument merge conflicts are really annoying to deal with. This is especially frustrating when we merge code from the community into Ant's internal code base, where hundreds of conflicts are caused by parameters/arguments.
In this PR, I updated the clang-format style so that parameters/arguments are placed on separate lines when they can't fit on a single line (see the sketch after the list below).
There are several benefits:
* Resolving conflicts is easier.
* Fewer potential human mistakes when resolving conflicts.
* Git history and Git blame are more straightforward.
* Better readability.
* Align with the new Python format style.
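As an illustration of the last bullet, the new C++ layout mirrors how the Python formatter already breaks long signatures, roughly like this (a contrived Python sketch; the names are made up):

```python
# Contrived example: once a signature no longer fits on one line,
# each parameter gets its own line, so adding or removing a
# parameter changes exactly one line in the diff.
def create_worker(
    node_id: str,
    job_id: str,
    runtime_env: dict,
    num_cpus: int = 1,
    num_gpus: int = 0,
):
    ...
```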
Horovod updated the attributes of DistributedTrainableCreator and the args used to create the Horovod RayExecutor.
horovod/horovod@a729ba7
The major change is that Horovod deprecated the "slot" concept in favor of "worker", which is more consistent with the generic Ray worker. This issue is currently blocking Uber's DL trainers from using Ray Tune.
This commit updates the Horovod RayExecutor init args.
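Roughly, the migration looks like the following sketch (parameter names are indicative only; the linked Horovod commit is authoritative):

```python
# Indicative sketch only; the linked Horovod commit defines the real API.
from horovod.ray import RayExecutor

settings = RayExecutor.create_settings(timeout_s=30)

# Before: slot-based sizing (now deprecated in Horovod).
# executor = RayExecutor(settings, num_hosts=2, num_slots=4, use_gpu=True)

# After: worker-based sizing, consistent with the generic Ray worker.
executor = RayExecutor(settings, num_workers=8, use_gpu=True)
executor.start()
```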
Co-authored-by: Kai Fricke <kai@anyscale.com>
This PR splits up the changes in #22393 and introduces an implementation of the ML Checkpoint interface used by Ray Tune.
This means the TuneCheckpoint class implements the to/from_[bytes|dict|directory|object_ref|uri] conversion functions, as well as higher-level functions to transition between the different TuneCheckpoint classes. It also includes test cases for Tune's main conversion modes, i.e. dict - intermediate - dict and fs - intermediate - fs.
These changes will be the basis for refactoring the tune interface to use TuneCheckpoint objects instead of TrialCheckpoints (externally) and instead of paths/objects (internally).
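A usage sketch of those conversion functions (the import path is assumed for illustration):

```python
# Illustrative round trips through the intermediate representation.
from ray.tune.checkpoint import TuneCheckpoint  # import path assumed

# dict -> intermediate -> dict
checkpoint = TuneCheckpoint.from_dict({"iteration": 3, "weights": [0.1, 0.2]})
restored = checkpoint.to_dict()

# fs -> intermediate -> fs
checkpoint = TuneCheckpoint.from_directory("/tmp/ckpt_in")
out_path = checkpoint.to_directory("/tmp/ckpt_out")
```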
Follow-up to #22748, enabling tests in CI.
Conditions: A new RAY_CI_ML_AFFECTED condition is added for this test suite. The package currently depends on Ray Data, so the suite will also be triggered when Ray Data is affected.
Dependencies: Adding DATA_PROCESSING_TESTING dependencies (set for install-dependencies.sh) for now.
The reason for not using `queue.Queue` for multiprocessing purposes on Windows is given at https://stackoverflow.com/a/37244276 and in the second reply to https://stackoverflow.com/a/37245300.
The reason for using `multiprocessing.JoinableQueue` over `multiprocessing.Queue` is https://stackoverflow.com/a/30725121.
AFAIK, this is because on Windows each process gets its own `Queue`, and hence nothing is shared among those processes. When `multiprocessing.Queue` is used, changes to it are shared internally via pipes, along with proper locks.
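For reference, a minimal sketch of the `multiprocessing.JoinableQueue` pattern (illustrative, not the PR's actual code):

```python
import multiprocessing as mp


def worker(queue):
    while True:
        item = queue.get()
        try:
            print("processing", item)
        finally:
            queue.task_done()  # lets queue.join() unblock


if __name__ == "__main__":  # guard is required on Windows (spawn)
    queue = mp.JoinableQueue()  # shared across processes via pipes + locks
    mp.Process(target=worker, args=(queue,), daemon=True).start()
    for i in range(3):
        queue.put(i)
    queue.join()  # blocks until every item was marked task_done()
```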
This is the second part of https://docs.google.com/document/d/12qP3x5uaqZSKS-A_kK0ylPOp0E02_l-deAbmm8YtdFw/edit#. After this PR, dashboard agents will fully work with minimal ray installation.
Note that this PR requires introducing "aioredis", "frozenlist", and "aiosignal" to the minimal installation. These dependencies are very small (or will be removed soon), and including them in the minimal installation makes things much easier. Please see below for the reasoning.
This PR moves the sdk to its own folder, then includes everything in `import ray.autoscaler.sdk` in ray's import path.
Note that there were circular dependencies in naively doing this, because ray core now uses constants that were defined in the autoscaler for internal kv operations (and the autoscaler similarly calls into ray core). The solution was to move those internal kv keys into ray core constants, so the imports flow (more) one way, as sketched below.
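The shape of the fix, sketched with hypothetical module names (not Ray's actual layout):

```python
# Before (cycle):
#   ray core       -> imports KV key constants from ray.autoscaler
#   ray.autoscaler -> imports internal_kv functions from ray core
#
# After: the KV key constants live in a core-level module that both
# sides import, so the dependency flows (more) one way.

# ray/_private/constants.py  (hypothetical location)
AUTOSCALER_KV_KEY = b"__autoscaler_state"

# Both ray core and ray.autoscaler then do:
# from ray._private.constants import AUTOSCALER_KV_KEY
```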
Co-authored-by: Alex Wu <alex@anyscale.com>
Resubmitting #21705, which was merged and then reverted. It seems the Sphinx build somehow broke in the meantime; it's not clear how that is connected to this PR.
Here is the original description:
>Part of the effort to enable tests on windows, this enables test_metrics and test_metric_agents, which pass locally.
See #21458. Currently, Tune keeps its own list of alive node IPs, but this information is only updated every 10 seconds and is usually stale when a new node is added. Because of this, the first trial scheduled on this node is usually marked as failed. This PR adds a test confirming this behavior and gets rid of the unneeded code path.
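For reference, alive node IPs can instead be derived on demand from Ray's own node table rather than cached, e.g. (illustrative):

```python
import ray

ray.init(address="auto")  # connect to the running cluster

# Derive the currently alive node IPs on demand instead of caching them.
alive_node_ips = {
    node["NodeManagerAddress"] for node in ray.nodes() if node["Alive"]
}
```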
Co-authored-by: Xiaowei Jiang <xwjiang2010@gmail.com>
Co-authored-by: xwjiang2010 <87673679+xwjiang2010@users.noreply.github.com>
External Redis should still be supported with GCS bootstrapping, to avoid breaking users.
In GCS mode, some logic is removed for external Redis:
- Printing external Redis addresses to terminal: hard to implement across `ray start`, `ray.init()` and Ray cluster util.
- Starting local Redis if external Redis is unavailable: failing loudly here seems more appropriate.
Also, re-enable a few tests which restart GCS in GCS bootstrapping mode, by using external Redis for KV storage.
Currently we install OpenSSH on the fly in fake multinode docker testing. Instead, we can speed up testing a fair bit by first building a Docker image that includes OpenSSH and then running the tests with this image.
Following #18987, this PR adds a docker-compose based local multi-node cluster.
The fake multinode docker setup comprises two parts. The docker_monitor.py script is a watch script that calls `docker compose up` whenever the docker-compose.yaml changes. The node provider creates and updates the docker-compose.yaml according to the autoscaling requirements.
This mode fully supports autoscaling and comes with test utilities to start and connect to docker-compose autoscaling environments. There's also a sample test case showing how this can be used.
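A stripped-down sketch of the watch-script idea (the real docker_monitor.py does more; paths and poll interval here are placeholders):

```python
# Minimal watch loop: re-run `docker compose up` whenever the compose
# file changes. Paths and poll interval are placeholders.
import os
import subprocess
import time

COMPOSE_FILE = "/tmp/fake_cluster/docker-compose.yaml"

last_mtime = 0.0
while True:
    if os.path.exists(COMPOSE_FILE):
        mtime = os.path.getmtime(COMPOSE_FILE)
        if mtime > last_mtime:
            last_mtime = mtime
            subprocess.run(
                ["docker", "compose", "-f", COMPOSE_FILE, "up", "-d"],
                check=False,  # keep watching even if compose fails
            )
    time.sleep(1.0)
```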
After enabling the test_runtime_env_plugin and test_runtime_env_env_vars tests (PR #21252) and the python/ray/serve:* tests (PR #21107), the analysis at flaky-tests.ray.io started showing failing tests in windows://python/ray/serve:test_standalone. PR #21352 reverted #21252 (the runtime_env tests), but the problem was more likely in the serve tests. Specifically, `test_standalone` has a test that uses Cluster, which should be skipped on Windows because it is flaky. So this PR:
- re-enables the runtime_env tests for Windows
- skips the Cluster test in serve/tests/test_standalone.py
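The skip itself follows the standard pytest pattern, roughly (illustrative, not the exact diff):

```python
import sys

import pytest


@pytest.mark.skipif(
    sys.platform == "win32",
    reason="Cluster-based test is flaky on Windows.",
)
def test_standalone_with_cluster():
    ...
```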
This will start repro docker containers with the SYS_PTRACE capability to enable debugging, e.g. via py-spy.
Additionally, default instance name tags for instance re-use will be generated using the Buildkite build ID and job ID.