hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-05 18:11:42 -05:00

Author	SHA1	Message	Date
xwjiang2010	f77ec350fa	[release test] remove dask/modin_xgboost test completely. (#27865 ) The original script was removed in https://github.com/ray-project/ray/pull/27816 This is just to clean up some remainings. Signed-off-by: xwjiang2010 <xwjiang2010@gmail.com>	2022-08-15 16:55:33 +02:00
xwjiang2010	eb69c1ca28	[air] Add annotation for Tune module. (#27060 ) Co-authored-by: Kai Fricke <kai@anyscale.com>	2022-07-27 13:53:46 -07:00
Amog Kamsetty	862d10c162	[AIR] Remove ML code from `ray.util` (#27005 ) Removes all ML related code from `ray.util` Removes: - `ray.util.xgboost` - `ray.util.lightgbm` - `ray.util.horovod` - `ray.util.ray_lightning` Moves `ray.util.ml_utils` to other locations Closes #23900 Signed-off-by: Amog Kamsetty <amogkamsetty@yahoo.com> Signed-off-by: Kai Fricke <kai@anyscale.com> Co-authored-by: Kai Fricke <kai@anyscale.com>	2022-07-27 14:24:19 +01:00
Kai Fricke	753f5feaf4	[tune] Remove TrialCheckpoint class (#25406 ) The old user-facing TrialCheckpoint class has been deprecated in favor of `ray.ml.Checkpoint` and will be removed with this PR. The main change in this PR is to delete the old `TrialCheckpoint` class and replace remaining API calls (e.g. `checkpoint.local_path`) with the correct AIR equivalents. One issue that comes up is that with Ray client usage, checkpoint directories are not available on the local node (the client). Thus, we can't construct `Checkpoint` objects easily. (Previously, the TrialCheckpoint object held a reference to the location, even if it is not locally available). There are ongoing discussions on how to resolve this in the future. For now, we print an error when such a checkpoint is requested. Depends on #25805 Signed-off-by: Kai Fricke <kai@anyscale.com>	2022-07-11 20:08:10 +01:00
Kai Fricke	45bf925ef0	[train/serve] Fix torch tune serve test (#25547 ) #24772 broke the smoke test as it was not run on CI - this PR hotfixes this	2022-06-07 15:54:37 +01:00
Kai Fricke	1ed8bd0345	[release/xgboost/lightgbm] Fix app config dependency install overwriting ray (#25307 ) This line: ``` pip3 install -U --force-reinstall xgboost xgboost_ray lightgbm_ray petastorm ``` also re-installs the dependencies of these packages, and the `--force-reinstall` means we overwrite existing ones. This leads us to re-install the latest ray release, overwriting the wheels to be tested: ``` [INFO] 5/31/2022, 12:12:16 AM: Successfully installed ... ray-1.12.1 ... [INFO] 5/31/2022, 12:12:17 AM: * Executed RUN pip3 install -U --force-reinstall xgboost xgboost_ray petastorm (ff6ae9f9) ``` Instead, we should use `--no-deps` to avoid re-installing dependencies. Also, the wheels sanity check is moved to after installing additional packages in order to catch these errors earlier.	2022-05-31 13:46:17 +02:00
SangBin Cho	ec653e3196	[Nightly test] Move two line downloads to one line. (#25061 ) It fixes the mysterious error when all cluster env build is failing when pip uninstall / pip install is written in 2 lines. The root cause will be fixed later	2022-05-22 00:07:03 -07:00
Kai Fricke	6c5229295e	[ci/release] Support running tests with different python versions (#24843 ) OSS release tests currently run with hardcoded Python 3.7 base. In the future we will want to run tests on different python versions. This PR adds support for a new `python` field in the test configuration. The python field will determine both the base image used in the Buildkite runner docker container (for Ray client compatibility) and the base image for the Anyscale cluster environments. Note that in Buildkite, we will still only wait for the python 3.7 base image before kicking off tests. That is acceptable, as we can assume that most wheels finish in a similar time, so even if we wait for the 3.7 image and kick off a 3.8 test, that runner will wait maybe for 5-10 more minutes.	2022-05-17 17:03:12 +01:00
Amog Kamsetty	ae9c68e75f	[Train] Fully deprecate Ray SGD v1 (#24038 ) Ray SGD v1 has been denoted as a deprecated API for a while. This PR fully deprecates Ray SGD v1. An error will be raised if ray.util.sgd package is attempted to be imported. Closes #16435	2022-04-25 16:12:57 -07:00
Simon Mo	7b0c77dd38	[Serve] Fix torch_tune_serve_test client test (#24031 )	2022-04-20 16:52:27 -07:00
Amog Kamsetty	9ec5793bea	[Release] Fix XGBoost Golden Notebook Tests (#23996 ) Xgboost released a new version a few days ago. Due to caching of the Anyscale cluster env, this resulted in the server having an outdated xgboost version while the client has the most recent version causing the test to fail. Instead, we reinstall xgboost-ray and xgboost in the post build commands so that these dependencies are not being cached in the cluster env.	2022-04-18 21:44:47 -07:00
Kai Fricke	430ea3e636	[ci/release] Migrate golden notebook tests (#22949 ) Migrating golden notebook tests to new release test package. Tests are passing: https://buildkite.com/ray-project/release-tests-branch/builds/155	2022-03-13 21:39:41 +00:00
Max Pumperla	29d94a2211	[docs] sphinx gallery removal, migrate to ipynb (#22467 )	2022-02-19 01:19:07 -08:00
Balaji Veeramani	7f1bacc7dc	[CI] Format Python code with Black (#21975 ) See #21316 and #21311 for the motivation behind these changes.	2022-01-29 18:41:57 -08:00
Max Pumperla	f9b71a8bf6	[docs] new structure (#21776 ) This PR consolidates both #21667 and #21759 (look there for features), but improves on them in the following way: - [x] we reverted renaming of existing projects `tune`, `rllib`, `train`, `cluster`, `serve`, `raysgd` and `data` so that links won't break. I think my consolidation efforts with the `ray-` prefix were a little overeager in that regard. It's better like this. Only the creation of `ray-core` was a necessity, and some files moved into the `rllib` folder, so that should be relatively benign. - [x] Additionally, we added Algolia `docsearch`, screenshot below. This is _much_ better than our current search. Caveat: there's a sphinx dependency that needs to be replaced (`sphinx-tabs`) by another, newer one (`sphinx-panels`), as the former prevents loading of the `algolia.js` library. Will follow-up in the next PR (hoping this one doesn't get re-re-re-re-reverted).	2022-01-21 15:42:05 -08:00
SangBin Cho	b1308b1c8c	[Test Infra] Unrevert team col (#21700 ) This fixes the previous problems from team column revert. This has 2 additional changes; alert handler receives the team argument, which was the root cause of breakage; https://github.com/ray-project/ray/pull/21289 Previously, tests without a team column were raising an exception, but I made the condition weaker (warning logs). I will eventually change it to raise an exception, but for smoother transition, we will log warning instead for a short time	2022-01-19 13:29:53 -08:00
mwtian	0b3fed5ef3	Revert "[Nightly Test] Add a team column to each test config. (#21198 )" (#21289 ) This reverts commit `b5b11b2d06`.	2021-12-30 06:44:51 +09:00
SangBin Cho	b5b11b2d06	[Nightly Test] Add a team column to each test config. (#21198 ) Please review e2e.py and test_suite belonging to your team! This is the first part of https://docs.google.com/document/d/16IrwerYi2oJugnRf5hvzukgpJ6FAVEpB6stH_CiNMjY/edit# This PR adds a team name to each test suite. If the name is not specified, it will be reported as unspecified. If you are running a local test, and if the new test suite doesn't have a team name specified, it will raise an exception (in this way, we can avoid missing team names in the future). Note that we will aggregate all of test config into a single file, nightly_test.yaml.	2021-12-27 14:42:41 -08:00
xwjiang2010	46d2f2c160	[release test] Update torch_tune_serve test to be compatible with new TrialCheckpoint class. (#21010 )	2021-12-10 17:26:15 +00:00
Kai Fricke	b3a9d4d87d	[ci/release] Remove quotation marks from pip installs (#20638 ) Quotation marks were needed in Anyscale app configs to avoid install errors when # were used e.g. in URLs. Since this has been fixed on the Anyscale side, we can get rid of these.	2021-12-05 17:57:08 -08:00
Amog Kamsetty	18dcf1ac25	[Release] Use nightly Docker images (#20001 ) * use nightly * switch ml cpu to ray cpu * fix * add pytest * add more pytest * add constraint * add tensorflow * fix merge conflict * add tblib * fix * add back uninstall	2021-11-10 18:00:16 -08:00
matthewdeng	790e22f9ad	[tune] move force_on_current_node to ml_utils (#20211 )	2021-11-10 10:21:24 -08:00
Kai Fricke	ad94eb03c6	[ci/release] wrap pip github installs in quotation marks to prevent comment errors (#19464 )	2021-10-18 18:55:56 +01:00
matthewdeng	caa42d753c	[release] pin modin>=0.11.0 due to ray.services being removed (#19446 )	2021-10-18 11:23:05 +01:00
Antoni Baum	e9df253f5d	[CI/docs] Remove [default] from xgboost-ray (#19186 ) Co-authored-by: Kai Fricke <kai@anyscale.com>	2021-10-14 16:29:55 +01:00
matthewdeng	d998373968	[release] fix test by pinning filelock (#19334 ) Co-authored-by: Kai Fricke <kai@anyscale.com>	2021-10-13 22:27:04 +01:00
matthewdeng	3fbe135a24	[docs] add modin_xgboost and dask_xgboost notebook tutorials (#18775 ) * Add xgboost-dask golden notebook * [examples] add modin-xgboost Jupyter notebook * Add xgboost dast gn * update modin notebook to sphinx-gallery compatible python file * fix build file * fix test * fix test * Add modin notebook anyscale connect test * Add missing file * add dask_xgboost notebook * Add the new modin golden notebook to CI * fix lint and filter out tests with py37 * Update release/golden_notebook_tests_new/golden_notebook_tests.yaml Co-authored-by: matthewdeng <matthew.j.deng@gmail.com> * Add dask, wait for cluster client, remove pytest * Replace folder * Fix * Update dask_xgboost_app_config.yaml * Update modin_xgboost_app_config.yaml * comment on filtered out tests Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>	2021-10-05 09:17:33 -07:00
xwjiang2010	09e760a1fd	[Release] Change all cpus_per_actor in xgboost test. (#18717 )	2021-09-17 12:57:21 -07:00
xwjiang2010	2c92f737f9	Fix dask_xgboost_test (#18713 )	2021-09-17 11:25:54 -07:00
Antoni Baum	7e95f330d5	[ci] Fix xgboost_ray install from git (#18640 )	2021-09-15 18:07:15 +01:00
Antoni Baum	eeb67a42cc	pip install xgboost_ray -> xgboost_ray[default] (#18607 ) Co-authored-by: Kai Fricke <kai@anyscale.com>	2021-09-15 14:45:56 +01:00
Antoni Baum	65d5deae60	[tests] Increase golden notebook test timeout to 20 mins (#18554 )	2021-09-14 16:27:56 +01:00
Kai Fricke	7d1e6d3129	[ci/release] Add sanity check for ray wheels hash to release tests (#18489 )	2021-09-10 17:50:31 +01:00
matthewdeng	e66f154b14	[release] increase torch_tune_serve timeout to 20 min (#18481 )	2021-09-09 16:31:14 -07:00
Antoni Baum	2c0dcec18f	[test] Fix golden notebook tests always failing (#17873 )	2021-08-31 17:07:47 +02:00
Antoni Baum	0a1228ef6e	Add configurable autosuspend for connect tests (#17958 )	2021-08-20 10:57:41 +02:00
Kai Fricke	8580e450cb	[release] update/unify base images (#17859 )	2021-08-16 12:44:25 +02:00
matthewdeng	46c1db1aa7	[release] increase golden notebook test timeout (#17601 )	2021-08-05 10:00:38 +01:00
matthewdeng	264e2df7e2	[release] update modin_xgboost_test to use anyscale connect (#16942 )	2021-07-07 22:37:41 -07:00
matthewdeng	23088bd7ea	[release] update torch_tune_serve_test to use anyscale connect (#16754 ) * [release] update torch_tune_serve_test to use anyscale connect * use download_results to download model checkpoint * clean up code to support both OSS and Anyscale	2021-07-06 19:02:50 -07:00
matthewdeng	a3f89d9f53	[release] write output for golden notebook tests (#16825 )	2021-07-01 16:10:58 -07:00
matthewdeng	b0f304a1b5	[release] add golden notebook release test for torch/tune/serve (#16619 ) * [release] add golden notebook release test for torch/tune/serve * start serve on all nodes so remote localhost works	2021-06-29 09:13:23 -07:00
Kai Fricke	9352cb781c	[release tests] Fix microbenchmark base image, network overhead cluster wait time, add long running tests (#16355 )	2021-06-16 21:37:17 +01:00
matthewdeng	9c36ff81fa	[release] add golden notebook tests for dask/xgboost and modin/xgboost (#16231 ) Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>	2021-06-11 10:03:04 +01:00

44 commits