ray/release
xwjiang2010 ee7a458762
[release test] fix horovod release test. (#22781)
horovod_user_test_master is failing with recent horovod release[[link](https://buildkite.com/ray-project/periodic-ci/builds/2960#61dabda8-eea0-4b7b-93bf-9e341926d3fd)]. 
Error message is saying:
```
AttributeError: Can't get attribute '_ExecutorDriver' on <module 'horovod.ray.runner' from '/home/ray/anaconda3/lib/python3.7/site-packages/horovod/ray/runner.py'>
```
The horovod test is set up in such a way that it has the "driver" (a.k.a. client) part (which is the code that runs in a buildkite agent) and the "cluster" (a.k.a. server) part (which runs in Anyscale cluster). Driver's dependency is specified by `release/ml_user_tests/horovod/driver_setup_master.sh` while cluster's dependency is specified by `release/horovod_tests/app_config_master.yaml`.

The two communicate via Anyscale client. 
The above error message is complaining that while client's horovod version has _ExecutorDriver in runner.py, the server's horovod doesn't. This is due to the version mismatch of the above two files. This PR brings the two horovod dependency to both point to horovod master.
2022-03-03 08:24:26 -08:00
..
.buildkite Add Parquet metadata resolution nightly test to test set. (#22787) 2022-03-02 14:56:00 -08:00
alerts Revert "[Nightly Test] Add more metadata to test result (#21990)" (#22052) 2022-02-02 12:56:42 -08:00
golden_notebook_tests [docs] sphinx gallery removal, migrate to ipynb (#22467) 2022-02-19 01:19:07 -08:00
horovod_tests [release test] fix horovod release test. (#22781) 2022-03-03 08:24:26 -08:00
kubernetes_manual_tests [test][k8s] Restore kubernetes test directory, adds some info (#18982) 2021-10-01 11:23:22 +01:00
lightgbm_tests [ci/release] Refactor release test e2e into package (#22351) 2022-02-16 17:35:02 +00:00
long_running_distributed_tests [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
long_running_tests [rllib] Fix impala long running test (#22619) 2022-02-24 09:03:55 -08:00
microbenchmark [Release Test] Send release test result to db pipeline (#22667) 2022-03-02 06:19:31 -08:00
ml_user_tests [release test] fix horovod release test. (#22781) 2022-03-03 08:24:26 -08:00
nightly_tests mark threaded actor test unstable (#22696) 2022-02-28 15:25:14 -08:00
ray_release [ci/release] Add support for concurrency groups (#22728) 2022-03-02 16:35:54 +01:00
release_logs [Release 1.11.0] Release logs for 1.11.0rc1 (#22443) 2022-02-16 17:03:49 -08:00
rllib_tests [RLlib] SlateQ: Add a hard-task learning test to weekly regression suite. (#22544) 2022-02-25 21:58:16 +01:00
runtime_env_tests [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
serve_tests [serve] [release tests] Add health check grace period for 1k deployment (#22651) 2022-02-25 12:13:44 -06:00
sgd_tests [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
tune_tests [tune] Retry cloud sync up/down/delete on fail (#22029) 2022-02-15 12:27:29 +00:00
util [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
xgboost_tests [ci/release] Refactor release test e2e into package (#22351) 2022-02-16 17:35:02 +00:00
__init__.py [release] move release testing end to end script to main ray repo (#17070) 2021-07-14 12:39:07 -07:00
alert.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
BUILD [release] Fix special cases in release test package (e.g. smoke test) (#22442) 2022-02-28 21:05:01 +01:00
config_generator.html [release] move release testing end to end script to main ray repo (#17070) 2021-07-14 12:39:07 -07:00
e2e.py [Release Test] Send release test result to db pipeline (#22667) 2022-03-02 06:19:31 -08:00
README.md [Release] Remove release process doc (#19312) 2021-10-18 11:24:03 -07:00
release_tests.yaml [release] Migrate Tune + XGBoost tests to new infrastructure (#22705) 2022-03-01 08:10:06 +01:00
requirements.txt [ci/release] Refactor release test e2e into package (#22351) 2022-02-16 17:35:02 +00:00
requirements_buildkite.txt [ci/release] Refactor release test e2e into package (#22351) 2022-02-16 17:35:02 +00:00
run_e2e.sh [ci/release] Increase long running timeout, fix artifacts copy (#21905) 2022-01-26 21:25:03 +00:00
run_release_test.sh [release] Fix special cases in release test package (e.g. smoke test) (#22442) 2022-02-28 21:05:01 +01:00
setup.py [ci/release] Refactor release test e2e into package (#22351) 2022-02-16 17:35:02 +00:00
test_owners.yaml [ci] Add release test owner overview file (#18590) 2021-09-14 11:00:31 -07:00

Release Tests

While the exact release process relies on Anyscale internal tooling, the tests we run during the releases are located at https://github.com/ray-project/ray/blob/master/release/.buildkite/build_pipeline.py