ray/release/xgboost_tests
Kai Fricke 6c5229295e
[ci/release] Support running tests with different python versions (#24843)
OSS release tests currently run with hardcoded Python 3.7 base. In the future we will want to run tests on different python versions. 
This PR adds support for a new `python` field in the test configuration. The python field will determine both the base image used in the Buildkite runner docker container (for Ray client compatibility) and the base image for the Anyscale cluster environments. 

Note that in Buildkite, we will still only wait for the python 3.7 base image before kicking off tests. That is acceptable, as we can assume that most wheels finish in a similar time, so even if we wait for the 3.7 image and kick off a 3.8 test, that runner will wait maybe for 5-10 more minutes.
2022-05-17 17:03:12 +01:00
..
workloads [Release] Upgrade instance types for xgboost gpu release tests (#24002) 2022-04-20 15:18:22 -07:00
app_config.yaml [ci/release] Support running tests with different python versions (#24843) 2022-05-17 17:03:12 +01:00
app_config_gpu.yaml [ci/release] Support running tests with different python versions (#24843) 2022-05-17 17:03:12 +01:00
create_test_data.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
README.rst [xgboost] Update XGBoost release test configs (#13941) 2021-02-17 23:00:49 +01:00
tpl_cpu_moderate.yaml [release] Move xgboost tune small + microbenchmark release test to new release automation (#15619) 2021-05-08 20:38:39 +01:00
tpl_cpu_small.yaml [release] Move xgboost tune small + microbenchmark release test to new release automation (#15619) 2021-05-08 20:38:39 +01:00
tpl_gpu_small.yaml [Release] Upgrade instance types for xgboost gpu release tests (#24002) 2022-04-20 15:18:22 -07:00

XGBoost on Ray tests
====================

This directory contains various XGBoost on Ray release tests.

You should run these tests with the `releaser <https://github.com/ray-project/releaser>`_ tool.

Overview
--------
There are four kinds of tests:

1. ``distributed_api_test`` - checks general API functionality and should finish very quickly (< 1 minute)
2. ``train_*`` - checks single trial training on different setups.
3. ``tune_*`` - checks multi trial training via Ray Tune.
4. ``ft_*`` - checks fault tolerance.

Generally the releaser tool will run all tests in parallel, but if you do
it sequentially, be sure to do it in the order above. If ``train_*`` fails,
``tune_*`` will fail, too.

Acceptance criteria
-------------------
These tests are considered passing when they throw no error at the end of
the output log.