XGBoost on Ray tests
====================

This directory contains various XGBoost on Ray release tests.

You should run these tests with the
`releaser <https://github.com/ray-project/releaser>`_ tool.

Overview
--------

There are four kinds of tests:

1. ``distributed_api_test`` - checks general API functionality and should
   finish very quickly (< 1 minute)
2. ``train_*`` - checks single-trial training on different setups
   (see the first sketch at the end of this document).
3. ``tune_*`` - checks multi-trial training via Ray Tune.
4. ``ft_*`` - checks fault tolerance
   (see the second sketch at the end of this document).

Generally the releaser tool will run all tests in parallel, but if you run
them sequentially, be sure to do so in the order above. If ``train_*``
fails, ``tune_*`` will fail, too.

Acceptance criteria
-------------------

These tests are considered passing when they throw no error at the end of
the output log.
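
For orientation, the ``train_*`` tests exercise the basic ``xgboost_ray``
training path. Below is a minimal sketch of that path, assuming the
``xgboost_ray`` and ``scikit-learn`` packages are installed and a Ray
cluster is reachable; the dataset and actor counts are illustrative, not
the values the release tests use:

.. code-block:: python

    from sklearn.datasets import load_breast_cancer
    from xgboost_ray import RayDMatrix, RayParams, train

    # Small in-memory dataset for illustration; the release tests
    # generate their own data instead.
    data, labels = load_breast_cancer(return_X_y=True)
    train_set = RayDMatrix(data, labels)

    # Distributed training across two actors, one CPU each.
    evals_result = {}
    bst = train(
        {"objective": "binary:logistic", "eval_metric": ["logloss", "error"]},
        train_set,
        evals=[(train_set, "train")],
        evals_result=evals_result,
        ray_params=RayParams(num_actors=2, cpus_per_actor=1),
    )
    print("Final training error:", evals_result["train"]["error"][-1])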
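
The ``ft_*`` tests additionally exercise actor fault tolerance, which
``xgboost_ray`` configures through ``RayParams``. A hedged sketch; the
specific values are illustrative, not the ones the release tests set:

.. code-block:: python

    from xgboost_ray import RayParams

    # Allow each failed training actor to be restarted up to twice,
    # resuming from the latest checkpoint. Illustrative values only.
    ray_params = RayParams(
        num_actors=4,
        cpus_per_actor=1,
        max_actor_restarts=2,
        checkpoint_frequency=5,
    )

Passing a ``ray_params`` object like this to ``train()`` lets training
survive individual actor failures instead of aborting the whole run.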