ray/release/tune_tests/scalability_tests
Kai Fricke 1ef2a6790c
[tune] add scalability release tests (#13986)
* Add scalability tests

* Network overhead cluster

* Update xgboost tests

* Document release tests

* Don't raise on failed trial

* Update to multi node yamls

* Update yamls

* Revert xgboost test changes

* Fix import

* Update release/tune_tests/scalability_tests/workloads/test_bookkeeping_overhead.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Pass aws credentials (WIP)

* Update durable trainable example

* Update xgboost sweep

* Change xgboost scope, fix durable trainable stop condition

* Fix max depth to limit total test length

* Add cluster information to test descriptions. Update release checklist/process docs

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-10 17:16:31 +01:00
..
workloads [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
cluster_1x16.yaml [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
cluster_1x32_hd.yaml [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
cluster_1x96.yaml [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
cluster_16x2.yaml [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
cluster_16x64.yaml [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
cluster_16x64_data.yaml [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
cluster_200x2.yaml [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
create_test_data.py [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
requirements.txt [tune] buffer trainable results (#13236) 2021-01-12 18:52:47 +01:00
run.sh [tune] add scalability release tests (#13986) 2021-02-10 17:16:31 +01:00
wait_cluster.py [tune] buffer trainable results (#13236) 2021-01-12 18:52:47 +01:00