ray/release/horovod_tests/horovod_tests.yaml
Amog Kamsetty 474e44f7e0
[Release/Horovod] Add user test for Horovod (#19661)
* infra

* wip

* add test

* typo

* typo

* update

* rename

* fix

* full path

* formatting

* reorder

* update

* update

* Update release/horovod_tests/workloads/horovod_user_test.py

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>

* bump num_workers

* update installs

* try

* add pip_packages

* min_workers

* fix

* bump pg timeout

* Fix symlink

* fix

* fix

* cmake

* fix

* pin filelock

* final

* update

* fix

* Update release/horovod_tests/workloads/horovod_user_test.py

* fix

* fix

* separate compute template

* test latest and master

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>
2021-11-01 18:28:07 -07:00

- name: horovod_test
  cluster:
    app_config: app_config_master.yaml
    compute_template: compute_tpl.yaml

  run:
    timeout: 36000
    prepare: python wait_cluster.py 3 600
    script: python workloads/horovod_test.py
    long_running: True

  smoke_test:
    run:
      timeout: 1800

- name: horovod_user_test_latest
  cluster:
    app_config: app_config.yaml
    compute_template: compute_tpl_autoscaling.yaml

  run:
    use_connect: True
    autosuspend_mins: 10
    timeout: 1200
    script: python workloads/horovod_user_test.py

- name: horovod_user_test_master
  cluster:
    app_config: app_config_master.yaml
    compute_template: compute_tpl_autoscaling.yaml

  run:
    use_connect: True
    autosuspend_mins: 10
    timeout: 1200
    script: python workloads/horovod_user_test.py