ray/release/rllib_tests/rllib_tests.yaml

# Heavy learning tests (Atari and HalfCheetah) for major algos.
- name: learning_tests
  cluster:
    app_config: app_config.yaml
    compute_template: 8gpus_64cpus.yaml

  run:
    timeout: 14400
    script: python learning_tests/run.py

  smoke_test:
    run:
      timeout: 1200

# 2-GPU learning tests (CartPole and RepeatAfterMeEnv) for major algos.
- name: multi_gpu_learning_tests
  cluster:
    app_config: app_config.yaml
    compute_template: 8gpus_96cpus.yaml

  run:
    timeout: 7200
    script: python multi_gpu_learning_tests/run.py

# 2-GPU learning tests (StatelessCartPole) + use_lstm=True for major algos
# (that support RNN models).
- name: multi_gpu_with_lstm_learning_tests
  cluster:
    app_config: app_config.yaml
    compute_template: 8gpus_96cpus.yaml

  run:
    timeout: 7200
    script: python multi_gpu_with_lstm_learning_tests/run.py

# 2-GPU learning tests (StatelessCartPole) + use_attention=True for major
# algos (that support RNN models).
- name: multi_gpu_with_attention_learning_tests
  cluster:
    app_config: app_config.yaml
    compute_template: 8gpus_96cpus.yaml

  run:
    timeout: 7200
    script: python multi_gpu_with_attention_learning_tests/run.py

# We'll have these as per-PR tests soon.
# - name: example_scripts_on_gpu_tests
#  cluster:
#    app_config: app_config.yaml
#    compute_template: 1gpu_4cpus.yaml

#  run:
#    timeout: 7200
#    script: bash unit_gpu_tests/run.sh

# IMPALA large machine stress tests (4x Atari).
- name: stress_tests
  cluster:
    app_config: app_config.yaml
    compute_template: 4gpus_544_cpus.yaml

  run:
    timeout: 5400
    prepare: python wait_cluster.py 6 600
    script: python stress_tests/run_stress_tests.py

  smoke_test:
    run:
      timeout: 2000

# Tests that exercise auto-scaling and Anyscale connect.
- name: connect_tests
  cluster:
    app_config: app_config.yaml
    compute_template: auto_scale.yaml

  run:
    use_connect: True
    timeout: 3000
    script: python connect_tests/run_connect_tests.py

# Nightly performance regression tests for popular algorithms.
# These algorithms run nightly for a predetermined amount of time, with no
# passing criteria.
# Performance metrics, such as reward achieved and throughput, are then
# collected and tracked over time.
- name: performance_tests
  cluster:
    app_config: app_config.yaml
    compute_template: 12gpus_192cpus.yaml

  run:
    timeout: 7200
    script: python performance_tests/run.py