ray/release
Eric Liang 015181ab9a
Add random access support for Datasets (experimental feature) (#22749)
This PR adds experimental support for random access to datasets. A Dataset can be random access enabled by calling `ds.to_random_access_dataset(key, num_workers=N)`. This creates a RandomAccessDataset.

RandomAccessDataset partitions the dataset across the cluster by the given sort key, providing efficient random access to records via binary search. A number of worker actors are created, each of which has zero-copy access to the underlying sorted data blocks of the Dataset.

Performance-wise, you can expect each worker to provide ~3000 records / second via ``get_async()``, and ~10000 records / second via ``multiget()``.

Since Ray actor calls go direct from worker->worker, throughput scales linearly with the number of workers.
2022-03-17 15:01:12 -07:00
..
alerts Revert "[Nightly Test] Add more metadata to test result (#21990)" (#22052) 2022-02-02 12:56:42 -08:00
benchmarks [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
golden_notebook_tests [ci/release] Migrate golden notebook tests (#22949) 2022-03-13 21:39:41 +00:00
horovod_tests [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
kubernetes_manual_tests [test][k8s] Restore kubernetes test directory, adds some info (#18982) 2021-10-01 11:23:22 +01:00
lightgbm_tests [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
long_running_distributed_tests [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
long_running_tests [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
microbenchmark [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
ml_user_tests [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
nightly_tests Add random access support for Datasets (experimental feature) (#22749) 2022-03-17 15:01:12 -07:00
ray_release [Remove Redis Pubsub 2/n] clean up remaining Redis references in gcs_utils.py (#23233) 2022-03-16 19:34:57 -07:00
release_logs [Release 1.11.0] Release logs for 1.11.0rc1 (#22443) 2022-02-16 17:03:49 -08:00
rllib_tests [RLlib] Change the slateq regression learning test with GPU to use torch only (#23168) 2022-03-16 09:15:59 +01:00
runtime_env_tests [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
serve_tests [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
sgd_tests/sgd_gpu [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
tune_tests [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
util [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
xgboost_tests [ci/release] Remove old OSS release test infrastructure (#23134) 2022-03-14 15:10:52 +00:00
__init__.py [release] move release testing end to end script to main ray repo (#17070) 2021-07-14 12:39:07 -07:00
BUILD [serve][release tests] Add smoke test to CI for remaining tests (#22962) 2022-03-09 23:36:32 -06:00
README.md [Release] Remove release process doc (#19312) 2021-10-18 11:24:03 -07:00
release_tests.yaml Add random access support for Datasets (experimental feature) (#22749) 2022-03-17 15:01:12 -07:00
requirements.txt [ci/release] Refactor release test e2e into package (#22351) 2022-02-16 17:35:02 +00:00
requirements_buildkite.txt [ci/release] Refactor release test e2e into package (#22351) 2022-02-16 17:35:02 +00:00
run_release_test.sh [ci/release] Disable infra retries for now (#23132) 2022-03-14 11:51:11 +00:00
setup.py [ci/release] Refactor release test e2e into package (#22351) 2022-02-16 17:35:02 +00:00

Release Tests

While the exact release process relies on Anyscale internal tooling, the tests we run during the releases are located at https://github.com/ray-project/ray/blob/master/release/.buildkite/build_pipeline.py