hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

SangBin Cho b1308b1c8c [Test Infra] Unrevert team col (#21700 ) This fixes the previous problems from team column revert. This has 2 additional changes; alert handler receives the team argument, which was the root cause of breakage; https://github.com/ray-project/ray/pull/21289 Previously, tests without a team column were raising an exception, but I made the condition weaker (warning logs). I will eventually change it to raise an exception, but for smoother transition, we will log warning instead for a short time		2022-01-19 13:29:53 -08:00
..
distributed	[Nightly Test] Fix broken scalability test #21201	2021-12-20 14:58:39 -08:00
object_store	[Nightly Test] Readjust nightly test schedule (#20717 )	2021-11-26 06:59:16 -08:00
single_node	Fix test_single_node json report (#19075 )	2021-10-04 13:05:32 -07:00
app_config.yaml	[nightly] Fix benchmark commit check failure (#21119 )	2021-12-15 14:54:03 -08:00
benchmark_tests.yaml	[Test Infra] Unrevert team col (#21700 )	2022-01-19 13:29:53 -08:00
distributed.yaml	Split scalability envelope + smoke tests (#17455 )	2021-07-30 10:20:19 -07:00
distributed_smoke_test.yaml	Split scalability envelope + smoke tests (#17455 )	2021-07-30 10:20:19 -07:00
many_nodes.yaml	Split scalability envelope + smoke tests (#17455 )	2021-07-30 10:20:19 -07:00
object_store.yaml	[Nightly Test] Readjust nightly test schedule (#20717 )	2021-11-26 06:59:16 -08:00
README.md	Move scalability envelope back down to 250 nodes (#15381 )	2021-04-16 19:39:24 -07:00
single_node.yaml	Integrate scalability envelope with releaser (#16417 )	2021-06-15 10:42:55 -07:00

README.md

Ray Scalability Envelope

Distributed Benchmarks

All distributed tests are run on 64 nodes with 64 cores/node. Maximum number of nodes is achieved by adding 4 core nodes.

Dimension	Quantity
# nodes in cluster (with trivial task workload)	250+
# actors in cluster (with trivial workload)	10k+
# simultaneously running tasks	10k+
# simultaneously running placement groups	1k+

Object Store Benchmarks

Dimension	Quantity
1 GiB object broadcast (# of nodes)	50+

Single Node Benchmarks.

All single node benchmarks are run on a single m4.16xlarge.

Dimension	Quantity
# of object arguments to a single task	10000+
# of objects returned from a single task	3000+
# of plasma objects in a single `ray.get` call	10000+
# of tasks queued on a single node	1,000,000+
Maximum `ray.get` numpy object size	100GiB+