mirror of
https://github.com/vale981/ray
synced 2025-03-10 13:26:39 -04:00
![]() ## What do these changes do? * distribute load and resource information on a heartbeat * for each raylet, maintain total and available resource capacity as well as measure of current load * this PR introduces a new notion of load, defined as a sum of all resource demand induced by queued ready tasks on the local raylet. This provides a heterogeneity-aware measure of load that supersedes legacy Ray's task count as a proxy for load. * modify the scheduling policy to perform *capacity-based*, *load-aware*, *optimistically concurrent* resource allocation * perform task spillover to the heartbeating node in response to a heartbeat, implementing heterogeneity-aware late-binding/work-stealing. |
||
---|---|---|
.. | ||
common | ||
global_scheduler | ||
local_scheduler | ||
plasma | ||
ray |