ray/doc/source/workflows/index.rst
Yi Cheng 2262ac02f3
[workflow][doc] First pass of workflow doc. (#27331)
Signed-off-by: Yi Cheng 74173148+iycheng@users.noreply.github.com

Why are these changes needed?
This PR update workflow doc to reflect the recent change.
Focusing on position change and others.
2022-08-16 18:48:05 -07:00

32 lines
1.4 KiB
ReStructuredText

.. _workflows:
Ray Workflows: Durable Ray Task Graphs
======================================
.. warning::
Ray Workflows is available as **alpha** in Ray 2.0+. Expect rough corners and
for its APIs and storage format to change. Please file feature requests and
bug reports on GitHub Issues or join the discussion on the
`Ray Slack <https://forms.gle/9TSdDYUgxYs8SA9e8>`__.
Ray Workflows implements high-performance, *durable* application workflows using
Ray tasks as the underlying execution engine. It enables task-based Ray jobs
to seamlessly resume execution even in the case of entire-cluster failure.
Why Ray Workflows?
------------------
**Flexibility:** Combine the flexibility of Ray's dynamic task graphs with
strong durability guarantees. Branch or loop conditionally based on runtime
data. Use Ray distributed libraries seamlessly within workflow tasks.
**Performance:** Ray Workflows offers sub-second overheads for task launch and
supports workflows with hundreds of thousands of tasks. Take advantage of the
Ray object store to pass distributed datasets between tasks with zero-copy
overhead.
You might find that Ray Workflows is *lower level* compared to engines such as
`AirFlow <https://www.astronomer.io/blog/airflow-ray-data-science-story>`__
(which can also run on Ray). This is because Ray Workflows focuses more on core
durability primitives as opposed to tools and integrations.