.. _air:

Ray AI Runtime (AIR)
====================

.. tip::

    AIR is currently in **beta**. Fill out `this short form <https://forms.gle/wCCdbaQDtgErYycT6>`__ to get involved. We'll be holding office hours, development sprints, and other activities as we get closer to the GA release. Join us!
Ray AI Runtime (AIR) is a scalable and unified toolkit for ML applications. AIR enables easy scaling of individual workloads, end-to-end workflows, and popular ecosystem frameworks, all in just Python.

.. image:: images/ray-air.svg

AIR comes with ready-to-use libraries for :ref:`Preprocessing <datasets>`, :ref:`Training <train-docs>`, :ref:`Tuning <tune-main>`, :ref:`Scoring <air-predictors>`, :ref:`Serving <rayserve>`, and :ref:`Reinforcement Learning <rllib-index>`, as well as an ecosystem of integrations.

Ray AIR focuses on the compute aspects of ML:

* It provides scalability by leveraging Ray's distributed compute layer for ML workloads.
* It is designed to interoperate with other systems for storage and metadata needs.

Get started by installing Ray AIR:

.. code-block:: bash

    pip install -U "ray[air]"

    # The examples below were written with the following library versions.
    # Consider installing them to ensure that the code runs as shown:
    pip install -U "pandas>=1.3.5"
    pip install -U "torch>=1.12"
    pip install -U "numpy>=1.19.5"
    pip install -U "tensorflow>=2.6.2"
    pip install -U "pyarrow>=6.0.1"

Quick Start
-----------

Below, we demonstrate how AIR enables simple scaling of end-to-end ML workflows,
focusing on a few of the popular frameworks AIR integrates with (XGBoost, PyTorch, and TensorFlow):

Preprocessing
~~~~~~~~~~~~~

First, let's preprocess your data with Ray AIR's ``Preprocessors``:

.. literalinclude:: examples/xgboost_starter.py
    :language: python
    :start-after: __air_generic_preprocess_start__
    :end-before: __air_generic_preprocess_end__

If using TensorFlow or PyTorch, format your data for use with your training framework:


.. tabbed:: XGBoost

    .. code-block:: python

        # No extra preprocessing is required for XGBoost.
        # The data is already in the correct format.

.. tabbed:: Pytorch

    .. literalinclude:: examples/pytorch_tabular_starter.py
        :language: python
        :start-after: __air_pytorch_preprocess_start__
        :end-before: __air_pytorch_preprocess_end__

.. tabbed:: Tensorflow

    .. literalinclude:: examples/tf_tabular_starter.py
        :language: python
        :start-after: __air_tf_preprocess_start__
        :end-before: __air_tf_preprocess_end__
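
Whichever framework you use, the ``Preprocessor`` interface itself is small: fit it on a ``Dataset``, then transform that dataset (or new data) with it. As a minimal sketch, assuming ``ray[air]`` is installed and using a made-up single-column dataset:

.. code-block:: python

    import ray
    from ray.data.preprocessors import StandardScaler

    # A tiny in-memory Dataset, purely for illustration.
    ds = ray.data.from_items([{"x": 1.0}, {"x": 2.0}, {"x": 3.0}])

    # Fit the scaler's statistics on the dataset, then apply them.
    preprocessor = StandardScaler(columns=["x"])
    transformed = preprocessor.fit_transform(ds)

    print(transformed.take(3))

The same fitted preprocessor can later be reused at training and inference time, which keeps feature handling consistent across the workflow.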

Training
~~~~~~~~

Train a model with a ``Trainer`` using common ML frameworks:

.. tabbed:: XGBoost

    .. literalinclude:: examples/xgboost_starter.py
        :language: python
        :start-after: __air_xgb_train_start__
        :end-before: __air_xgb_train_end__

.. tabbed:: Pytorch

    .. literalinclude:: examples/pytorch_tabular_starter.py
        :language: python
        :start-after: __air_pytorch_train_start__
        :end-before: __air_pytorch_train_end__

.. tabbed:: Tensorflow

    .. literalinclude:: examples/tf_tabular_starter.py
        :language: python
        :start-after: __air_tf_train_start__
        :end-before: __air_tf_train_end__
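
To give a feel for the ``Trainer`` API itself, here is a rough sketch using ``XGBoostTrainer`` on a made-up toy dataset (this assumes ``xgboost`` is installed alongside ``ray[air]``; a real workload would load data from storage and use more workers):

.. code-block:: python

    import ray
    from ray.air.config import ScalingConfig
    from ray.train.xgboost import XGBoostTrainer

    # A made-up binary classification dataset, for illustration only.
    train_ds = ray.data.from_items(
        [{"x": float(i), "y": i % 2} for i in range(32)]
    )

    trainer = XGBoostTrainer(
        # One worker keeps the sketch small; raise num_workers to scale out.
        scaling_config=ScalingConfig(num_workers=1),
        label_column="y",
        params={"objective": "binary:logistic"},
        datasets={"train": train_ds},
    )
    result = trainer.fit()
    print(result.metrics)

``trainer.fit()`` returns a ``Result`` holding the reported metrics and a checkpoint of the trained model.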

Hyperparameter Tuning
~~~~~~~~~~~~~~~~~~~~~

You can specify a hyperparameter space to search over for each trainer:

.. tabbed:: XGBoost

    .. literalinclude:: examples/xgboost_starter.py
        :language: python
        :start-after: __air_xgb_tuner_start__
        :end-before: __air_xgb_tuner_end__

.. tabbed:: Pytorch

    .. literalinclude:: examples/pytorch_tabular_starter.py
        :language: python
        :start-after: __air_pytorch_tuner_start__
        :end-before: __air_pytorch_tuner_end__

.. tabbed:: Tensorflow

    .. literalinclude:: examples/tf_tabular_starter.py
        :language: python
        :start-after: __air_tf_tuner_start__
        :end-before: __air_tf_tuner_end__

Then use the ``Tuner`` to run the search:

.. literalinclude:: examples/pytorch_tabular_starter.py
    :language: python
    :start-after: __air_tune_generic_start__
    :end-before: __air_tune_generic_end__
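
The ``Tuner`` works the same way regardless of what it is tuning. As a rough, self-contained sketch, here is one with a made-up objective function in place of the trainers above, so it runs without any training framework:

.. code-block:: python

    from ray import tune
    from ray.air import session
    from ray.tune import Tuner, TuneConfig

    # A stand-in objective; in the Quick Start, a Trainer goes here instead.
    def objective(config):
        session.report({"score": config["x"] ** 2})

    tuner = Tuner(
        objective,
        param_space={"x": tune.grid_search([1, 2, 3])},
        tune_config=TuneConfig(metric="score", mode="min"),
    )
    results = tuner.fit()
    print(results.get_best_result().config)

Each trial reports its metric, and ``results.get_best_result()`` surfaces the best configuration found.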

Batch Inference
~~~~~~~~~~~~~~~

Use the trained model for scalable batch prediction with a ``BatchPredictor``.

.. tabbed:: XGBoost

    .. literalinclude:: examples/xgboost_starter.py
        :language: python
        :start-after: __air_xgb_batchpred_start__
        :end-before: __air_xgb_batchpred_end__

.. tabbed:: Pytorch

    .. literalinclude:: examples/pytorch_tabular_starter.py
        :language: python
        :start-after: __air_pytorch_batchpred_start__
        :end-before: __air_pytorch_batchpred_end__

.. tabbed:: Tensorflow

    .. literalinclude:: examples/tf_tabular_starter.py
        :language: python
        :start-after: __air_tf_batchpred_start__
        :end-before: __air_tf_batchpred_end__
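
Putting the pieces together, a ``BatchPredictor`` wraps the checkpoint produced by a ``Trainer`` and applies it to a dataset in parallel. A rough end-to-end sketch with ``XGBoostTrainer``, again on a made-up toy dataset and assuming ``xgboost`` is installed:

.. code-block:: python

    import ray
    from ray.air.config import ScalingConfig
    from ray.train.batch_predictor import BatchPredictor
    from ray.train.xgboost import XGBoostPredictor, XGBoostTrainer

    # Train a small model to obtain a checkpoint (illustrative data only).
    train_ds = ray.data.from_items(
        [{"x": float(i), "y": i % 2} for i in range(32)]
    )
    trainer = XGBoostTrainer(
        scaling_config=ScalingConfig(num_workers=1),
        label_column="y",
        params={"objective": "binary:logistic"},
        datasets={"train": train_ds},
    )
    result = trainer.fit()

    # Feed unlabeled data through the trained model in parallel.
    batch_predictor = BatchPredictor.from_checkpoint(
        result.checkpoint, XGBoostPredictor
    )
    predictions = batch_predictor.predict(
        ray.data.from_items([{"x": 10.0}, {"x": 11.0}])
    )
    print(predictions.take(2))

The prediction output is itself a ``Dataset``, so it can be written out or post-processed with the same data APIs.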

Why Ray AIR?
------------

Ray AIR aims to simplify the ecosystem of machine learning frameworks, platforms, and tools. It does this by taking a scalable, single-system approach to ML infrastructure (i.e., leveraging Ray as a unified compute framework):

**1. Seamless Dev to Prod**: AIR reduces friction going from development to production. Traditional orchestration approaches introduce separate systems and operational overheads. With Ray and AIR, the same Python code scales seamlessly from a laptop to a large cluster.

**2. Unified API**: Want to switch between frameworks like XGBoost and PyTorch, or try out a new library like HuggingFace? Thanks to the flexibility of AIR, you can do this by just swapping out a single class, without needing to set up new systems or change other aspects of your workflow.

**3. Open and Evolvable**: Ray core and libraries are fully open source and can run on any cluster, cloud, or Kubernetes, reducing the cost of platform lock-in. Want to go beyond what's built in? Run any framework you want using AIR's integration APIs, or build advanced use cases directly on Ray core.

.. figure:: images/why-air.png

    AIR enables a single-system / single-script approach to scaling ML. Ray's
    distributed Python APIs enable scaling of ML workloads without the burden of
    setting up or orchestrating separate distributed systems.

AIR is for both data scientists and ML engineers. Consider using AIR when you want to:

* Scale a single workload.
* Scale end-to-end ML applications.
* Build a custom ML platform for your organization.

AIR Ecosystem
-------------

AIR comes with built-in integrations with the most popular ecosystem libraries. The following diagram provides an overview of the AIR libraries, their ecosystem integrations, and their readiness. AIR's developer APIs also make it easy to create *custom integrations*.

..
    https://docs.google.com/drawings/d/1pZkRrkAbRD8jM-xlGlAaVo3T66oBQ_HpsCzomMT7OIc/edit

.. image:: images/air-ecosystem.svg

Next Steps
----------

- :ref:`air-key-concepts`
- `Examples <https://github.com/ray-project/ray/tree/master/python/ray/air/examples>`__
- :ref:`Deployment Guide <air-deployment>`
- :ref:`API reference <air-api-ref>`