
.. _tune-tutorial:

A Basic Tune Tutorial
=====================
This tutorial will walk you through the process of setting up Tune. Specifically, we'll leverage early stopping and Bayesian Optimization (via HyperOpt) to optimize your PyTorch model.

.. tip:: If you have suggestions as to how to improve this tutorial, please `let us know <https://github.com/ray-project/ray/issues/new/choose>`_!

To run this example, you will need to install the following:

.. code-block:: bash

    $ pip install ray torch torchvision
PyTorch Model Setup
~~~~~~~~~~~~~~~~~~~

To start off, let's first import some dependencies:

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __tutorial_imports_begin__
   :end-before: __tutorial_imports_end__

Then, let's define the PyTorch model that we'll be training.

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __model_def_begin__
   :end-before: __model_def_end__

Below, we have some boilerplate code for training and evaluating your model in PyTorch. :ref:`Skip ahead to the Tune usage <tutorial-tune-setup>`.

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __train_def_begin__
   :end-before: __train_def_end__

.. _tutorial-tune-setup:

Setting up Tune
~~~~~~~~~~~~~~~
Below, we define a function that trains the PyTorch model for multiple epochs. Under the hood, this function will be executed on a separate :ref:`Ray Actor (process) <actor-guide>`, so we need to communicate the performance of the model back to Tune (which runs in the main Python process).

To do this, we call :ref:`tune.report <tune-function-docstring>` in our training function, which sends the performance value back to Tune.

.. tip:: Since the function is executed on a separate process, make sure that it is :ref:`serializable by Ray <serialization-guide>`.
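
In skeleton form, the pattern looks like this (a minimal sketch with a stand-in metric computation, not the tutorial's actual function):

.. code-block:: python

    from ray import tune

    def trainable(config):
        for step in range(10):
            # Stand-in for training one epoch and evaluating the model.
            score = config["lr"] * step
            # Each call sends the current performance back to Tune.
            tune.report(mean_accuracy=score)

The tutorial's actual training function follows: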

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __train_func_begin__
   :end-before: __train_func_end__

Let's run one trial, calling :ref:`tune.run <tune-run-ref>` and :ref:`randomly sampling <tune-sample-docs>` from a uniform distribution for learning rate and momentum.

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __eval_func_begin__
   :end-before: __eval_func_end__

``tune.run`` returns an :ref:`Analysis object <tune-analysis-docs>`. You can use this to plot the performance of this trial.

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __plot_begin__
   :end-before: __plot_end__

.. note:: Tune will automatically run parallel trials across all available cores/GPUs on your machine or cluster. To limit the number of cores that Tune uses, you can call ``ray.init(num_cpus=<int>, num_gpus=<int>)`` before ``tune.run``. If you're using a Search Algorithm like Bayesian Optimization, you'll want to use the :ref:`ConcurrencyLimiter <limiter>`.
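
For example, a minimal sketch of capping Tune at four CPUs (``train_mnist`` and ``search_space`` stand in for the trainable and config defined in the snippets above):

.. code-block:: python

    import ray
    from ray import tune

    # Limit the resources Tune can schedule trials onto.
    ray.init(num_cpus=4)

    analysis = tune.run(train_mnist, config=search_space)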

Early Stopping with ASHA
~~~~~~~~~~~~~~~~~~~~~~~~

Let's integrate early stopping into our optimization process. We'll use :ref:`ASHA <tune-scheduler-hyperband>`, a scalable algorithm for `principled early stopping`_.

.. _`principled early stopping`: https://blog.ml.cmu.edu/2018/12/12/massively-parallel-hyperparameter-optimization/

At a high level, ASHA terminates trials that are less promising and allocates more time and resources to more promising trials. As our optimization process becomes more efficient, we can afford to **increase the search space by 5x**, by adjusting the parameter ``num_samples``.
ASHA is implemented in Tune as a "Trial Scheduler". These Trial Schedulers can terminate bad trials early, pause trials, clone trials, and alter the hyperparameters of a running trial. See :ref:`the TrialScheduler documentation <tune-schedulers>` for more details on available schedulers and library integrations.

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __run_scheduler_begin__
   :end-before: __run_scheduler_end__

You can run the code below in a Jupyter notebook to visualize trial progress.

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __plot_scheduler_begin__
   :end-before: __plot_scheduler_end__

.. image:: /images/tune-df-plot.png
   :scale: 50%
   :align: center

You can also use :ref:`TensorBoard <tensorboard>` for visualizing results. By default, Tune logs results to ``~/ray_results``, so you can point TensorBoard at that directory:

.. code-block:: bash

    $ tensorboard --logdir {logdir}

Search Algorithms in Tune
~~~~~~~~~~~~~~~~~~~~~~~~~

In addition to :ref:`TrialSchedulers <tune-schedulers>`, you can further optimize your hyperparameters by using an intelligent search technique like Bayesian Optimization. To do this, you can use a Tune :ref:`Search Algorithm <tune-search-alg>`. Search Algorithms leverage optimization algorithms to intelligently navigate the given hyperparameter space. Note that each library has a specific way of defining the search space.

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __run_searchalg_begin__
   :end-before: __run_searchalg_end__

.. note:: Tune allows you to use some search algorithms in combination with different trial schedulers. See :ref:`this page for more details <tune-schedulers>`.
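
For instance, a sketch of combining HyperOpt with the ASHA scheduler and a concurrency cap might look like the following (``train_mnist`` is again a stand-in name for the tutorial's trainable; HyperOpt defines the search space with its own ``hp`` primitives):

.. code-block:: python

    from hyperopt import hp
    from ray import tune
    from ray.tune.schedulers import ASHAScheduler
    from ray.tune.suggest import ConcurrencyLimiter
    from ray.tune.suggest.hyperopt import HyperOptSearch

    # HyperOpt's own search-space format.
    space = {
        "lr": hp.loguniform("lr", -10, -1),
        "momentum": hp.uniform("momentum", 0.1, 0.9),
    }

    # Bayesian optimizers work best with a bounded number of in-flight trials.
    algo = ConcurrencyLimiter(
        HyperOptSearch(space, metric="mean_accuracy", mode="max"),
        max_concurrent=4)
    scheduler = ASHAScheduler(metric="mean_accuracy", mode="max")

    analysis = tune.run(
        train_mnist,
        search_alg=algo,
        scheduler=scheduler,
        num_samples=20)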

Evaluate your model
~~~~~~~~~~~~~~~~~~~

You can use the :ref:`Analysis object <tune-analysis-docs>` to retrieve the best trial and evaluate your trained model:

.. literalinclude:: /../../python/ray/tune/tests/tutorial.py
   :language: python
   :start-after: __run_analysis_begin__
   :end-before: __run_analysis_end__

Next Steps
----------

* Take a look at the :doc:`/tune/user-guide` for a more comprehensive overview of Tune's features.
* Check out the :ref:`Tune tutorials <tune-guides>` for guides on using Tune with your preferred machine learning library.
* Browse our :ref:`gallery of examples <tune-general-examples>` to see how to use Tune with PyTorch, XGBoost, TensorFlow, etc.
* `Let us know <https://github.com/ray-project/ray/issues>`__ if you run into issues or have any questions by opening an issue on our GitHub.