ray/doc/source/tune/api_docs/logging.rst

.. _loggers-docstring:

Loggers (tune.logger)
=====================

Tune has default loggers for Tensorboard, CSV, and JSON formats. By default, Tune only logs the returned result dictionaries from the training function.

If you need to log something lower level like model weights or gradients, see :ref:`Trainable Logging <trainable-logging>`.

Custom Loggers
--------------

You can create a custom logger by inheriting the Logger interface (:ref:`logger-interface`):

.. code-block:: python

    from ray.tune.logger import Logger

    class MLFLowLogger(Logger):
        """MLFlow logger.

        Requires the experiment configuration to have a MLFlow Experiment ID
        or manually set the proper environment variables.
        """

        def _init(self):
            from mlflow.tracking import MlflowClient
            client = MlflowClient()

            # self.config is the same config that your Trainable will see.
            run = client.create_run(self.config.get("mlflow_experiment_id"))
            self._run_id = run.info.run_id
            for key, value in self.config.items():
                client.log_param(self._run_id, key, value)
            self.client = client

        def on_result(self, result):
            for key, value in result.items():
                if not isinstance(value, float):
                    continue
                self.client.log_metric(
                    self._run_id, key, value, step=result.get(TRAINING_ITERATION))

        def close(self):
            self.client.set_terminated(self._run_id)

You can then pass in your own logger as follows:

.. code-block:: python

    from ray.tune.logger import DEFAULT_LOGGERS

    tune.run(
        MyTrainableClass,
        name="experiment_name",
        loggers=DEFAULT_LOGGERS + (CustomLogger1, CustomLogger2)
    )

These loggers will be called along with the default Tune loggers. You can also check out `logger.py <https://github.com/ray-project/ray/blob/master/python/ray/tune/logger.py>`__ for implementation details.

An example of creating a custom logger can be found in `logging_example.py <https://github.com/ray-project/ray/blob/master/python/ray/tune/examples/logging_example.py>`__.

.. _trainable-logging:

Trainable Logging
-----------------

By default, Tune only logs the *training result dictionaries* from your Trainable. However, you may want to visualize the model weights, model graph, or use a custom logging library that requires multi-process logging. For example, you may want to do this if:

 * you're using `Weights and Biases <https://www.wandb.com/>`_
 * you're using `MLFlow <https://github.com/mlflow/mlflow/>`__
 * you're trying to log images to Tensorboard.

You can do this in the trainable, as shown below:

.. tip:: Make sure that any logging calls or objects stay within scope of the Trainable. You may see Pickling/serialization errors or inconsistent logs otherwise.

**Function API**:

.. code-block:: python

    def trainable(config):
        library.init(
            name=trial_id,
            id=trial_id,
            resume=trial_id,
            reinit=True,
            allow_val_change=True)
        library.set_log_path(tune.track.logdir)

        for step in range(100):
            library.log_model(...)
            library.log(results, step=step)
            tune.track.log(results)


**Class API**:

.. code-block:: python

    class CustomLogging(tune.Trainable)
        def _setup(self, config):
            trial_id = self.trial_id
            library.init(
                name=trial_id,
                id=trial_id,
                resume=trial_id,
                reinit=True,
                allow_val_change=True)
            library.set_log_path(self.logdir)

        def _train(self):
            library.log_model(...)

        def _log_result(self, result):
            res_dict = {
                str(k): v
                for k, v in result.items()
                if (v and "config" not in k and not isinstance(v, str))
            }
            step = result["training_iteration"]
            library.log(res_dict, step=step)

Use ``self.logdir`` (only for Class API) or ``tune.track.logdir`` (only for Function API) for the trial log directory.

In the distributed case, these logs will be sync'ed back to the driver under your logger path. This will allow you to visualize and analyze logs of all distributed training workers on a single machine.


Log Directory
-------------

Tune will log the results of each trial to a subfolder under a specified local dir, which defaults to ``~/ray_results``.

.. code-block:: python

    # This logs to 2 different trial folders:
    # ~/ray_results/trainable_name/trial_name_1 and ~/ray_results/trainable_name/trial_name_2
    # trainable_name and trial_name are autogenerated.
    tune.run(trainable, num_samples=2)

You can specify the ``local_dir`` and ``trainable_name``:

.. code-block:: python

    # This logs to 2 different trial folders:
    # ./results/test_experiment/trial_name_1 and ./results/test_experiment/trial_name_2
    # Only trial_name is autogenerated.
    tune.run(trainable, num_samples=2, local_dir="./results", name="test_experiment")

To specify custom trial folder names, you can pass use the ``trial_name_creator`` argument
to `tune.run`.  This takes a function with the following signature:

.. code-block:: python

    def trial_name_string(trial):
        """
        Args:
            trial (Trial): A generated trial object.

        Returns:
            trial_name (str): String representation of Trial.
        """
        return str(trial)

    tune.run(
        MyTrainableClass,
        name="example-experiment",
        num_samples=1,
        trial_name_creator=trial_name_string
    )

See the documentation on Trials: :ref:`trial-docstring`.


Viskit
------

Tune automatically integrates with `Viskit <https://github.com/vitchyr/viskit>`_ via the ``CSVLogger`` outputs. To use VisKit (you may have to install some dependencies), run:

.. code-block:: bash

    $ git clone https://github.com/rll/rllab.git
    $ python rllab/rllab/viskit/frontend.py ~/ray_results/my_experiment

The nonrelevant metrics (like timing stats) can be disabled on the left to show only the relevant ones (like accuracy, loss, etc.).

.. image:: /ray-tune-viskit.png


UnifiedLogger
-------------

.. autoclass:: ray.tune.logger.UnifiedLogger

TBXLogger
---------

.. autoclass:: ray.tune.logger.TBXLogger

JsonLogger
----------

.. autoclass:: ray.tune.logger.JsonLogger

CSVLogger
---------

.. autoclass:: ray.tune.logger.CSVLogger

MLFLowLogger
------------

Tune also provides a default logger for `MLFlow <https://mlflow.org>`_. You can install MLFlow via ``pip install mlflow``. An example can be found `mlflow_example.py <https://github.com/ray-project/ray/blob/master/python/ray/tune/examples/mlflow_example.py>`__. Note that this currently does not include artifact logging support. For this, you can use the native MLFlow APIs inside your Trainable definition.

.. autoclass:: ray.tune.logger.MLFLowLogger


.. _logger-interface:

Logger
------

.. autoclass:: ray.tune.logger.Logger
[tune] New Doc edits, add Concepts page (#8083) Co-Authored-By: Sven Mika <sven@anyscale.io> 2020-04-25 18:25:56 -07:00			`.. _loggers-docstring:`

			`Loggers (tune.logger)`
			`=====================`

[docs][tune] Make search algorithm, scheduler docs better! (#8179) 2020-05-17 12:19:44 -07:00			`Tune has default loggers for Tensorboard, CSV, and JSON formats. By default, Tune only logs the returned result dictionaries from the training function.`
[tune] New Doc edits, add Concepts page (#8083) Co-Authored-By: Sven Mika <sven@anyscale.io> 2020-04-25 18:25:56 -07:00
[docs][tune] Make search algorithm, scheduler docs better! (#8179) 2020-05-17 12:19:44 -07:00			If you need to log something lower level like model weights or gradients, see :ref:`Trainable Logging <trainable-logging>`.

			`Custom Loggers`
			`--------------`

			You can create a custom logger by inheriting the Logger interface (:ref:`logger-interface`):

			`.. code-block:: python`

			`from ray.tune.logger import Logger`

			`class MLFLowLogger(Logger):`
			`"""MLFlow logger.`

			`Requires the experiment configuration to have a MLFlow Experiment ID`
			`or manually set the proper environment variables.`
			`"""`

			`def _init(self):`
			`from mlflow.tracking import MlflowClient`
			`client = MlflowClient()`

			`# self.config is the same config that your Trainable will see.`
			`run = client.create_run(self.config.get("mlflow_experiment_id"))`
			`self._run_id = run.info.run_id`
			`for key, value in self.config.items():`
			`client.log_param(self._run_id, key, value)`
			`self.client = client`

			`def on_result(self, result):`
			`for key, value in result.items():`
			`if not isinstance(value, float):`
			`continue`
			`self.client.log_metric(`
			`self._run_id, key, value, step=result.get(TRAINING_ITERATION))`

			`def close(self):`
			`self.client.set_terminated(self._run_id)`

			`You can then pass in your own logger as follows:`

			`.. code-block:: python`

			`from ray.tune.logger import DEFAULT_LOGGERS`

			`tune.run(`
			`MyTrainableClass,`
			`name="experiment_name",`
			`loggers=DEFAULT_LOGGERS + (CustomLogger1, CustomLogger2)`
			`)`

			These loggers will be called along with the default Tune loggers. You can also check out `logger.py <https://github.com/ray-project/ray/blob/master/python/ray/tune/logger.py>`__ for implementation details.

			An example of creating a custom logger can be found in `logging_example.py <https://github.com/ray-project/ray/blob/master/python/ray/tune/examples/logging_example.py>`__.

			`.. _trainable-logging:`

			`Trainable Logging`
			`-----------------`

			`By default, Tune only logs the training result dictionaries from your Trainable. However, you may want to visualize the model weights, model graph, or use a custom logging library that requires multi-process logging. For example, you may want to do this if:`

			* you're using `Weights and Biases <https://www.wandb.com/>`_
			* you're using `MLFlow <https://github.com/mlflow/mlflow/>`__
			`* you're trying to log images to Tensorboard.`

			`You can do this in the trainable, as shown below:`

			`.. tip:: Make sure that any logging calls or objects stay within scope of the Trainable. You may see Pickling/serialization errors or inconsistent logs otherwise.`

			`Function API:`

			`.. code-block:: python`

			`def trainable(config):`
			`library.init(`
			`name=trial_id,`
			`id=trial_id,`
			`resume=trial_id,`
			`reinit=True,`
			`allow_val_change=True)`
			`library.set_log_path(tune.track.logdir)`

			`for step in range(100):`
			`library.log_model(...)`
			`library.log(results, step=step)`
			`tune.track.log(results)`


			`Class API:`

			`.. code-block:: python`

			`class CustomLogging(tune.Trainable)`
			`def _setup(self, config):`
			`trial_id = self.trial_id`
			`library.init(`
			`name=trial_id,`
			`id=trial_id,`
			`resume=trial_id,`
			`reinit=True,`
			`allow_val_change=True)`
			`library.set_log_path(self.logdir)`

			`def _train(self):`
			`library.log_model(...)`

			`def _log_result(self, result):`
			`res_dict = {`
			`str(k): v`
			`for k, v in result.items()`
			`if (v and "config" not in k and not isinstance(v, str))`
			`}`
			`step = result["training_iteration"]`
			`library.log(res_dict, step=step)`

			Use ``self.logdir`` (only for Class API) or ``tune.track.logdir`` (only for Function API) for the trial log directory.

			`In the distributed case, these logs will be sync'ed back to the driver under your logger path. This will allow you to visualize and analyze logs of all distributed training workers on a single machine.`


			`Log Directory`
			`-------------`
[tune] New Doc edits, add Concepts page (#8083) Co-Authored-By: Sven Mika <sven@anyscale.io> 2020-04-25 18:25:56 -07:00
			Tune will log the results of each trial to a subfolder under a specified local dir, which defaults to ``~/ray_results``.

			`.. code-block:: python`

			`# This logs to 2 different trial folders:`
			`# ~/ray_results/trainable_name/trial_name_1 and ~/ray_results/trainable_name/trial_name_2`
			`# trainable_name and trial_name are autogenerated.`
			`tune.run(trainable, num_samples=2)`

			You can specify the ``local_dir`` and ``trainable_name``:

			`.. code-block:: python`

			`# This logs to 2 different trial folders:`
			`# ./results/test_experiment/trial_name_1 and ./results/test_experiment/trial_name_2`
			`# Only trial_name is autogenerated.`
			`tune.run(trainable, num_samples=2, local_dir="./results", name="test_experiment")`

			To specify custom trial folder names, you can pass use the ``trial_name_creator`` argument
			to `tune.run`. This takes a function with the following signature:

			`.. code-block:: python`

			`def trial_name_string(trial):`
			`"""`
			`Args:`
			`trial (Trial): A generated trial object.`

			`Returns:`
			`trial_name (str): String representation of Trial.`
			`"""`
			`return str(trial)`

			`tune.run(`
			`MyTrainableClass,`
			`name="example-experiment",`
			`num_samples=1,`
			`trial_name_creator=trial_name_string`
			`)`

			See the documentation on Trials: :ref:`trial-docstring`.


			`Viskit`
			`------`

			Tune automatically integrates with `Viskit <https://github.com/vitchyr/viskit>`_ via the ``CSVLogger`` outputs. To use VisKit (you may have to install some dependencies), run:

			`.. code-block:: bash`

			`$ git clone https://github.com/rll/rllab.git`
			`$ python rllab/rllab/viskit/frontend.py ~/ray_results/my_experiment`

			`The nonrelevant metrics (like timing stats) can be disabled on the left to show only the relevant ones (like accuracy, loss, etc.).`

			`.. image:: /ray-tune-viskit.png`


			`UnifiedLogger`
			`-------------`

			`.. autoclass:: ray.tune.logger.UnifiedLogger`

			`TBXLogger`
			`---------`

			`.. autoclass:: ray.tune.logger.TBXLogger`

			`JsonLogger`
			`----------`

			`.. autoclass:: ray.tune.logger.JsonLogger`

			`CSVLogger`
			`---------`

			`.. autoclass:: ray.tune.logger.CSVLogger`

			`MLFLowLogger`
			`------------`

			Tune also provides a default logger for `MLFlow <https://mlflow.org>`_. You can install MLFlow via ``pip install mlflow``. An example can be found `mlflow_example.py <https://github.com/ray-project/ray/blob/master/python/ray/tune/examples/mlflow_example.py>`__. Note that this currently does not include artifact logging support. For this, you can use the native MLFlow APIs inside your Trainable definition.

			`.. autoclass:: ray.tune.logger.MLFLowLogger`
[docs][tune] Make search algorithm, scheduler docs better! (#8179) 2020-05-17 12:19:44 -07:00

			`.. _logger-interface:`

			`Logger`
			`------`

			`.. autoclass:: ray.tune.logger.Logger`