2022-01-20 15:30:56 -08:00
.. include :: we_are_hiring.rst
2021-11-10 22:20:06 +01:00
.. _rllib-index:
2021-11-13 22:10:47 +01:00
RLlib: Industry-Grade Reinforcement Learning
============================================
2021-11-10 22:20:06 +01:00
[docs] new structure (#21776)
This PR consolidates both #21667 and #21759 (look there for features), but improves on them in the following way:
- [x] we reverted renaming of existing projects `tune`, `rllib`, `train`, `cluster`, `serve`, `raysgd` and `data` so that links won't break. I think my consolidation efforts with the `ray-` prefix were a little overeager in that regard. It's better like this. Only the creation of `ray-core` was a necessity, and some files moved into the `rllib` folder, so that should be relatively benign.
- [x] Additionally, we added Algolia `docsearch`, screenshot below. This is _much_ better than our current search. Caveat: there's a sphinx dependency that needs to be replaced (`sphinx-tabs`) by another, newer one (`sphinx-panels`), as the former prevents loading of the `algolia.js` library. Will follow-up in the next PR (hoping this one doesn't get re-re-re-re-reverted).
2022-01-22 00:42:05 +01:00
.. figure :: images/rllib-index-header.svg
2021-11-10 22:20:06 +01:00
**RLlib** is an open-source library for reinforcement learning (RL), offering support for
production-level, highly distributed RL workloads while maintaining
unified and simple APIs for a large variety of industry applications.
Whether you would like to train your agents in a multi-agent setup,
purely from offline (historic) datasets, or using externally
connected simulators, RLlib offers a simple solution for each of your decision
making needs.
2021-11-18 18:08:40 +01:00
You **don't need** to be an **RL expert** to use RLlib, nor do you need to learn Ray or any
other of its libraries! If you either have your problem coded (in python) as an
`RL environment <https://medium.com/distributed-computing-with-ray/anatomy-of-a-custom-environment-for-rllib-327157f269e5> `_
or own lots of pre-recorded, historic behavioral data to learn from, you will be
up and running in only a few days.
RLlib is already used in production by industry leaders in many different verticals, such as
`climate control <https://www.anyscale.com/events/2021/06/23/applying-ray-and-rllib-to-real-life-industrial-use-cases> `_ ,
`manufacturing and logistics <https://www.anyscale.com/events/2021/06/22/offline-rl-with-rllib> `_ ,
`finance <https://www.anyscale.com/events/2021/06/22/a-24x-speedup-for-reinforcement-learning-with-rllib-+-ray> `_ ,
`gaming <https://www.anyscale.com/events/2021/06/22/using-reinforcement-learning-to-optimize-iap-offer-recommendations-in-mobile-games> `_ ,
`automobile <https://www.anyscale.com/events/2021/06/23/using-rllib-in-an-enterprise-scale-reinforcement-learning-solution> `_ ,
`robotics <https://www.anyscale.com/events/2021/06/23/introducing-amazon-sagemaker-kubeflow-reinforcement-learning-pipelines-for> `_ ,
`boat design <https://www.youtube.com/watch?v=cLCK13ryTpw> `_ ,
and many others.
2021-11-10 22:20:06 +01:00
RLlib in 60 seconds
-------------------
It'll only take a few steps to get your first RLlib workload up and running on your laptop:
**TensorFlow or PyTorch** :
RLlib does not automatically install a deep-learning framework, but supports
TensorFlow (both 1.x with static-graph and 2.x with eager mode) as well as
PyTorch. Depending on your needs, make sure to install either TensorFlow or
PyTorch (or both as shown below):
.. code-block :: bash
$ conda create -n rllib python=3.8
$ conda activate rllib
$ pip install "ray[rllib]" tensorflow torch
2021-11-13 22:10:47 +01:00
To be able to run our Atari examples, you should also install:
2021-11-10 22:20:06 +01:00
.. code-block :: bash
$ pip install "gym[atari]" "gym[accept-rom-license]" atari_py
After these quick pip installs, you can start coding against RLlib.
Here is an example of running a PPO Trainer on the "`Taxi domain <https://gym.openai.com/envs/Taxi-v3/> `_ "
for a few training iterations, then perform a single evaluation loop
(with rendering enabled):
.. literalinclude :: ../../../rllib/examples/documentation/rllib_in_60s.py
:language: python
:start-after: __rllib-in-60s-begin__
:end-before: __rllib-in-60s-end__
2021-11-13 22:10:47 +01:00
`See here for a simple example on how to write an action inference loop after training. <https://github.com/ray-project/ray/blob/master/rllib/examples/inference_and_serving/policy_inference_after_training.py> `_
2021-11-10 22:20:06 +01:00
Feature Overview
----------------
2021-11-13 22:10:47 +01:00
The following is a summary of RLlib's most striking features.
2021-11-25 09:35:19 +01:00
For an in-depth overview, check out our `documentation <http://docs.ray.io/en/latest/rllib/index.html> `_ .
2021-11-13 22:10:47 +01:00
Click on the images below to see an example script for each of the listed features:
2021-11-18 18:08:40 +01:00
.. include :: feature_overview.rst
2021-11-10 22:20:06 +01:00
Customizations
--------------
RLlib provides simple APIs to customize all aspects of your training- and experimental workflows.
For example, you may code your own `environments <../rllib-env.html#configuring-environments> `__
in python using openAI's gym or DeepMind's OpenSpiel, provide custom
`TensorFlow/Keras- <../rllib-models.html#tensorflow-models> `__ or ,
`Torch models <../rllib-models.html#torch-models> `_ , write your own
`policy- and loss definitions <../rllib-concepts.html#policies> `__ , or define
custom `exploratory behavior <../rllib-training.htmlexploration-api> `_ .
Via mapping one or more agents in your environments to (one or more) policies, multi-agent
RL (MARL) becomes an easy-to-use low-level primitive for our users.
[docs] new structure (#21776)
This PR consolidates both #21667 and #21759 (look there for features), but improves on them in the following way:
- [x] we reverted renaming of existing projects `tune`, `rllib`, `train`, `cluster`, `serve`, `raysgd` and `data` so that links won't break. I think my consolidation efforts with the `ray-` prefix were a little overeager in that regard. It's better like this. Only the creation of `ray-core` was a necessity, and some files moved into the `rllib` folder, so that should be relatively benign.
- [x] Additionally, we added Algolia `docsearch`, screenshot below. This is _much_ better than our current search. Caveat: there's a sphinx dependency that needs to be replaced (`sphinx-tabs`) by another, newer one (`sphinx-panels`), as the former prevents loading of the `algolia.js` library. Will follow-up in the next PR (hoping this one doesn't get re-re-re-re-reverted).
2022-01-22 00:42:05 +01:00
.. figure :: images/rllib-stack.svg
2021-11-10 22:20:06 +01:00
:align: left
2021-11-13 22:10:47 +01:00
:width: 650
2021-11-10 22:20:06 +01:00
**RLlib's API stack:** Built on top of Ray, RLlib offers off-the-shelf, highly distributed
algorithms, policies, loss functions, and default models (including the option to
auto-wrap a neural network with an LSTM or an attention net). Furthermore, our library
comes with a built-in Server/Client setup, allowing you to connect
hundreds of external simulators (clients) via the network to an RLlib server process,
which provides learning functionality and serves action queries. User customizations
are realized via sub-classing the existing abstractions and - by overriding certain
methods in those sub-classes - define custom behavior.