hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-12 14:16:39 -04:00

History

Eric Liang baadbdf8d4 [rllib] Execute PPO using training workflow (#8206 ) * wip * add kl * kl * works now * doc update * reorg * add ddppo * add stats * fix fetch * comment * fix learner stat regression * test fixes * fix test		2020-04-30 01:18:09 -07:00
..
agents	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00
contrib	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
env	[RLlib] SAC Torch (incl. Atari learning) (#7984 )	2020-04-15 13:25:16 +02:00
evaluation	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00
examples	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
execution	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00
models	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00
offline	Remove six and cloudpickle from setup.py. (#7694 )	2020-03-23 11:42:05 -07:00
optimizers	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00
policy	[RLlib] Remove TupleActions and support arbitrarily nested action spaces. (#8143 )	2020-04-28 14:59:16 +02:00
tests	[rllib] Execute PPO using training workflow (#8206 )	2020-04-30 01:18:09 -07:00
tuned_examples	[RLlib] PyTorch version of APPO. (#8120 )	2020-04-23 09:11:12 +02:00
utils	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
__init__.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
asv.conf.json	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
BUILD	[RLlib] Deprecate all Model(v1) usage. (#8146 )	2020-04-29 12:12:59 +02:00
README.md	Replace all instances of ray.readthedocs.io with ray.io (#7994 )	2020-04-13 16:17:05 -07:00
rollout.py	[RLlib] Nested action space PR (minimally invasive; torch only + test). (#8101 )	2020-04-23 09:09:22 +02:00
scripts.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
train.py	[rllib] Port DQN/Ape-X to training workflow api (#8077 )	2020-04-23 12:39:19 -07:00

README.md

RLlib: Scalable Reinforcement Learning

RLlib is an open-source library for reinforcement learning that offers both high scalability and a unified API for a variety of applications.

For an overview of RLlib, see the documentation.

If you've found RLlib useful for your research, you can cite the paper as follows:

@inproceedings{liang2018rllib,
    Author = {Eric Liang and
              Richard Liaw and
              Robert Nishihara and
              Philipp Moritz and
              Roy Fox and
              Ken Goldberg and
              Joseph E. Gonzalez and
              Michael I. Jordan and
              Ion Stoica},
    Title = {{RLlib}: Abstractions for Distributed Reinforcement Learning},
    Booktitle = {International Conference on Machine Learning ({ICML})},
    Year = {2018}
}

Development Install

You can develop RLlib locally without needing to compile Ray by using the setup-dev.py script. This sets up links between the rllib dir in your git repo and the one bundled with the ray package. When using this script, make sure that your git branch is in sync with the installed Ray binaries (i.e., you are up-to-date on master and have the latest wheel installed.)