
RLlib: Scalable Reinforcement Learning

RLlib is an open-source library for reinforcement learning that offers both high scalability and a unified API for a variety of applications.

For an overview of RLlib, see the documentation.

If you've found RLlib useful for your research, you can cite the paper as follows:

@inproceedings{liang2018rllib,
    Author = {Eric Liang and
              Richard Liaw and
              Robert Nishihara and
              Philipp Moritz and
              Roy Fox and
              Ken Goldberg and
              Joseph E. Gonzalez and
              Michael I. Jordan and
              Ion Stoica},
    Title = {{RLlib}: Abstractions for Distributed Reinforcement Learning},
    Booktitle = {International Conference on Machine Learning ({ICML})},
    Year = {2018}
}

Development Install

You can develop RLlib locally without compiling Ray by using the setup-dev.py script. It sets up links between the rllib directory in your git repo and the one bundled with the installed ray package. When using this script, make sure that your git branch is in sync with the installed Ray binaries (i.e., you are up to date on master and have the latest wheel installed).
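
To illustrate what "setting up links" means in practice, here is a minimal sketch of the general idea: replacing an installed package directory with a symlink to a source checkout. This is not the code of setup-dev.py itself. The helper name `link_package_dir` and the directory layout are hypothetical and exist only for this demonstration, which runs entirely in a temporary directory and does not touch a real Ray install.

```python
import os
import shutil
import tempfile


def link_package_dir(repo_dir: str, installed_dir: str) -> None:
    """Replace installed_dir with a symlink pointing at repo_dir.

    Hypothetical helper for illustration; not part of Ray.
    """
    if os.path.islink(installed_dir):
        os.unlink(installed_dir)
    elif os.path.isdir(installed_dir):
        shutil.rmtree(installed_dir)
    os.symlink(os.path.abspath(repo_dir), installed_dir, target_is_directory=True)


# Build a throwaway layout that mimics a git checkout and an installed package.
root = tempfile.mkdtemp()
repo_rllib = os.path.join(root, "repo", "rllib")
installed_rllib = os.path.join(root, "site-packages", "ray", "rllib")
os.makedirs(repo_rllib)
os.makedirs(installed_rllib)

# A file that only exists in the bundled (installed) copy.
with open(os.path.join(installed_rllib, "bundled.txt"), "w") as f:
    f.write("from the wheel\n")

link_package_dir(repo_rllib, installed_rllib)

# An edit made in the checkout is now visible through the installed path.
with open(os.path.join(repo_rllib, "edited.py"), "w") as f:
    f.write("# local change\n")

linked = os.path.islink(installed_rllib)
visible = os.path.exists(os.path.join(installed_rllib, "edited.py"))
bundled_gone = not os.path.exists(os.path.join(installed_rllib, "bundled.txt"))

shutil.rmtree(root)  # clean up the demonstration directory
```

Because the installed path now resolves to the checkout, the compiled parts of Ray still come from the wheel while the Python sources come from your branch. This is why the two need to be in sync: a checkout that is far ahead of or behind the wheel may call into binaries that do not match it.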