hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Sven Mika 19c8033df2 [RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366 ) * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * LINT and fixes. MB-MPO and MAML not working yet. * wip * update * update * rmeove * remove dep * higher * Update requirements_rllib.txt * Update requirements_rllib.txt * relpos * no mbmpo Co-authored-by: Eric Liang <ekhliang@gmail.com>		2020-12-01 17:41:10 -08:00
..
agents	[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366 )	2020-12-01 17:41:10 -08:00
contrib	[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366 )	2020-12-01 17:41:10 -08:00
env	[RLlib] Add ResetOnExceptionWrapper with tests for unstable 3rd party envs (#12353 )	2020-11-25 08:41:58 +01:00
evaluation	[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366 )	2020-12-01 17:41:10 -08:00
examples	[RLlib] Attention Net prep PR #2 : Smaller cleanups. (#12449 )	2020-12-01 08:21:45 +01:00
execution	[RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945 )	2020-11-29 12:31:24 +01:00
models	[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366 )	2020-12-01 17:41:10 -08:00
offline	[RLlib] Fix offline logp vs prob bug in OffPolicyEstimator class. (#12158 )	2020-11-20 08:59:43 +01:00
policy	[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366 )	2020-12-01 17:41:10 -08:00
tests	Revert "Re-Revert "[Core] zero-copy serializer for pytorch (#12344 )" (#12478 )" (#12515 )	2020-11-30 19:05:55 -08:00
tuned_examples	[RLlib] PyBullet Env native support via env str-specifier (if installed). (#12209 )	2020-11-30 12:41:24 +01:00
utils	[RLlib] Curiosity exploration module: tf/tf2.x/tf-eager support. (#11945 )	2020-11-29 12:31:24 +01:00
__init__.py	[RLlib] First attempt at cleaning up algo code in RLlib: PG. (#10115 )	2020-08-20 17:05:57 +02:00
asv.conf.json	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
BUILD	[RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366 )	2020-12-01 17:41:10 -08:00
README.md	[docs] Move all /latest links to /master (#11897 )	2020-11-10 10:53:28 -08:00
rollout.py	[RLlib] rollout batch, handle rewards that are None (unknown) in a multi-agent env (#11858 ) (#11911 )	2020-11-25 13:39:22 +01:00
scripts.py	[RLlib] Deprecate old classes, methods, functions, config keys (in prep for RLlib 1.0). (#10544 )	2020-09-06 10:58:00 +02:00
train.py	[tune] move _SCHEDULERS to tune.schedulers and add all available schedulers (#11218 )	2020-10-08 16:10:23 -07:00

README.md

RLlib: Scalable Reinforcement Learning

RLlib is an open-source library for reinforcement learning that offers both high scalability and a unified API for a variety of applications.

For an overview of RLlib, see the documentation.

If you've found RLlib useful for your research, you can cite the paper as follows:

@inproceedings{liang2018rllib,
    Author = {Eric Liang and
              Richard Liaw and
              Robert Nishihara and
              Philipp Moritz and
              Roy Fox and
              Ken Goldberg and
              Joseph E. Gonzalez and
              Michael I. Jordan and
              Ion Stoica},
    Title = {{RLlib}: Abstractions for Distributed Reinforcement Learning},
    Booktitle = {International Conference on Machine Learning ({ICML})},
    Year = {2018}
}

Development Install

You can develop RLlib locally without needing to compile Ray by using the setup-dev.py script. This sets up links between the rllib dir in your git repo and the one bundled with the ray package. When using this script, make sure that your git branch is in sync with the installed Ray binaries (i.e., you are up-to-date on master and have the latest wheel installed.)