RLlib: Scalable Reinforcement Learning

RLlib is an open-source library for reinforcement learning that offers both high scalability and a unified API for a variety of applications.

For an overview of RLlib, see the documentation at https://docs.ray.io/en/master/rllib.html.
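
Below is a minimal sketch of that API (a quick-start example not taken from this README, assuming ray[rllib] and PyTorch are installed): it creates a PPO trainer with two rollout workers on CartPole and runs a few training iterations.

    import ray
    from ray.rllib.agents.ppo import PPOTrainer

    ray.init()
    # Two rollout workers collect experiences in parallel;
    # training itself runs in the driver process.
    trainer = PPOTrainer(
        env="CartPole-v0",
        config={"num_workers": 2, "framework": "torch"},
    )
    for _ in range(3):
        result = trainer.train()
        print(result["episode_reward_mean"])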

If you've found RLlib useful for your research, you can cite the paper as follows:

@inproceedings{liang2018rllib,
    Author = {Eric Liang and
              Richard Liaw and
              Robert Nishihara and
              Philipp Moritz and
              Roy Fox and
              Ken Goldberg and
              Joseph E. Gonzalez and
              Michael I. Jordan and
              Ion Stoica},
    Title = {{RLlib}: Abstractions for Distributed Reinforcement Learning},
    Booktitle = {International Conference on Machine Learning ({ICML})},
    Year = {2018}
}

Development Install

You can develop RLlib locally without needing to compile Ray by using the setup-dev.py script. This sets up symlinks between the rllib dir in your git checkout and the one bundled with the installed ray package. When using this script, make sure that your git branch is in sync with the installed Ray binaries (i.e., you are up-to-date on master and have the latest wheel installed).
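
The commands below sketch one typical workflow; the pip package name and script path reflect a standard Ray checkout, so adjust them to your setup:

    # Install the latest Ray wheel, then symlink the bundled
    # rllib dir to the one in your git checkout:
    pip install -U ray
    git clone https://github.com/ray-project/ray.git
    cd ray
    python python/ray/setup-dev.py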