hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

History

Sven Mika e2edca45d4 [RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238 ) * Take out stats to analyze memory leak in torch PPO. * WIP * WIP * WIP * WIP * WIP * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * WIP. * LINT. * Fix determine_tests_to_run.py. * minor change to re-test after determine_tests_to_run.py. * LINT. * update comments. * WIP * WIP * WIP * FIX. * Fix sequence_mask being dependent on torch being installed. * Fix strange ray-core tf-error in test_memory_scheduling test case. * Fix strange ray-core tf-error in test_memory_scheduling test case. * Fix strange ray-core tf-error in test_memory_scheduling test case. * Fix strange ray-core tf-error in test_memory_scheduling test case.		2020-02-22 11:02:31 -08:00
..
agents	[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238 )	2020-02-22 11:02:31 -08:00
contrib	[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178 )	2020-02-15 14:50:44 -08:00
env	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
evaluation	[rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213 )	2020-02-19 16:07:37 -08:00
examples	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155 )	2020-02-19 12:18:45 -08:00
models	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155 )	2020-02-19 12:18:45 -08:00
offline	[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155 )	2020-02-19 12:18:45 -08:00
optimizers	[rllib] Fix bad sample count assert	2020-02-15 17:22:23 -08:00
policy	[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238 )	2020-02-22 11:02:31 -08:00
tests	Fix old exploration configs. (#7240 )	2020-02-20 08:39:16 -08:00
tuned_examples	Fix old exploration configs. (#7240 )	2020-02-20 08:39:16 -08:00
utils	[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238 )	2020-02-22 11:02:31 -08:00
__init__.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
asv.conf.json	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
BUILD	[RLlib] PPO torch memory leak and unnecessary torch.Tensor creation and gc'ing. (#7238 )	2020-02-22 11:02:31 -08:00
README.md	MADDPG implementation in RLlib (#5348 )	2019-08-06 16:22:06 -07:00
rollout.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
scripts.py	Remove future imports (#6724 )	2020-01-09 00:15:48 -08:00
train.py	[RLlib] Move all jenkins RLlib-tests into bazel (rllib/BUILD). (#7178 )	2020-02-15 14:50:44 -08:00

README.md

RLlib: Scalable Reinforcement Learning

RLlib is an open-source library for reinforcement learning that offers both high scalability and a unified API for a variety of applications.

For an overview of RLlib, see the documentation.

If you've found RLlib useful for your research, you can cite the paper as follows:

@inproceedings{liang2018rllib,
    Author = {Eric Liang and
              Richard Liaw and
              Robert Nishihara and
              Philipp Moritz and
              Roy Fox and
              Ken Goldberg and
              Joseph E. Gonzalez and
              Michael I. Jordan and
              Ion Stoica},
    Title = {{RLlib}: Abstractions for Distributed Reinforcement Learning},
    Booktitle = {International Conference on Machine Learning ({ICML})},
    Year = {2018}
}

Development Install

You can develop RLlib locally without needing to compile Ray by using the setup-dev.py script. This sets up links between the rllib dir in your git repo and the one bundled with the ray package. When using this script, make sure that your git branch is in sync with the installed Ray binaries (i.e., you are up-to-date on master and have the latest wheel installed.)