hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 18:41:40 -05:00

Author	SHA1	Message	Date
xwjiang2010	9af8f11191	Revert "[docs] Clean up doc structure (first part) (#21667 )" (#21763 ) This reverts commit `38e46c9fb3`.	2022-01-20 15:30:56 -08:00
Max Pumperla	38e46c9fb3	[docs] Clean up doc structure (first part) (#21667 )	2022-01-20 16:19:04 +01:00
Sven Mika	e5ead6a4b0	[RLlib; Documentation] Minor fixes "rllib in 60s" and per-feature sigils. (#20248 )	2021-11-13 22:10:47 +01:00
Sven Mika	143d23a278	[RLlib] Issue 20062: Action inference examples missing (#20144 )	2021-11-10 18:49:06 +01:00
Philipp Moritz	a64e32c53b	[docs] Fix broken links in documentation and add linkcheck to documentation (#20030 ) Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2021-11-04 13:19:43 -07:00
Sven Mika	2d24ef0d32	[RLlib] Add all simple learning tests as `framework=tf2`. (#19273 ) * Unpin gym and deprecate pendulum v0 Many tests in rllib depended on pendulum v0, however in gym 0.21, pendulum v0 was deprecated in favor of pendulum v1. This may change reward thresholds, so will have to potentially rerun all of the pendulum v1 benchmarks, or use another environment in favor. The same applies to frozen lake v0 and frozen lake v1 Lastly, all of the RLlib tests and Tune tests have been moved to python 3.7 * fix tune test_sampler::testSampleBoundsAx * fix re-install ray for py3.7 tests Co-authored-by: avnishn <avnishn@uw.edu>	2021-11-02 12:10:17 +01:00
Rohan138	b9c9cc5946	[RLlib] Updated PettingZoo+RLlib tutorial; Removed pettingzoo example script (#19069 ) * Updated PettingZoo+RLlib tutorial Updated the tutorial and added link to the blog post by the PettingZoo team. * Ran linting * Converted link to tinyurl for linting * fixed line lengths * Decrease num_workers to 1 * Added comments * Decreased num_workers * Decreased timesteps * Increased num_workers * Update links and remove pettingzoo_env.py * remove pettingzoo.py script from tests Co-authored-by: sven1977 <svenmika1977@gmail.com>	2021-10-29 10:57:10 +02:00
Steven Morad	d8eed68af2	[RLlib] Add differentiable neural computer example (#14844 )	2021-05-19 09:15:39 +02:00
Sven Mika	d89fb82bfb	[RLlib] Add simple curriculum learning API and example script. (#15740 )	2021-05-16 17:35:10 +02:00
Sven Mika	ebc6d8692a	[RLlib] Docs: Example scripts and blogs documentation update. (#15763 )	2021-05-16 15:24:38 +02:00
Simon Mo	f6a8a9be59	[Serve] Add RLlib tutorial (#14194 )	2021-02-22 13:23:12 -08:00
Jeroen Boeye	2af1f0616d	Fix broken link to Flow docs (#14058 )	2021-02-11 13:20:34 -08:00
Eric Liang	9b8218aabd	[docs] Move all /latest links to /master (#11897 ) * use master link * remae * revert non-ray * more * mre	2020-11-10 10:53:28 -08:00
Eric Liang	deea1861ab	[rllib] Try fixing torch GPU and masking errors (#10168 )	2020-08-25 18:34:19 -07:00
Sven Mika	66d204e078	[RLlib] Model documentation enhancements. (#10011 )	2020-08-13 13:36:40 +02:00
Sven Mika	78dfed2683	[RLlib] Issue 8384: QMIX doesn't learn anything. (#9527 )	2020-07-17 12:14:34 +02:00
Sven Mika	d8a081a185	[RLlib] Unity3D integration (n Unity3D clients vs learning server). (#8590 )	2020-05-30 22:48:34 +02:00
Sven Mika	2746fc0476	[RLlib] Auto-framework, retire `use_pytorch` in favor of `framework=...` (#8520 )	2020-05-27 16:19:13 +02:00
Sven Mika	42991d723f	[RLlib] rllib/examples folder restructuring (#8250 ) Cleans up of the rllib/examples folder by moving all example Envs into rllibexamples/env (so they can be used by other scripts and tests as well).	2020-05-01 22:59:34 +02:00
hubcity	3d0a8662b3	#7246 - Fixing broken links (#7247 ) * #7246 - Fixing broken links * Apply suggestions from code review Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-03-25 21:46:13 -07:00
Ameer Haj Ali	1a9948eef9	Update rllib-examples.rst (#6396 )	2019-12-08 16:21:50 -08:00
Eric Liang	249ca2cf9e	[rllib] add blog posts to examples list (#5762 ) * add blog post * remove * link	2019-09-23 10:42:21 -07:00
Eric Liang	592f313210	[rllib] Centralized critic / PPO example on TwoStepGame (#5392 )	2019-08-08 14:03:28 -07:00
Eric Liang	5d7afe8092	[rllib] Try moving RLlib to top level dir (#5324 )	2019-08-05 23:25:49 -07:00
Eric Liang	20450a4e82	[rllib] Add rock paper scissors multi-agent example (#5336 )	2019-08-01 13:03:59 -07:00
Samir Al-Stouhi	51b8915c0a	Added CARLA Community Example (#5333 )	2019-07-31 18:10:50 -07:00
Eric Liang	a62c5f40f6	[rllib] Document ModelV2 and clean up the models/ directory (#5277 )	2019-07-27 02:08:16 -07:00
Eric Liang	34d054ff19	[rllib] ModelV2 API (#4926 )	2019-07-03 15:59:47 -07:00
Eric Liang	9e328fbe6f	[rllib] Add docs on how to use TF eager execution (#4927 )	2019-06-07 16:42:37 -07:00
Eric Liang	7501ee51db	[rllib] Rename PolicyEvaluator => RolloutWorker (#4820 )	2019-06-03 06:49:24 +08:00
Eric Liang	4f46d3e9bf	[rllib] Add multi-agent examples for hand-coded policy, centralized VF (#4554 )	2019-04-09 00:36:49 -07:00
Eric Liang	4b8b703561	[rllib] Some API cleanups and documentation improvements (#4409 )	2019-03-21 21:34:22 -07:00
Eric Liang	6e3384a719	[rllib] Add three new long-running stress tests {APEX, IMPALA, PBT} (#4215 )	2019-03-04 14:05:42 -08:00
Robert Nishihara	4b89eebfc7	Move test folders under rllib/tune from test -> tests. (#4214 )	2019-03-02 13:37:16 -08:00
Eric Liang	d9da183c7d	[rllib] Custom supervised loss API (#4083 )	2019-02-24 15:36:13 -08:00
Eric Liang	fb73cedf70	[rllib] Add examples page, add hierarchical training example, delete SC2 examples (#3815 ) * wip * lint * wip * up * wip * update examples * wip * remove carla * update * improve envspec * link to custom * Update rllib-env.rst * update * fix * fn * lint * ds * ssd games * desc * fix up docs * fix	2019-01-29 21:06:09 -08:00

36 commits