hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Eric Liang	7e3e4cd321	[rllib] Execution plan API documentation (#10000) * wip * updte * comments	2020-08-11 23:58:41 -07:00
Eric Liang	4b62a888cc	[rllib] Remove deprecated policy optimizer package. (#9262)	2020-07-02 14:39:40 -07:00
Eric Liang	9a83908c46	[rllib] Deprecate policy optimizers (#8345)	2020-05-21 10:16:18 -07:00
Richard Liaw	b506f87117	[tune] New Doc edits, add Concepts page (#8083) Co-Authored-By: Sven Mika <sven@anyscale.io>	2020-04-25 18:25:56 -07:00
hubcity	3d0a8662b3	#7246 - Fixing broken links (#7247) * #7246 - Fixing broken links * Apply suggestions from code review Co-authored-by: Richard Liaw <rliaw@berkeley.edu>	2020-03-25 21:46:13 -07:00
Eric Liang	dd70720578	[rllib] Rename sample_batch_size => rollout_fragment_length (#7503) * bulk rename * deprecation warn * update doc * update fig * line length * rename * make pytest comptaible * fix test * fi sys * rename * wip * fix more * lint * update svg * comments * lint * fix use of batch steps	2020-03-14 12:05:04 -07:00
Yutai Zhou	9b6794cbb0	[rllib] updated policy definition link (#6989)	2020-01-31 16:22:11 -08:00
Sven Mika	c957ed58ed	[RLlib] Implement PPO torch version. (#6826)	2020-01-20 23:06:50 -08:00
Sven Mika	e6227082bd	[RLlib] Add `torch` flag to train.py (#6807)	2020-01-17 18:48:44 -08:00
gehring	8903bcd0c3	[rllib] Tracing for eager tensorflow policies with `tf.function` (#5705) * Added tracing of eager policies with `tf.function` * lint * add config option * add docs * wip * tracing now works with a3c * typo * none * file doc * returns * syntax error * syntax error	2019-09-17 01:44:20 -07:00
Richard Liaw	34f6d2fc5c	[tune] Update trainable docs and support hparams (#5558)	2019-09-04 12:44:42 -07:00
gehring	b520f6141e	[rllib] Adds eager support with a generic `TFEagerPolicy` class (#5436)	2019-08-23 14:21:11 +08:00
Eric Liang	a1d2e17623	[rllib] Autoregressive action distributions (#5304)	2019-08-10 14:05:12 -07:00
Eric Liang	5d7afe8092	[rllib] Try moving RLlib to top level dir (#5324)	2019-08-05 23:25:49 -07:00
Richard Liaw	1eaa57c98f	[tune] Distributed example + walkthrough (#5157)	2019-08-02 09:17:20 -07:00
Eric Liang	20450a4e82	[rllib] Add rock paper scissors multi-agent example (#5336)	2019-08-01 13:03:59 -07:00
Eric Liang	a62c5f40f6	[rllib] Document ModelV2 and clean up the models/ directory (#5277)	2019-07-27 02:08:16 -07:00
Eric Liang	f9043cc49a	[rllib] Remove experimental eager support	2019-07-21 12:27:17 -07:00
Eric Liang	047f4ccd61	[rllib] Fix rollout.py with tuple action space (#5201) * fix it * update doc too * fix rollout	2019-07-16 10:52:35 -07:00
Eric Liang	34d054ff19	[rllib] ModelV2 API (#4926)	2019-07-03 15:59:47 -07:00
Eric Liang	9e328fbe6f	[rllib] Add docs on how to use TF eager execution (#4927)	2019-06-07 16:42:37 -07:00
Eric Liang	7501ee51db	[rllib] Rename PolicyEvaluator => RolloutWorker (#4820)	2019-06-03 06:49:24 +08:00
Eric Liang	9aa1cd613d	[rllib] Allow Torch policies access to full action input dict in extra_action_out_fn (#4894) * fix torch extra out * preserve setitem * fix docs	2019-06-01 16:58:49 +08:00
Eric Liang	1c073e92e4	[rllib] Fix documentation on custom policies (#4910) * wip * add docs * lint * todo sections * fix doc	2019-06-01 16:13:21 +08:00
Eric Liang	a45c61e19b	[rllib] Update concepts docs and add "Building Policies in Torch/TensorFlow" section (#4821) * wip * fix index * fix bugs * todo * add imports * note on get ph * note on get ph * rename to building custom algs * add rnn state info	2019-05-27 14:17:32 -07:00
Eric Liang	02583a8598	[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819) This implements some of the renames proposed in #4813. We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.	2019-05-20 16:46:05 -07:00
Eric Liang	71b2dec3b4	[rllib] Fix bounds of space returned by preprocessor.observation_space (#4736)	2019-05-05 18:25:38 -07:00
Eric Liang	6848dfd179	[rllib] Replace ray.get() with ray_get_and_free() to optimize memory usage (#4586)	2019-04-17 20:30:03 -04:00
Eric Liang	6e7680bf21	[rllib] Clean up concepts documentation and policy optimizer creation (#4592)	2019-04-12 21:03:26 -07:00
Eric Liang	59901a88a0	[rllib] Native support for Dict and Tuple spaces; fix Tuple action spaces; add prev a, r to LSTM (#3051)	2018-10-20 15:21:22 -07:00
Sergey Kolesnikov	05490b8cb9	[rllib] dqn/ddpg policy customization (#2445) * dqn policy update - more customization * docs for custom DQN graph * Update rllib-training.rst * Update rllib-models.rst * Update rllib.rst * Update rllib-training.rst * Update rllib-concepts.rst * yapf codestyle	2018-07-22 14:47:14 -07:00
Eric Liang	b316afeb43	[rllib] Add debug info back to PPO and fix optimizer compatibility (#2366)	2018-07-12 19:22:46 +02:00
Eric Liang	4ef9d15315	[rllib] Add concepts section of docs (#2373) This fills in the rllib concepts documentation.	2018-07-08 18:46:52 -07:00

33 commits