hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-09 04:46:38 -04:00

Author	SHA1	Message	Date
Eric Liang	b45bed4bce	[rllib] Propagate model options correctly in ARS / ES, to action dist of PPO (#2974) * fix * fix * fix it * propagate conf to action dist * move carla example too * rr * Update policies.py * wip * lint	2018-10-01 12:49:39 -07:00
Eric Liang	715737cc06	[docs] Add backlinks from hyperopt / rl algorithm examples to the built-on Ray libraries (#1356)	2017-12-23 00:31:33 -08:00
Eric Liang	fbf1806b8a	[tune] Clean up result logging: move out of /tmp, add timestamp (#1297)	2017-12-15 14:19:08 -08:00
Eric Liang	316f9e2bb7	[tune] Support user-defined trainable functions / classes / envs with a shared object registry (#1226)	2017-11-20 17:52:43 -08:00
Eric Liang	90013eda2d	[rllib] Fix docs to reference new code locations (#1092) * fix rllib docs * Update example-a3c.rst	2017-10-09 22:58:58 -07:00
Eric Liang	a674ec958c	[rllib] Move policy gradient and evolution strategies algorithms from examples/ to ray/rllib/ (#694) * rllib v0 * fix imports * lint * comments * update docs	2017-06-25 22:13:03 +00:00
Eric Liang	06241daf61	Policy gradient example: record stats for tensorboard (#577) * add tf metrics * comments * fix network scopes * add doc * use format string * fix trace level * plot intermediate and final sgd stats * add back a global step	2017-05-21 14:51:24 -07:00
Philipp Moritz	4af0aa6258	Atari on pixels (#364) * pong on pixels working (not cleaned up) * make training compatible with all atari games * cartpole runs * Update documentation and usage for policy gradients.	2017-03-14 13:31:29 -07:00
Philipp Moritz	555dcf35a2	Add policy gradient example. (#344) * add policy gradient example * fix typos * Minor changes plus some documentation. * Minor fixes.	2017-03-07 23:42:44 -08:00

9 commits