hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-09 12:56:46 -04:00

Author	SHA1	Message	Date
Eric Liang	a674ec958c	[rllib] Move policy gradient and evolution strategies algorithms from examples/ to ray/rllib/ (#694 ) * rllib v0 * fix imports * lint * comments * update docs	2017-06-25 22:13:03 +00:00
Eric Liang	06241daf61	Policy gradient example: record stats for tensorboard (#577 ) * add tf metrics * comments * fix network scopes * add doc * use format string * fix trace level * plot intermediate and final sgd stats * add back a global step	2017-05-21 14:51:24 -07:00
Philipp Moritz	4af0aa6258	Atari on pixels (#364 ) * pong on pixels working (not cleaned up) * make training compatible with all atari games * cartpole runs * Update documentation and usage for policy gradients.	2017-03-14 13:31:29 -07:00
Philipp Moritz	555dcf35a2	Add policy gradient example. (#344 ) * add policy gradient example * fix typos * Minor changes plus some documentation. * Minor fixes.	2017-03-07 23:42:44 -08:00