Eric Liang
b45bed4bce
[rllib] Propagate model options correctly in ARS / ES, to action dist of PPO ( #2974 )
...
* fix
* fix
* fix it
* propagate conf to action dist
* move carla example too
* rr
* Update policies.py
* wip
* lint
2018-10-01 12:49:39 -07:00
Eric Liang
715737cc06
[docs] Add backlinks from hyperopt / rl algorithm examples to the built-on Ray libraries ( #1356 )
2017-12-23 00:31:33 -08:00
Eric Liang
fbf1806b8a
[tune] Clean up result logging: move out of /tmp, add timestamp ( #1297 )
2017-12-15 14:19:08 -08:00
Eric Liang
316f9e2bb7
[tune] Support user-defined trainable functions / classes / envs with a shared object registry ( #1226 )
2017-11-20 17:52:43 -08:00
Eric Liang
90013eda2d
[rllib] Fix docs to reference new code locations ( #1092 )
...
* fix rllib docs
* Update example-a3c.rst
2017-10-09 22:58:58 -07:00
Eric Liang
a674ec958c
[rllib] Move policy gradient and evolution strategies algorithms from examples/ to ray/rllib/ ( #694 )
...
* rllib v0
* fix imports
* lint
* comments
* update docs
2017-06-25 22:13:03 +00:00
Eric Liang
06241daf61
Policy gradient example: record stats for tensorboard ( #577 )
...
* add tf metrics
* comments
* fix network scopes
* add doc
* use format string
* fix trace level
* plot intermediate and final sgd stats
* add back a global step
2017-05-21 14:51:24 -07:00
Philipp Moritz
4af0aa6258
Atari on pixels ( #364 )
...
* pong on pixels working (not cleaned up)
* make training compatible with all atari games
* cartpole runs
* Update documentation and usage for policy gradients.
2017-03-14 13:31:29 -07:00
Philipp Moritz
555dcf35a2
Add policy gradient example. ( #344 )
...
* add policy gradient example
* fix typos
* Minor changes plus some documentation.
* Minor fixes.
2017-03-07 23:42:44 -08:00