Eric Liang
1251abf0d1
[rllib] Modularize Torch and TF policy graphs ( #2294 )
...
* wip
* cls
* re
* wip
* wip
* a3c working
* torch support
* pg works
* lint
* rm v2
* consumer id
* clean up pg
* clean up more
* fix python 2.7
* tf session management
* docs
* dqn wip
* fix compile
* dqn
* apex runs
* up
* impotrs
* ddpg
* quotes
* fix tests
* fix last r
* fix tests
* lint
* pass checkpoint restore
* kwar
* nits
* policy graph
* fix yapf
* com
* class
* pyt
* vectorization
* update
* test cpe
* unit test
* fix ddpg2
* changes
* wip
* args
* faster test
* common
* fix
* add alg option
* batch mode and policy serving
* multi serving test
* todo
* wip
* serving test
* doc async env
* num envs
* comments
* thread
* remove init hook
* update
* fix ppo
* comments1
* fix
* updates
* add jenkins tests
* fix
* fix pytorch
* fix
* fixes
* fix a3c policy
* fix squeeze
* fix trunc on apex
* fix squeezing for real
* update
* remove horizon test for now
* multiagent wip
* update
* fix race condition
* fix ma
* t
* doc
* st
* wip
* example
* wip
* working
* cartpole
* wip
* batch wip
* fix bug
* make other_batches None default
* working
* debug
* nit
* warn
* comments
* fix ppo
* fix obs filter
* update
* wip
* tf
* update
* fix
* cleanup
* cleanup
* spacing
* model
* fix
* dqn
* fix ddpg
* doc
* keep names
* update
* fix
* com
* docs
* clarify model outputs
* Update torch_policy_graph.py
* fix obs filter
* pass thru worker index
* fix
* rename
* vlad torch comments
* fix log action
* debug name
* fix lstm
* remove unused ddpg net
* remove conv net
* revert lstm
* cast
* clean up
* fix lstm check
* move to end
* fix sphinx
* fix cmd
* remove bad doc
* clarify
* copy
* async sa
* fix
2018-06-26 13:17:15 -07:00
Eric Liang
882a649f0c
[rllib] [docs] Cleanup RLlib API and make docs consistent with upcoming blog post ( #1708 )
...
* wip
* more work
* fix apex
* docs
* apex doc
* pool comment
* clean up
* make wrap stack pluggable
* Mon Mar 12 21:45:50 PDT 2018
* clean up comment
* table
* Mon Mar 12 22:51:57 PDT 2018
* Mon Mar 12 22:53:05 PDT 2018
* Mon Mar 12 22:55:03 PDT 2018
* Mon Mar 12 22:56:18 PDT 2018
* Mon Mar 12 22:59:54 PDT 2018
* Update apex_optimizer.py
* Update index.rst
* Update README.rst
* Update README.rst
* comments
* Wed Mar 14 19:01:02 PDT 2018
2018-03-15 15:57:31 -07:00
alvkao58
81a4be8f65
[rllib] Added vanilla policy gradient ( #1497 )
2018-02-10 13:54:51 -08:00
Eric Liang
e216766bbc
[rllib] Update docs with api and components overview figures ( #1443 )
2018-01-19 10:08:45 -08:00
Eric Liang
6e6674a824
[rllib] Split docs into user and development guide ( #1377 )
...
* docs
* Update README.rst
* Sat Dec 30 15:23:49 PST 2017
* comments
* Sun Dec 31 23:33:30 PST 2017
* Sun Dec 31 23:33:38 PST 2017
* Sun Dec 31 23:37:46 PST 2017
* Sun Dec 31 23:39:28 PST 2017
* Sun Dec 31 23:43:05 PST 2017
* Sun Dec 31 23:51:55 PST 2017
* Sun Dec 31 23:52:51 PST 2017
2018-01-01 11:10:44 -08:00