Eric Liang
5d7afe8092
[rllib] Try moving RLlib to top level dir ( #5324 )
2019-08-05 23:25:49 -07:00
Eric Liang
b5799b5286
[rllib] Set PPO observation filter to NoFilter by default ( #4191 )
2019-03-01 13:19:33 -08:00
Eric Liang
e4bea8d10e
[rllib] Default to truncate_episodes and add some more config validators ( #2967 )
...
* update
* link it
* warn about truncation
* fix
* Update rllib-training.rst
* deprecate tests failing
2018-09-30 18:37:55 -07:00
Eric Liang
995ac24a2c
[rllib] clarify train batch size for PPO ( #2793 )
...
It's possible to configure PPO in a way that ends up discarding most of the samples (they are treated as "stragglers"). Add a warning when this happens, and raise an exception if the waste is particularly egregious.
2018-09-05 12:06:13 -07:00
Eric Liang
8aa56c12e6
[rllib] Document "v2" APIs ( #2316 )
...
* re
* wip
* wip
* a3c working
* torch support
* pg works
* lint
* rm v2
* consumer id
* clean up pg
* clean up more
* fix python 2.7
* tf session management
* docs
* dqn wip
* fix compile
* dqn
* apex runs
* up
* impotrs
* ddpg
* quotes
* fix tests
* fix last r
* fix tests
* lint
* pass checkpoint restore
* kwar
* nits
* policy graph
* fix yapf
* com
* class
* pyt
* vectorization
* update
* test cpe
* unit test
* fix ddpg2
* changes
* wip
* args
* faster test
* common
* fix
* add alg option
* batch mode and policy serving
* multi serving test
* todo
* wip
* serving test
* doc async env
* num envs
* comments
* thread
* remove init hook
* update
* fix ppo
* comments1
* fix
* updates
* add jenkins tests
* fix
* fix pytorch
* fix
* fixes
* fix a3c policy
* fix squeeze
* fix trunc on apex
* fix squeezing for real
* update
* remove horizon test for now
* multiagent wip
* update
* fix race condition
* fix ma
* t
* doc
* st
* wip
* example
* wip
* working
* cartpole
* wip
* batch wip
* fix bug
* make other_batches None default
* working
* debug
* nit
* warn
* comments
* fix ppo
* fix obs filter
* update
* wip
* tf
* update
* fix
* cleanup
* cleanup
* spacing
* model
* fix
* dqn
* fix ddpg
* doc
* keep names
* update
* fix
* com
* docs
* clarify model outputs
* Update torch_policy_graph.py
* fix obs filter
* pass thru worker index
* fix
* rename
* vlad torch comments
* fix log action
* debug name
* fix lstm
* remove unused ddpg net
* remove conv net
* revert lstm
* wip
* wip
* cast
* wip
* works
* fix a3c
* works
* lstm util test
* doc
* clean up
* update
* fix lstm check
* move to end
* fix sphinx
* fix cmd
* remove bad doc
* envs
* vec
* doc prep
* models
* rl
* alg
* up
* clarify
* copy
* async sa
* fix
* comments
* fix a3c conf
* tune lstm
* fix reshape
* fix
* back to 16
* tuned a3c update
* update
* tuned
* optional
* merge
* wip
* fix up
* move pg class
* rename env
* wip
* update
* tip
* alg
* readme
* fix catalog
* readme
* doc
* context
* remove prep
* comma
* add env
* link to paper
* paper
* update
* rnn
* update
* wip
* clean up ev creation
* fix
* fix
* fix
* fix lint
* up
* no comma
* ma
* Update run_multi_node_tests.sh
* fix
* sphinx is stupid
* sphinx is stupid
* clarify torch graph
* no horizon
* fix config
* sb
* Update test_optimizers.py
2018-07-01 00:05:08 -07:00
Eric Liang
7ab890f4a1
[tune] [rllib] Automatically determine RLlib resources and add queueing mechanism for autoscaling ( #1848 )
2018-04-16 16:58:15 -07:00
Eric Liang
72595cca0d
[tune] Change tune resource request syntax to be less confusing ( #1764 )
...
* update
* update examples
* Wed Mar 21 15:19:56 PDT 2018
* Wed Mar 21 15:21:32 PDT 2018
* Update train_a3c.py
* Update train.py
* fix resources accounting
2018-03-23 06:25:01 -07:00
Eric Liang
b41bdcefa0
[rllib] Update RLlib to work with new actor scheduling behavior ( #1754 )
...
* Mon Mar 19 21:23:01 PDT 2018
* Mon Mar 19 21:23:07 PDT 2018
* Mon Mar 19 21:30:49 PDT 2018
* Mon Mar 19 21:32:05 PDT 2018
* Mon Mar 19 21:35:43 PDT 2018
* fix cpu limits
* Mon Mar 19 22:25:07 PDT 2018
2018-03-20 19:29:52 -07:00
Eric Liang
316f9e2bb7
[tune] Support user-defined trainable functions / classes / envs with a shared object registry ( #1226 )
2017-11-20 17:52:43 -08:00
Eric Liang
52888e4c6f
[tune] Improve the tune Python API and variant generation ( #1154 )
...
* new variant gen
* wip
* Sat Oct 21 18:21:34 PDT 2017
* update
* comment
* fix
* update
* update readme
* fix
* Update README.rst
* Update README.rst
* fix repeat
* update
* note on restore
2017-11-06 23:41:17 -08:00
Eric Liang
3b157ab933
[tune] Allow resources to not all be assigned to the driver ( #1150 )
...
* dgpu
* update
* update
* update
* also support cmdline
* limit
* Update README.rst
* documentation
* typo
* small coverage for driver_gpu_limit
* lint
* fix lint
2017-10-28 22:16:05 -07:00
Eric Liang
5a50e0e1d7
[rllib] Add the ability to run arbitrary Python scripts with ray.tune ( #1132 )
...
* fix yaml bug
* add ext agent
* gpus
* update
* tuning
* docs
* Sun Oct 15 21:09:25 PDT 2017
* lint
* update
* Sun Oct 15 22:39:55 PDT 2017
* Sun Oct 15 22:40:17 PDT 2017
* Sun Oct 15 22:43:06 PDT 2017
* Sun Oct 15 22:46:06 PDT 2017
* Sun Oct 15 22:46:21 PDT 2017
* Sun Oct 15 22:48:11 PDT 2017
* Sun Oct 15 22:48:44 PDT 2017
* Sun Oct 15 22:49:23 PDT 2017
* Sun Oct 15 22:50:21 PDT 2017
* Sun Oct 15 22:53:00 PDT 2017
* Sun Oct 15 22:53:34 PDT 2017
* Sun Oct 15 22:54:33 PDT 2017
* Sun Oct 15 22:54:50 PDT 2017
* Sun Oct 15 22:55:20 PDT 2017
* Sun Oct 15 22:56:56 PDT 2017
* Sun Oct 15 22:59:03 PDT 2017
* fix
* Update tune_mnist_ray.py
* remove script trial
* fix
* reorder
* fix ex
* py2 support
* upd
* comments
* comments
* cleanup readme
* fix trial
* annotate
* Update rllib.rst
2017-10-18 11:49:28 -07:00
Eric Liang
79ea205b3e
[rllib] Initial work on integrating hyperparameter search tool ( #1107 )
...
* clean up train
* update
* update train script
* add tuned examples
* add agent catalog
* add tune lib
* update
* fix
* testS
* remove
* train docs
* comments
* todo
* fix resource parsing
* fix cr test
* add test
* try to fix travis test
2017-10-13 16:18:16 -07:00