Eric Liang
|
288933ec6b
|
[rllib] Fix shared metrics context in parallel iterators (#7666)
* debug
* build
* update
* wip
* wpi
* update
* recurisve sync
* comment
* stream
* fix
* Update .travis.yml
|
2020-03-22 14:15:01 -07:00 |
|
Eric Liang
|
dd70720578
|
[rllib] Rename sample_batch_size => rollout_fragment_length (#7503)
* bulk rename
* deprecation warn
* update doc
* update fig
* line length
* rename
* make pytest comptaible
* fix test
* fi sys
* rename
* wip
* fix more
* lint
* update svg
* comments
* lint
* fix use of batch steps
|
2020-03-14 12:05:04 -07:00 |
|
Eric Liang
|
f5d12a958b
|
[rllib] Port Ape-X to distributed execution API (#7497)
|
2020-03-12 00:54:08 -07:00 |
|
Eric Liang
|
a644060daa
|
[rllib] First pass at pipeline implementation of DQN (#7433)
* wip iters
* add test
* speed up
* update docs
* document it
* support serial sampling
* add test
* spacing
* annotate it
* update
* rename to pipeline
* comment
* iter2 wip
* update
* update
* context test
* update
* fix
* fix
* a3c pipeline
* doc
* update
* move timer
* comment
* add piepline test
* fix
* clean up
* document
* iter s
* wip dqn
* wip
* wip
* metrics
* metrics rename
* metrics ctx
* wip
* constants
* add todo
* suppport .union
* wip
* support union
* remove prints
* add todo
* remove auto timer
* fix up
* fix pipeline test
* typing
* fix breakage
* remove bad assert
* wip
* fix multiagent example
* fixapply
* update a3c
* remove a2c pl
* 0 workers
* wip
* wip
* share metrics
* wip
* wip
* doc
* fix weight sync and global var updates
* mode
* fix
* fix
* doc
* fix
|
2020-03-07 14:47:58 -08:00 |
|
Eric Liang
|
0f88444686
|
[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338)
|
2020-03-02 15:16:37 -08:00 |
|
Eric Liang
|
3c6b94f3f5
|
[rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299)
|
2020-02-28 16:44:17 -08:00 |
|
Eric Liang
|
46af992efd
|
[rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213)
|
2020-02-19 16:07:37 -08:00 |
|