Eric Liang
|
f5d12a958b
|
[rllib] Port Ape-X to distributed execution API (#7497)
|
2020-03-12 00:54:08 -07:00 |
|
Eric Liang
|
a644060daa
|
[rllib] First pass at pipeline implementation of DQN (#7433)
* wip iters
* add test
* speed up
* update docs
* document it
* support serial sampling
* add test
* spacing
* annotate it
* update
* rename to pipeline
* comment
* iter2 wip
* update
* update
* context test
* update
* fix
* fix
* a3c pipeline
* doc
* update
* move timer
* comment
* add piepline test
* fix
* clean up
* document
* iter s
* wip dqn
* wip
* wip
* metrics
* metrics rename
* metrics ctx
* wip
* constants
* add todo
* suppport .union
* wip
* support union
* remove prints
* add todo
* remove auto timer
* fix up
* fix pipeline test
* typing
* fix breakage
* remove bad assert
* wip
* fix multiagent example
* fixapply
* update a3c
* remove a2c pl
* 0 workers
* wip
* wip
* share metrics
* wip
* wip
* doc
* fix weight sync and global var updates
* mode
* fix
* fix
* doc
* fix
|
2020-03-07 14:47:58 -08:00 |
|
Eric Liang
|
0f88444686
|
[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338)
|
2020-03-02 15:16:37 -08:00 |
|
Eric Liang
|
3c6b94f3f5
|
[rllib] Enable performance metrics reporting for RLlib pipelines, add A3C (#7299)
|
2020-02-28 16:44:17 -08:00 |
|
Eric Liang
|
46af992efd
|
[rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213)
|
2020-02-19 16:07:37 -08:00 |
|