Eric Liang | 9a83908c46 | [rllib] Deprecate policy optimizers (#8345) | 2020-05-21 10:16:18 -07:00
Eric Liang | 6bf1dc0888 | [rllib] [hotfix] Build broken due to merge conflict: MixInReplay has no attribute buffer | 2020-05-13 12:21:04 -07:00
Eric Liang | 96f4d82cc3 | [rllib] Qmix replay ratio is wrong | 2020-05-12 13:07:19 -07:00
Eric Liang | 2c599dbf05 | [rllib] Port QMIX, MADDPG to new execution API (#8344) | 2020-05-07 23:41:10 -07:00
Eric Liang | ee0eb44a32 | Rename async_queue_depth -> num_async (#8207) * rename * lint | 2020-05-05 01:38:10 -07:00
Eric Liang | 2298f6fb40 | [rllib] Port DQN/Ape-X to training workflow api (#8077) | 2020-04-23 12:39:19 -07:00
Eric Liang | 31b40b00f6 | [rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958) | 2020-04-10 00:56:08 -07:00