| Michael Luo | becca1424d | [RLLib] Execution-Folder Type Annotations (#12760) | 2020-12-14 19:16:44 +01:00 |
| Eric Liang | ecdaaffc67 | add large data warning (#10957) | 2020-09-23 15:46:06 -07:00 |
| Sven Mika | 805dad3bc4 | [RLlib] SAC algo cleanup. (#10825) | 2020-09-20 11:27:02 +02:00 |
| Sven Mika | 2256047876 | [RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114) | 2020-08-15 13:24:22 +02:00 |
| Eric Liang | 1e0e1a45e6 | [rllib] Add type annotations for evaluation/, env/ packages (#9003) | 2020-06-19 13:09:05 -07:00 |
| Eric Liang | 34bae27ac7 | [rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893) | 2020-06-12 20:17:27 -07:00 |
| Eric Liang | 9a83908c46 | [rllib] Deprecate policy optimizers (#8345) | 2020-05-21 10:16:18 -07:00 |
| Eric Liang | 6bf1dc0888 | [rllib] [hotfix] Build broken due to merge conflict: MixInReplay has no attribute buffer | 2020-05-13 12:21:04 -07:00 |
| Eric Liang | 96f4d82cc3 | [rllib] Qmix replay ratio is wrong | 2020-05-12 13:07:19 -07:00 |
| Eric Liang | 2c599dbf05 | [rllib] Port QMIX, MADDPG to new execution API (#8344) | 2020-05-07 23:41:10 -07:00 |
| Eric Liang | ee0eb44a32 | Rename async_queue_depth -> num_async (#8207) * rename * lint | 2020-05-05 01:38:10 -07:00 |
| Eric Liang | 2298f6fb40 | [rllib] Port DQN/Ape-X to training workflow api (#8077) | 2020-04-23 12:39:19 -07:00 |
| Eric Liang | 31b40b00f6 | [rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958) | 2020-04-10 00:56:08 -07:00 |