Eric Liang
|
ecdaaffc67
|
add large data warning (#10957)
|
2020-09-23 15:46:06 -07:00 |
|
Eric Liang
|
daa03ba6e6
|
[rllib] Add execution module to package ref (#10941)
* add init
* add
* update
|
2020-09-21 23:03:06 -07:00 |
|
Sven Mika
|
805dad3bc4
|
[RLlib] SAC algo cleanup. (#10825)
|
2020-09-20 11:27:02 +02:00 |
|
Sven Mika
|
ef18893fb5
|
[RLlib] PPO, APPO, and DD-PPO code cleanup. (#10420)
|
2020-09-02 14:03:01 +02:00 |
|
Sven Mika
|
2256047876
|
[RLlib] Rename rllib.utils.types into typing to match built-in python module's name. (#10114)
|
2020-08-15 13:24:22 +02:00 |
|
Barak Michener
|
8e76796fd0
|
ci: Redo format.sh --all script & backfill lint fixes (#9956)
|
2020-08-07 16:49:49 -07:00 |
|
Sven Mika
|
b0b0463161
|
[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)
|
2020-07-29 21:15:09 +02:00 |
|
Sven Mika
|
fcdf410ae1
|
[RLlib] Tf2.x native. (#8752)
|
2020-07-11 22:06:35 +02:00 |
|
Sven Mika
|
43043ee4d5
|
[RLlib] Tf2x preparation; part 2 (upgrading try_import_tf() ). (#9136)
* WIP.
* Fixes.
* LINT.
* WIP.
* WIP.
* Fixes.
* Fixes.
* Fixes.
* Fixes.
* WIP.
* Fixes.
* Test
* Fix.
* Fixes and LINT.
* Fixes and LINT.
* LINT.
|
2020-06-30 10:13:20 +02:00 |
|
Eric Liang
|
1e0e1a45e6
|
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
2020-06-19 13:09:05 -07:00 |
|
Sven Mika
|
7008902cff
|
[RLlib] Minor rllib.utils cleanup. (#8932)
|
2020-06-16 08:52:20 +02:00 |
|
Eric Liang
|
34bae27ac7
|
[rllib] Flexible multi-agent replay modes and replay_sequence_length (#8893)
|
2020-06-12 20:17:27 -07:00 |
|
mehrdadn
|
f93bb008bb
|
Change os.uname()[1] and socket.gethostname() to the portable and faster platform.node_ip() (#8839)
Co-authored-by: Mehrdad <noreply@github.com>
|
2020-06-08 21:29:46 -07:00 |
|
Sven Mika
|
2746fc0476
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
|
Eric Liang
|
9a83908c46
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
|
Eric Liang
|
aa7a58e92f
|
[rllib] Support training intensity for dqn / apex (#8396)
|
2020-05-20 11:22:30 -07:00 |
|
Sven Mika
|
c9435cad43
|
WIP. (#8456)
Fix multi-GPU histogram metrics for > 0D tensors.
|
2020-05-15 21:43:27 +02:00 |
|
Eric Liang
|
6bf1dc0888
|
[rllib] [hotfix] Build broken due to merge conflict: MixInReplay has no attribute buffer
|
2020-05-13 12:21:04 -07:00 |
|
Eric Liang
|
96f4d82cc3
|
[rllib] Qmix replay ratio is wrong
|
2020-05-12 13:07:19 -07:00 |
|
Eric Liang
|
9d012626e5
|
[rllib] Distributed exec workflow for impala (#8321)
|
2020-05-11 20:24:43 -07:00 |
|
Eric Liang
|
2c599dbf05
|
[rllib] Port QMIX, MADDPG to new execution API (#8344)
|
2020-05-07 23:41:10 -07:00 |
|
Eric Liang
|
9f04a65922
|
[rllib] Add PPO+DQN two trainer multiagent workflow example (#8334)
|
2020-05-07 23:40:29 -07:00 |
|
Eric Liang
|
b14cc16616
|
[rllib] Enable functional execution workflow API by default (#8221)
|
2020-05-05 12:36:42 -07:00 |
|
Eric Liang
|
ee0eb44a32
|
Rename async_queue_depth -> num_async (#8207)
* rename
* lint
|
2020-05-05 01:38:10 -07:00 |
|
Eric Liang
|
baadbdf8d4
|
[rllib] Execute PPO using training workflow (#8206)
* wip
* add kl
* kl
* works now
* doc update
* reorg
* add ddppo
* add stats
* fix fetch
* comment
* fix learner stat regression
* test fixes
* fix test
|
2020-04-30 01:18:09 -07:00 |
|
Eric Liang
|
2298f6fb40
|
[rllib] Port DQN/Ape-X to training workflow api (#8077)
|
2020-04-23 12:39:19 -07:00 |
|
Eric Liang
|
d92c5f1a9e
|
[rllib] Add init file for exec module
|
2020-04-17 17:24:28 -07:00 |
|
Eric Liang
|
31b40b00f6
|
[rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958)
|
2020-04-10 00:56:08 -07:00 |
|