Sven Mika
|
b0b0463161
|
[RLlib] Trajectory View API (preparatory cleanup and enhancements). (#9678)
|
2020-07-29 21:15:09 +02:00 |
|
Eric Liang
|
1e0e1a45e6
|
[rllib] Add type annotations for evaluation/, env/ packages (#9003)
|
2020-06-19 13:09:05 -07:00 |
|
Eric Liang
|
9f04a65922
|
[rllib] Add PPO+DQN two trainer multiagent workflow example (#8334)
|
2020-05-07 23:40:29 -07:00 |
|
Eric Liang
|
baadbdf8d4
|
[rllib] Execute PPO using training workflow (#8206)
* wip
* add kl
* kl
* works now
* doc update
* reorg
* add ddppo
* add stats
* fix fetch
* comment
* fix learner stat regression
* test fixes
* fix test
|
2020-04-30 01:18:09 -07:00 |
|
Eric Liang
|
31b40b00f6
|
[rllib] Pull out experimental dsl into rllib.execution module, add initial unit tests (#7958)
|
2020-04-10 00:56:08 -07:00 |
|