Sven Mika
|
2746fc0476
|
[RLlib] Auto-framework, retire use_pytorch in favor of framework=... (#8520)
|
2020-05-27 16:19:13 +02:00 |
|
Eric Liang
|
9a83908c46
|
[rllib] Deprecate policy optimizers (#8345)
|
2020-05-21 10:16:18 -07:00 |
|
Sven Mika
|
754290daad
|
[RLlib] Add light-weight Trainer.compute_action() tests for all Algos. (#8356)
|
2020-05-08 16:31:31 +02:00 |
|
Eric Liang
|
f5d12a958b
|
[rllib] Port Ape-X to distributed execution API (#7497)
|
2020-03-12 00:54:08 -07:00 |
|
Eric Liang
|
0f88444686
|
[rllib] Support multi-agent training in pipeline impls, add easy flag to enable (#7338)
|
2020-03-02 15:16:37 -08:00 |
|
Eric Liang
|
46af992efd
|
[rllib] [experimental] custom RL training pipelines (PG_pl, A2C_pl) (#7213)
|
2020-02-19 16:07:37 -08:00 |
|