Sven Mika
|
c9435cad43
|
WIP. (#8456)
Fix multi-GPU histogram metrics for > 0D tensors.
|
2020-05-15 21:43:27 +02:00 |
|
Sven Mika
|
66df8b8c35
|
[RLlib] Working/learning example: PPO + torch + LSTM. (#7797)
|
2020-03-31 22:00:28 -07:00 |
|
Eric Liang
|
9a590ac6a5
|
[rllib] Fix custom model metrics in multi-device case (#7640)
* fix example
* add example test
* lin
|
2020-03-23 12:40:22 -07:00 |
|
Sven Mika
|
d537e9f0d8
|
[RLlib] Exploration API: merge deterministic flag with exploration classes (SoftQ and StochasticSampling). (#7155)
|
2020-02-19 12:18:45 -08:00 |
|
Eric Liang
|
2fb53396ad
|
[rllib] [experimental] Decentralized Distributed PPO for torch (DD-PPO) (#6918)
|
2020-01-25 22:36:43 -08:00 |
|