Eric Liang
|
bc6a95deb0
|
[rllib] Eager execution for centralized critic example, fix simple optimizer for multiagent (#5683)
|
2019-09-11 12:15:34 -07:00 |
|
Eric Liang
|
daf38c8723
|
[tune] Deprecate tune.function (#5601)
* remove tune function
* remove examples
* Update tune-usage.rst
|
2019-08-31 16:00:10 -07:00 |
|
gehring
|
b520f6141e
|
[rllib] Adds eager support with a generic TFEagerPolicy class (#5436)
|
2019-08-23 14:21:11 +08:00 |
|
Eric Liang
|
a1d2e17623
|
[rllib] Autoregressive action distributions (#5304)
|
2019-08-10 14:05:12 -07:00 |
|
Eric Liang
|
592f313210
|
[rllib] Centralized critic / PPO example on TwoStepGame (#5392)
|
2019-08-08 14:03:28 -07:00 |
|