Stefan Otte
|
d70abcfd70
|
Fix typo in examples/centralized_critic.py (#5943)
`opp_ops` should be `opp_obs`.
|
2019-10-17 08:42:50 -07:00 |
|
Matthew A. Wright
|
4aa06918ae
|
Qmix on gpu and with non-stacked-obs environment state support (#5751)
|
2019-10-08 13:18:07 -07:00 |
|
Eric Liang
|
bc6a95deb0
|
[rllib] Eager execution for centralized critic example, fix simple optimizer for multiagent (#5683)
|
2019-09-11 12:15:34 -07:00 |
|
Eric Liang
|
daf38c8723
|
[tune] Deprecate tune.function (#5601)
* remove tune function
* remove examples
* Update tune-usage.rst
|
2019-08-31 16:00:10 -07:00 |
|
gehring
|
b520f6141e
|
[rllib] Adds eager support with a generic TFEagerPolicy class (#5436)
|
2019-08-23 14:21:11 +08:00 |
|
Eric Liang
|
a1d2e17623
|
[rllib] Autoregressive action distributions (#5304)
|
2019-08-10 14:05:12 -07:00 |
|
Eric Liang
|
592f313210
|
[rllib] Centralized critic / PPO example on TwoStepGame (#5392)
|
2019-08-08 14:03:28 -07:00 |
|