Eric Liang
|
fbc545c03b
|
[rllib] Support parallel, parameterized evaluation (#6981)
* eval api
* update
* sync eval filters
* sync fix
* docs
* update
* docs
* update
* link
* nit
* doc updates
* format
|
2020-02-01 22:12:12 -08:00 |
|
Eric Liang
|
6bb30c9f1b
|
fix links (#6883)
|
2020-01-22 01:06:07 -08:00 |
|
Eric Liang
|
14016535a5
|
[rllib] Add TF and Torch icons to show which are available for each algo (#6869)
|
2020-01-20 15:22:21 -08:00 |
|
Victor Le
|
4e24c805ee
|
AlphaZero and Ranked reward implementation (#6385)
|
2019-12-07 12:08:40 -08:00 |
|
Eric Liang
|
bc5e259264
|
[rllib] Add a doc section on computing actions (#6326)
* options doc
* add note
* hint shr
* doc update
|
2019-12-03 00:10:50 -08:00 |
|
Eric Liang
|
a0dcb45dc3
|
[rllib] Fix APEX priorities returning zero all the time (#5980)
* fix
* move example tests to end
* level err
* guard against none
* no trace test
* ignore thumbs
* np
* fix multi node
* fix
|
2019-10-26 13:23:42 -07:00 |
|
Eric Liang
|
bc6a95deb0
|
[rllib] Eager execution for centralized critic example, fix simple optimizer for multiagent (#5683)
|
2019-09-11 12:15:34 -07:00 |
|
Eric Liang
|
1455a19c85
|
Consolidate and clean up documentation (#5645)
|
2019-09-07 11:50:18 -07:00 |
|
Eric Liang
|
79949fb8a0
|
[rllib] RLlib in 60 seconds documentation (#5430)
|
2019-08-12 17:39:02 -07:00 |
|