Eric Liang
|
02583a8598
|
[rllib] Rename PolicyGraph => Policy, move from evaluation/ to policy/ (#4819)
This implements some of the renames proposed in #4813
We leave behind backwards-compatibility aliases for *PolicyGraph and SampleBatch.
|
2019-05-20 16:46:05 -07:00 |
|
Eric Liang
|
37208216ae
|
[rllib] Rename Agent to Trainer (#4556)
|
2019-04-07 00:36:18 -07:00 |
|
Eric Liang
|
4b8b703561
|
[rllib] Some API cleanups and documentation improvements (#4409)
|
2019-03-21 21:34:22 -07:00 |
|
Eric Liang
|
d9da183c7d
|
[rllib] Custom supervised loss API (#4083)
|
2019-02-24 15:36:13 -08:00 |
|
Eric Liang
|
2dccf383dd
|
[rllib] Basic infrastructure for off-policy estimation (IS, WIS) (#3941)
|
2019-02-13 16:25:05 -08:00 |
|
Eric Liang
|
fb73cedf70
|
[rllib] Add examples page, add hierarchical training example, delete SC2 examples (#3815)
* wip
* lint
* wip
* up
* wip
* update examples
* wip
* remove carla
* update
* improve envspec
* link to custom
* Update rllib-env.rst
* update
* fix
* fn
* lint
* ds
* ssd games
* desc
* fix up docs
* fix
|
2019-01-29 21:06:09 -08:00 |
|
Eric Liang
|
03fe760616
|
[rllib] Model self loss isn't included in all algorithms (#3679)
|
2019-01-04 22:30:35 -08:00 |
|
Eric Liang
|
ca864faece
|
[rllib] Documentation for I/O API and multi-agent support / cleanup (#3650)
|
2019-01-03 15:15:36 +08:00 |
|