Eric Liang
|
5d7afe8092
|
[rllib] Try moving RLlib to top level dir (#5324)
|
2019-08-05 23:25:49 -07:00 |
|
Eric Liang
|
20450a4e82
|
[rllib] Add rock paper scissors multi-agent example (#5336)
|
2019-08-01 13:03:59 -07:00 |
|
Samir Al-Stouhi
|
51b8915c0a
|
Added CARLA Community Example (#5333)
|
2019-07-31 18:10:50 -07:00 |
|
Eric Liang
|
a62c5f40f6
|
[rllib] Document ModelV2 and clean up the models/ directory (#5277)
|
2019-07-27 02:08:16 -07:00 |
|
Eric Liang
|
34d054ff19
|
[rllib] ModelV2 API (#4926)
|
2019-07-03 15:59:47 -07:00 |
|
Eric Liang
|
9e328fbe6f
|
[rllib] Add docs on how to use TF eager execution (#4927)
|
2019-06-07 16:42:37 -07:00 |
|
Eric Liang
|
7501ee51db
|
[rllib] Rename PolicyEvaluator => RolloutWorker (#4820)
|
2019-06-03 06:49:24 +08:00 |
|
Eric Liang
|
4f46d3e9bf
|
[rllib] Add multi-agent examples for hand-coded policy, centralized VF (#4554)
|
2019-04-09 00:36:49 -07:00 |
|
Eric Liang
|
4b8b703561
|
[rllib] Some API cleanups and documentation improvements (#4409)
|
2019-03-21 21:34:22 -07:00 |
|
Eric Liang
|
6e3384a719
|
[rllib] Add three new long-running stress tests {APEX, IMPALA, PBT} (#4215)
|
2019-03-04 14:05:42 -08:00 |
|
Robert Nishihara
|
4b89eebfc7
|
Move test folders under rllib/tune from test -> tests. (#4214)
|
2019-03-02 13:37:16 -08:00 |
|
Eric Liang
|
d9da183c7d
|
[rllib] Custom supervised loss API (#4083)
|
2019-02-24 15:36:13 -08:00 |
|
Eric Liang
|
fb73cedf70
|
[rllib] Add examples page, add hierarchical training example, delete SC2 examples (#3815)
* wip
* lint
* wip
* up
* wip
* update examples
* wip
* remove carla
* update
* improve envspec
* link to custom
* Update rllib-env.rst
* update
* fix
* fn
* lint
* ds
* ssd games
* desc
* fix up docs
* fix
|
2019-01-29 21:06:09 -08:00 |
|