Eric Liang
4f46d3e9bf
[rllib] Add multi-agent examples for hand-coded policy, centralized VF ( #4554 )
2019-04-09 00:36:49 -07:00
ctombumila37
7746d20d30
[rllib] ExternalMultiAgentEnv ( #4200 )
2019-04-06 19:58:14 -07:00
Eric Liang
0d94f3eeef
[rllib] Improve datapath throughput of IMPALA / APPO ( #4324 )
2019-03-31 12:25:52 -07:00
bjg2
77005d1814
[rllib] Make batch timeout for remote workers tunable ( #4435 )
2019-03-29 13:19:42 -07:00
Eric Liang
2ffe67c5c3
[rllib] Minor cleanups to TFPolicyGraph: add init args, constants for loss inputs ( #4478 )
2019-03-29 12:44:23 -07:00
Eric Liang
8ee240f40e
[rllib] Use 64-byte aligned memory when concatenating arrays ( #4408 )
2019-03-25 23:56:51 -07:00
Eric Liang
57c1aeb427
[rllib] Use suppress_output instead of run_silent.sh script for tests ( #4386 )
...
* fix
* enable custom loss
* Update run_rllib_tests.sh
* enable tests
* fix action prob
* Update suppress_output
* fix example
* fix
2019-03-21 00:15:24 -07:00
Eric Liang
a45019d98c
[rllib] Add option to proceed even if some workers crashed ( #4376 )
2019-03-16 13:34:09 -07:00
Stefan Pantic
2202a81773
Fix multi discrete ( #4338 )
...
* Revert "Revert "[wingman -> rllib] IMPALA MultiDiscrete changes (#3967 )" (#4332 )"
This reverts commit 3c41cb9b60
.
* Fix a bug with log rhos for vtrace
* Reformat
* lint
2019-03-12 20:32:11 -07:00
Eric Liang
3c41cb9b60
Revert "[wingman -> rllib] IMPALA MultiDiscrete changes ( #3967 )" ( #4332 )
...
This reverts commit 962b17f567
.
2019-03-11 22:51:26 -07:00
Eric Liang
c7f74dbdc7
[rllib] Add async remote workers ( #4253 )
2019-03-08 15:39:48 -08:00
Yuhong Guo
d5fb7b70a9
Update arrow version to fix plasma bugs ( #4127 )
...
* Update arrow
* Change to 2c511979b13b230e73a179dab1d55b03cd81ec02 which is rebased on Arrow 46f75d7
* Update to fix comment
* disable tests which use python/ray/rllib/tests/data/cartpole_small
* Fix get order of meta and data in MockObjectStore.java
2019-03-08 18:03:58 +08:00
Eric Liang
b0332551dd
[rllib] Fix APPO + continuous spaces, feed prev_rew/act to A3C properly ( #4286 )
2019-03-06 21:36:26 -08:00
Eric Liang
30bf8e46c7
[rllib] Use nested scope in custom loss example
2019-03-04 18:29:22 -08:00
Richard Liaw
a27cb225b6
Modularize Tune tests from multi-node tests ( #4204 )
2019-03-02 19:21:08 -08:00
Robert Nishihara
4b89eebfc7
Move test folders under rllib/tune from test -> tests. ( #4214 )
2019-03-02 13:37:16 -08:00
bjg2
962b17f567
[wingman -> rllib] IMPALA MultiDiscrete changes ( #3967 )
2019-03-01 19:47:06 -08:00
Eric Liang
b809ef0107
[rllib] Silent tests ( #4151 )
2019-02-28 16:32:22 -08:00