Commit graph

210 commits

Author SHA1 Message Date
xwjiang2010
09e760a1fd
[Release] Change all cpus_per_actor in xgboost test. (#18717) 2021-09-17 12:57:21 -07:00
xwjiang2010
2c92f737f9
Fix dask_xgboost_test (#18713) 2021-09-17 11:25:54 -07:00
Jiao
ca3be60291
[Release] change headnode type for serve benchmark (#18672)
Co-authored-by: Jiao Dong <jiaodong@anyscale.com>
2021-09-16 10:57:36 -07:00
Sven Mika
ba1c489b79
[RLlib Testing] Lower --smoke-test "time_total_s" to make sure it doesn't time out. (#18670) 2021-09-16 18:22:23 +02:00
gjoliver
df32ed35fd
Extend --smoke-test deadlines for learning and stress regression tests. (#18667) 2021-09-16 09:18:39 +01:00
Antoni Baum
7e95f330d5
[ci] Fix xgboost_ray install from git (#18640) 2021-09-15 18:07:15 +01:00
Antoni Baum
eeb67a42cc
pip install xgboost_ray -> xgboost_ray[default] (#18607)
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-09-15 14:45:56 +01:00
Kai Fricke
15a83d104d
[ci/release] remove legacy release tests (#18592) 2021-09-15 14:42:58 +01:00
SangBin Cho
b8c361d3fb
[Test] Mark app config failure as an infra failure (#18614) 2021-09-14 17:20:05 -07:00
Kai Fricke
c8188ea70e
[ci/rllib] wait for stress test cluster (#18603) 2021-09-14 19:01:22 +01:00
Kai Fricke
6777e24293
[ci] Add release test owner overview file (#18590) 2021-09-14 11:00:31 -07:00
Sven Mika
08c09737fa
[RLlib] Fix R2D2 (torch) multi-GPU issue. (#18550) 2021-09-14 19:58:10 +02:00
SangBin Cho
51d94ebee0
[Tests] Make nightly test work + Remove work stealing logs (#18300)
* make tests work

* .
2021-09-14 09:52:58 -07:00
Antoni Baum
65d5deae60
[tests] Increase golden notebook test timeout to 20 mins (#18554) 2021-09-14 16:27:56 +01:00
Jiao
d3734d803d
[serve] Change nightly test docker image and enable micro benchmark (#18566) 2021-09-14 09:41:21 -05:00
Kai Fricke
e4754f1e19
[ci] wheel URLs - give some time for wheels to be built (#18505) 2021-09-14 09:56:34 +01:00
Guyang Song
beff857cc1
[release][C++ API] support sanity check C++ (#18545) 2021-09-14 11:39:08 +08:00
gjoliver
2924afa41e
[Release] Create soft links for libcusolver.so.10 as a temporary fix. (#18562)
Co-authored-by: Jun Gong <jungong@anyscale.com>
2021-09-13 14:37:12 -07:00
Jiajun Yao
ec6f5ae9ab
Upgrade serve_tests and runtime_env_tests base image to 1.6.0 (#18563) 2021-09-13 12:47:06 -07:00
Kai Fricke
b543c0e923
[ci] Do not use anyscale connect for xgboost_tests/train_small (#18569) 2021-09-13 20:38:00 +01:00
Kai Fricke
b6392aa6ea
[ci] upgrade microbenchmark base image to 1.6.0 (#18542) 2021-09-13 17:13:01 +01:00
Kai Fricke
7d1e6d3129
[ci/release] Add sanity check for ray wheels hash to release tests (#18489) 2021-09-10 17:50:31 +01:00
Kai Fricke
be438fb600
[release] Also download Ray CPP wheels (#18383) 2021-09-10 17:49:37 +01:00
SangBin Cho
7b2ed4c1f8
[Placement group] Placement group scheduling hangs due to creation/removal race condition (#18419) 2021-09-09 20:39:01 -07:00
matthewdeng
e66f154b14
[release] increase torch_tune_serve timeout to 20 min (#18481) 2021-09-09 16:31:14 -07:00
Simon Mo
6d24214085
[Release] Make sure to uninstall ray for rllib_tests (#18448) 2021-09-08 23:29:40 +01:00
gjoliver
50cdf551ce
[RLlib] Fix test name typo. (#18423)
Co-authored-by: Jun Gong <jungong@mbpro.local>
2021-09-08 23:30:37 +02:00
Yi Cheng
6011d4197f
Open [nightly] Add many_nodes_actor_test to nightly test (#18406) 2021-09-08 11:15:48 -07:00
Sven Mika
cabaa3b3c6
[RLlib Testing] Add A3C/APPO/BC/DDPPO/MARWIL/CQL/ES/ARS/TD3 to weekly learning tests. (#18381) 2021-09-07 11:48:41 +02:00
Sven Mika
5292b70fc6
[RLlib] Add multi-GPU attention net tests to nightly test suite (+ R2D2 tests for LSTM and attention nets). (#18368) 2021-09-06 17:48:05 +02:00
Kai Fricke
d9552e6795
Update release process doc and checklist (#18336)
Co-authored-by: Qing Wang <kingchin1218@126.com>
2021-09-06 14:09:31 +01:00
Sven Mika
59f796edf3
[RLlib] Fix crash when using StochasticSampling exploration (most PG-style algos) w/ tf and numpy > 1.19.5 (#18366) 2021-09-06 12:14:00 +02:00
Yi Cheng
23e9af0601
[test] Add x nodes y actors test to nightly tests (#18291) 2021-09-03 18:54:23 -07:00
Kai Fricke
6aa8a4eddc
[release] prettier output of release test results and artifacts (#18337) 2021-09-03 14:00:55 +01:00
Sven Mika
9a8ca6a69d
[RLlib] Fix Atari learning test regressions (2 bugs) and 1 minor attention net bug. (#18306) 2021-09-03 13:29:57 +02:00
Kai Fricke
fb38d06cfb
Move RLLib GPU release test dependencies to ml docker (#18208) 2021-09-03 09:35:18 +01:00
SangBin Cho
814095add6
Revert "Change instance type for some tests (#18248)" (#18320)
This reverts commit 34026a7bd5.
2021-09-02 17:45:02 -07:00
Sven Mika
a7670d9fab
[RLlib; Testing] Fix smoke-test settings for nightly learning_tests and stress_test; Add pybullet_envs to app-config. (#18274) 2021-09-01 21:46:06 +02:00
SangBin Cho
34026a7bd5
Change instance type for some tests (#18248) 2021-08-31 10:10:46 -07:00
Antoni Baum
2c0dcec18f
[test] Fix golden notebook tests always failing (#17873) 2021-08-31 17:07:47 +02:00
SangBin Cho
eab506cc37
[Test] Disable non streaming shuffle 5000 partitions (#18224)
* Disable non streaming shuffle 5000 partitions

* increase timeout for 5000 partition shuffle
2021-08-31 00:28:15 -07:00
SangBin Cho
dfbad8668a
Support better infra failure detection + stable flag (#18202) 2021-08-30 10:51:03 -07:00
Sven Mika
8acb469b04
[RLlib; Testing] Green all RLlib nightly tests. (#18073) 2021-08-26 14:09:20 +02:00
Kai Fricke
089dd9b949
[release] Add release logs for 1.6.0 (#18067) 2021-08-26 12:13:15 +02:00
Jiao
b52c873027
[serve] Use list_deployments in benchmark (#18050) 2021-08-25 12:26:46 -05:00
Kai Fricke
4c3276644e
[release] After buildkite ask step, use RAY_TEST_REPO pipeline (#18074) 2021-08-25 15:58:38 +02:00
SangBin Cho
43da68e657
Fix a nightly dask on ray test (#18060) 2021-08-24 22:15:34 -07:00
Sven Mika
9883505e84
[RLlib] Add [LSTM=True + multi-GPU]-tests to nightly RLlib testing suite (for all algos supporting RNNs, except R2D2, RNNSAC, and DDPPO). (#18017) 2021-08-24 21:55:27 +02:00
Sven Mika
d47ed62083
[Testing RLlib] Increase timeout for nightly multi-GPU learning tests. (#17987) 2021-08-23 10:37:50 +02:00
Kai Fricke
fca8af88d2
[release] Fix e2e environment variable passing from pipeline (#18000) 2021-08-23 09:26:37 +02:00