SangBin Cho
30000ff8ae
Fix a bug from many drivers. ( #22248 )
...
After this PR (https://github.com/ray-project/ray/pull/22156 ), for some reasons the driver script has some string that cannot be encoded with ascii. It seems like using utf-8 solves the problem.
2022-02-09 15:17:15 -08:00
SangBin Cho
ac00389cbe
[Nightly test] Bring back the old way of running commands. ( #22209 )
...
Bring back the old way of running commands for non-k8s tests.
This also fixes the regression from many_drivers.py
2022-02-08 01:44:07 -08:00
Eric Liang
00b5801d71
Fix datasets leaking worker processes due to closure capture of stats actor handle ( #22156 )
2022-02-07 14:05:44 -08:00
Yi Cheng
0659d4a472
[nightly] Limit many drivers iteration to 4000 iterations ( #21958 )
...
Due to faster running of many drivers, we limit the iteration to 4k for the test.
2022-01-31 13:26:02 -08:00
Balaji Veeramani
7f1bacc7dc
[CI] Format Python code with Black ( #21975 )
...
See #21316 and #21311 for the motivation behind these changes.
2022-01-29 18:41:57 -08:00
mwtian
97f7e3d0e6
[e2e] do not terminate in serve_failure
smoke test ( #21925 )
...
When the script terminates, it will also terminate its cluster including dashboard, which will prevent subsequent job submissions. Other long running e2e tests do not terminate in smoke test mode, so make `serve_failure` behave the same.
2022-01-27 15:36:46 -08:00
Simon Mo
ca90c63483
[Serve] Add serve failure test to CI ( #20392 )
2021-11-16 08:12:08 -08:00
Edward Oakes
7c9881b73d
[serve] Fix serve_failure test ( #20268 )
2021-11-11 19:19:34 -08:00
Jiao
aaef82920d
[serve] Add periodic timeouts to long poll client to avoid accumulating concurrent tasks in the controller ( #19728 )
2021-10-26 09:44:00 -05:00
SangBin Cho
9000f41aa6
[Nightly Test] Support memory profiling on Ray + implement memory monitor for nightly tests ( #19539 )
...
* random fixes
* Done
* done
* update the doc
* doc lint fix
* .
* .
2021-10-21 07:37:05 -07:00
Kai Fricke
e07d0953ea
[ci/release] Undo faulty change to many_ppo num_samples ( #19388 )
2021-10-14 10:27:31 -07:00
Kai Fricke
9cee83c919
[tune] PBT: Add burn-in period ( #19321 )
2021-10-14 16:28:29 +01:00
Kai Fricke
d52203ee03
[ci/release] Fix long running serve test result fetching ( #18880 )
2021-09-24 16:16:01 +01:00
Clark Zinzow
d958457d07
[Core] Second pass at privatizing APIs. ( #17885 )
...
* gcs_utils
* resource_spec
* profiling
* ray_perf and ray_cluster_perf
* test_utils
2021-08-18 20:56:33 -07:00
architkulkarni
8c1317067d
move variable updates from middle of loop to end ( #17591 )
2021-08-05 09:53:01 +01:00
Jiao
6aeda62d40
[Serve] Add serve test config files and wrk dependency ( #16631 )
2021-06-28 10:01:55 -07:00
Kai Fricke
9352cb781c
[release tests] Fix microbenchmark base image, network overhead cluster wait time, add long running tests ( #16355 )
2021-06-16 21:37:17 +01:00
Kai Fricke
153a8b8fec
[release] convert tune release tests ( #15913 )
2021-06-01 11:19:15 -07:00
Kai Fricke
1d52ab819f
[release] release 1.3.0 results and test updates ( #15366 )
...
Convert a number of release tests and add logs for release 1.3.0
2021-05-04 22:10:04 +01:00
Amog Kamsetty
ebc44c3d76
[CI] Upgrade flake8 to 3.9.1 ( #15527 )
...
* formatting
* format util
* format release
* format rllib/agents
* format rllib/env
* format rllib/execution
* format rllib/evaluation
* format rllib/examples
* format rllib/policy
* format rllib utils and tests
* format streaming
* more formatting
* update requirements files
* fix rllib type checking
* updates
* update
* fix circular import
* Update python/ray/tests/test_runtime_env.py
* noqa
2021-05-03 14:23:28 -07:00
Edward Oakes
0f9d1bb223
Serve failure release test fix ( #15276 )
...
This test is currently not tested in CI
2021-04-13 17:49:29 +01:00
Edward Oakes
e4ca337e16
[serve] Change remaining tests to use deployment API ( #15167 )
2021-04-08 08:15:38 -05:00
SangBin Cho
0428537d0b
[Object Spilling] Long running object spilling test ( #13331 )
...
* done.
* formatting.
2021-01-12 16:53:13 -08:00
Simon Mo
01dcb993c7
[Serve] Rescale Serve's Long Running Test to Cluster Mode ( #13247 )
...
Now that `HeadOnly` becomes the new default HTTP location, we can
re-enable the long running tests to use local multi-clusters.
(also fixed the controller's API to match up to date, we should
have caught these, I will open issues for this.)
2021-01-07 08:57:24 -08:00
Simon Mo
3d8c1cbae6
[Serve] Fix Serve Release Tests ( #12777 )
2020-12-11 11:53:47 -08:00
Eric Liang
9f322db71d
Add many_ppo long running test ( #12364 )
...
* add new tes
* update
* update
2020-11-24 16:00:33 -08:00
Edward Oakes
2feba4409c
[serve] Fix long running failure test ( #11805 )
2020-11-09 11:21:03 -06:00
Barak Michener
4348ecf850
Clean up release tests ( #11420 )
2020-10-22 17:04:41 -07:00