Yi Cheng
9b3ea7c32b
[core] Take care of object spilling failure ( #14703 )
...
* fix spilling failure
* format
* unittests added
* format
* format
* format
* fix
* add comment
* fix some comments
* add test cases
* format
* format
2021-04-20 10:28:48 -07:00
Eric Liang
a482034916
Flaky test builder for tests tagged "flaky" ( #15408 )
2021-04-20 00:19:07 -07:00
Sven Mika
7ff27dfe07
[RLlib] Remove atari dependency for RLlib (in favor of detailed error message). ( #15292 )
2021-04-20 08:46:58 +02:00
architkulkarni
3bda2812fa
[Serve] Remove old ImportedBackend factory ( #15376 )
2021-04-19 16:25:59 -07:00
Edward Oakes
fbe510cd47
[serve] Clean up route prefixing behavior for deployments ( #15193 )
2021-04-19 12:50:46 -05:00
Jiaxin Shan
86468ce59f
[kubernetes] Remove unrelated fields in manifest file ( #15243 )
2021-04-19 10:54:33 -05:00
DK.Pino
b0a813baad
[Placement Group] Fix PlacementGroup ready when specify memory resource ( #15189 )
...
* fix placement group ready when memory specified
* lint
* add memory resource check in suppressed
* fix lint
* update comment
* fix lint
* delete unrelated code
* update comment
* lint
* fix ut
2021-04-17 22:21:05 -07:00
SangBin Cho
5f74d0e40d
[Test] Fix flaky test failure ( #15326 )
...
* Fix trial.
* unskip test.
* Mock commit
2021-04-16 18:09:02 -07:00
Dmitri Gekhtman
e6864523cf
[autoscaler] Do not divide by zero in resource demand scheduler ( #15323 )
...
* Do not divide by zero
* Don't take min or mean of an empty list
* max workers 0 for head node in distributed benchmark
* test
* Correct the type annotation
* comment grammar tweak
* message
* docs
* test
* Move test cli to large tests.
2021-04-16 10:20:05 -07:00
Risto Vuorio
dcda4a3d60
[tune] escaping paths before globbing in TrainableUtil.get_checkpoints_paths ( #15368 )
...
* Fixes 15367 by escaping paths before globbing in TrainableUtil.get_checkpoints_paths
* Adds a test testGetTrialCheckpointsPathsByPathWithSpecialCharacters for fix_15367
2021-04-16 09:41:02 -07:00
Richard Liaw
dc80d9f42a
[flaky] fix mnist ptl data cache ( #15344 )
...
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-04-15 16:24:17 -07:00
SangBin Cho
a54d69f535
[Test] Split long runtime env tests. ( #15340 )
...
* [Test] Split long runtime env tests.
* Addressed code review.
2021-04-15 14:28:28 -07:00
Siyuan (Ryans) Zhuang
4de1f35b3e
run_function_on_all_workers
only once in the driver (#15203 )
2021-04-15 13:58:36 -07:00
Richard Liaw
eaa3ce3f40
Fix release test -- client remote put ( #15325 )
...
* fix-test
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
* fix
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
* Update python/ray/util/client/server/dataservicer.py
* Update python/ray/util/client/server/dataservicer.py
* Update python/ray/_private/ray_client_microbenchmark.py
2021-04-15 13:30:38 -07:00
Kai Fricke
1c783e2eeb
[tune] Allow 0 CPU head bundles in for placement group factories ( #15338 )
2021-04-15 20:21:35 +01:00
SangBin Cho
4dd4756c09
[Test] skip flaky pg tests. ( #15337 )
2021-04-15 11:55:19 -07:00
Ameer Haj Ali
981fa5829a
[client] Enable ClientObjectRef Comparisons ( #15320 )
2021-04-15 16:46:44 +03:00
Yi Cheng
a9402c21e6
Revert "Revert "[runtime_env] Add support of exclusion ( #15241 )" ( #15303 )" with fixing ( #15310 )
...
* Revert "Revert "[runtime_env] Add support of exclusion (#15241 )" (#15303 )"
This reverts commit 775deca5ad
.
* fix
2021-04-14 20:34:53 -07:00
SangBin Cho
e0bbfaf87e
[Log] Fix log monitor issue. ( #15302 )
2021-04-14 18:11:24 -07:00
Charles
82e730078f
[autoscaler] Converting assert False into useful exceptions. ( #15306 )
2021-04-14 16:16:37 -07:00
Simon Mo
c4b1985a5b
[Serialization] Pydantic -> serialization_addons.py
and Ray Client support. ( #15181 )
2021-04-14 15:21:13 -07:00
SangBin Cho
775deca5ad
Revert "[runtime_env] Add support of exclusion ( #15241 )" ( #15303 )
...
This reverts commit 359b5ce06b
.
2021-04-14 11:58:53 -07:00
Richard Liaw
59bf3a7b22
ray[cluster] -> ray[default] ( #15251 )
2021-04-14 09:37:04 -07:00
Antoni Baum
b93bd9bef4
[tune] Set correct Optuna TrialState on trial complete ( #15283 )
2021-04-14 15:59:23 +01:00
Kai Fricke
aaa14d63a7
[tune] deflake test_convergence, add seed
parameter to OptunaSearch ( #15248 )
...
* De-flake optuna convergence test
* Even higher threshold
* Add `seed` parameter to OptunaSearch
2021-04-14 01:06:49 -07:00
Edward Oakes
4ed7a14e23
[serve] Support normal args and kwargs for deployments ( #15172 )
2021-04-13 16:20:50 -05:00
Richard Liaw
f4b2dd94b2
[tune] Cache MNIST and restore MNIST tests ( #15260 )
2021-04-13 14:20:26 -07:00
Yi Cheng
359b5ce06b
[runtime_env] Add support of exclusion ( #15241 )
2021-04-13 15:55:12 -05:00
Ian Rodney
d145ad94e4
[Client] Add metadata to Terminate Calls to make ray.kill() and ray.cancel() work ( #15221 )
2021-04-13 23:24:54 +03:00
Ian Rodney
ec3d5f2ef1
[client] Handle ray.put
failures ( #15229 )
2021-04-13 11:23:16 -07:00
Clark Zinzow
05d99c9432
[dask-on-ray] Don't leak a global enabling of client mode in Dask callback test. ( #15257 )
...
* Don't leak a global enabling of client mode in Dask callback test.
* Enable and disable client_mode_enabled, not _client_hook_enabled.
2021-04-12 22:00:30 -07:00
SangBin Cho
9197552802
Temporarily disable flaky tests. ( #15250 )
2021-04-12 14:09:07 -07:00
Ameer Haj Ali
580b2bb9cc
[metrics.Histogram] improve error message ( #15240 )
2021-04-12 21:28:21 +03:00
Tao Wang
4c9eee609c
Revert "Revert "[GCS]Increase heartbeat interval to reduce pressure o… ( #15207 )
...
* Revert "Revert "[GCS]Increase heartbeat interval to reduce pressure on gcs server (#14203 )" (#15194 )"
This reverts commit a9ac4ad890
.
* optimize wait condition to avoid flakey test
* remove unnecessary sleep
2021-04-12 10:45:42 -07:00
Richard Liaw
56c95075d1
Revert "[tune] enable mnist test v3 ( #15198 )" ( #15242 )
...
This reverts commit d913f32126
.
2021-04-12 09:27:55 -07:00
Clark Zinzow
1b62e9f844
[dask-on-ray][client] Support ClientObjectRefs in the Dask-on-Ray scheduler. ( #15237 )
...
* Support ClientObjectRefs in the Dask-on-Ray scheduler.
2021-04-11 10:44:38 +03:00
Qing Wang
0f444b1a59
Fix unexpected error when handling the process that has exited in memory monitor. ( #14932 )
...
Co-authored-by: Qing Wang <jovany.wq@antgroup.com>
2021-04-11 00:57:10 +08:00
Richard Liaw
0136ae10f8
[tune] run new test ( #15119 )
...
* add-runtest
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
* ok
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-04-10 01:12:51 -07:00
chenk008
6709560ef6
fix setproctitle break /proc/PID/environ ( #15056 )
...
* fix setproctitle break /proc/PID/environ
* bugfix
* add ut
* fix ut
* fix ut
* fix ut
* improve comment
* improve comment
* fix ut lint
* fix ut lint
* revert init.py
Co-authored-by: wuhua.ck <wuhua.ck@alibaba-inc.com>
2021-04-09 15:45:19 -07:00
Sam O
be62444bc5
[Log monitor] Resolves the stacktrace ( #15199 )
2021-04-09 11:32:04 -07:00
Siyuan (Ryans) Zhuang
af9e38fd1c
Cloudpickle workaround for false positive cases ( #15202 )
...
* Cloudpickle workaround for false positive cases in '_is_parametrized_type_hint'.
* update comments
2021-04-09 02:22:46 -07:00
Richard Liaw
d913f32126
[tune] enable mnist test v3 ( #15198 )
2021-04-09 00:10:12 -07:00
Dmitri Gekhtman
58fbb419ea
[client][rllib] Add client_mode_hook for ray.get_gpu_ids ( #15185 )
2021-04-08 23:36:11 -07:00
Eric Liang
268409b6ad
updat warning ( #15200 )
2021-04-08 17:56:52 -07:00
Stephanie Wang
94e592004e
Prioritize worker requests for objects over queued task arguments ( #15157 )
2021-04-08 14:51:21 -07:00
Dmitri Gekhtman
4289fa8d43
[kubernetes][autoscaler][test] Kubernetes scale tests ( #15133 )
2021-04-08 11:42:53 -07:00
Eric Liang
982558a4d3
Update ray client protocol version ( #15184 )
2021-04-08 11:38:48 -07:00
SangBin Cho
a88d20729a
[Test] Skip TestConvergenceOptuna temporarily ( #15197 )
...
* Skip flaky tune test temporarily.
* Lint.
2021-04-08 11:36:10 -07:00
Edward Oakes
06f0c0b6a2
[serve] Remove test_api.py::test_shard_key ( #15195 )
2021-04-08 10:50:17 -07:00
SangBin Cho
a9ac4ad890
Revert "[GCS]Increase heartbeat interval to reduce pressure on gcs server ( #14203 )" ( #15194 )
...
This reverts commit ef195e5108
.
2021-04-08 09:29:13 -07:00