Eric Liang
7daf28f348
Revert "[Test] Fix flaky test_gpu test ( #19524 )" ( #19562 )
...
This reverts commit 39e54cd276
.
2021-10-20 12:21:19 -07:00
Clark Zinzow
88c5fcde8c
[Datasets] Unrevert Arrow table copy method change. ( #19534 )
2021-10-20 11:57:36 -07:00
Jiao
c51f79bca6
[runtime_env] Support remote s3 package in runtime env ( #19315 )
2021-10-20 10:41:54 -05:00
mwtian
aaff6901dd
[Pubsub] refactor pubsub to support different channel types ( #19498 )
...
* refactor pubsub to support different channel types
* fix
* use std::string for key id
* fix mock
* fix
2021-10-20 07:02:55 -07:00
Kai Fricke
71564040ec
[ci/release] Unwrap after installing pip packages ( #19552 )
2021-10-20 13:41:16 +01:00
Jiajun Yao
39e54cd276
[Test] Fix flaky test_gpu test ( #19524 )
2021-10-19 22:36:34 -07:00
Yi Cheng
01b899dafb
[nightly] Fix broken test due to bad syntax #19536 ( #19536 )
2021-10-19 21:43:46 -07:00
Simon Mo
59eef6521b
[Serve] Use regular dict for handle caching ( #19162 )
2021-10-19 21:27:01 -07:00
Yi Cheng
7a9cedfc5c
[nightly] Add grpc based broadcasting into nightly test for decision_tree ( #19531 )
...
* dbg
* up
* check
* up
* up
* put grpc based one into nightly test
* up
2021-10-19 19:59:39 -07:00
Jiajun Yao
4fc5b11c68
Simple block dataset groupBy ( #19435 )
2021-10-19 19:53:13 -07:00
Eric Liang
eacfbf8be2
[data] Don't shuffle during repartition by default ( #19379 )
2021-10-19 19:46:22 -07:00
SangBin Cho
3222d39fb8
[Dashboard] Dashboard memory improvement ( #19385 )
...
* many ppo profiling
* completed
* improve memory usage lint
* revert temporarily
* Addressed code review
* Fix a test
2021-10-19 19:34:42 -07:00
Simon Mo
30c8c073a2
[Doc] Generate sitemap ( #19375 )
2021-10-19 14:14:17 -07:00
Simon Mo
48cf366dca
[Hotfix] Pin node version to 14 ( #19522 )
2021-10-19 14:13:06 -07:00
Ian Rodney
02090afc26
[Docker] Re-Tag Docker Images with a lambda ( #19081 )
...
* lil lambda
* Better Credential Handling
* use a script for this :)
* better timeout and link & echo messages
2021-10-19 14:06:31 -07:00
matthewdeng
19eabd7a55
[train] remove default num_workers ( #19518 )
...
* [train] remove default num_workers
* fix tests
2021-10-19 13:53:23 -07:00
Akash Patel
7e10f6a876
add missing <limits> header for prometheus_cpp ( #19108 )
2021-10-19 13:33:31 -07:00
gjoliver
2bd7932830
Add a script to analyze python module dependencies using static analysis ( #18965 )
2021-10-19 13:33:02 -07:00
mwtian
098ff36faa
[CI] Remove config that disables Bazel test result cache ( #18701 )
2021-10-19 13:31:42 -07:00
Edward Oakes
a596d59863
[serve] Modify serve debugger example to use current APIs ( #19513 )
2021-10-19 13:21:56 -07:00
matthewdeng
56e46c3c23
[train] add callbacks package compatibility ( #19519 )
2021-10-19 12:56:49 -07:00
Kai Fricke
3e8587644b
[ci/release] wrap all release test pip github installs in quotation marks ( #19521 )
2021-10-19 20:55:02 +01:00
Edward Oakes
4645893a5f
Add prototype of ray.serve.pipeline ( #19278 )
2021-10-19 11:36:49 -07:00
xwjiang2010
a6f9c93db0
Revert "[Datasets] Add support for slicing Arrow blocks that contain tensor columns. ( #19494 )" ( #19517 )
...
This reverts commit ad03917b8f
.
2021-10-19 11:35:04 -07:00
Duarte OC
5af6152e76
[Serve] [Doc] Update docs with import missing ( #19469 )
2021-10-19 11:23:50 -07:00
Tao He
1dde588702
[Dataset] Support dataset from a single dataframe/table. ( #18205 )
2021-10-19 10:27:43 -07:00
Alex Wu
a819e417ac
Revert "[Hotfix] Revert "[Workflow] workflow.delete"" ( #19248 )
...
* Revert "Revert "[Workflow] workflow.delete (#19178 )" (#19247 )"
This reverts commit b59317520d
.
* fix
* .
* .
* .
* Revert "."
This reverts commit 423b9b8e7e83f07cb0942b04e568e37ea0c62ba8.
* .
* .
* done?
* 4real
Co-authored-by: Alex <alex@anyscale.com>
2021-10-19 09:47:56 -07:00
Gagandeep Singh
cc00ab74da
[Windows] Fix test_fair_queuing and test_wait_timing ( #19456 )
...
* modified timeout in test_fair_qeueing
* bump bounds to pass the tests
2021-10-19 09:27:04 -07:00
mwtian
3260330e45
Disable clang-tidy until ergonomic issues are resolved ( #19499 )
...
Why are these changes needed?
Currently clang-tidy does not run inside scripts/format.sh. Also clang-tidy can produce false positive warnings. Maybe we can disable clang-tidy until ergonomic issues are resolved.
2021-10-19 08:45:25 -07:00
architkulkarni
b8941338d3
[runtime env] Raise error when creating runtime env when ray[default] is not installed ( #19491 )
2021-10-19 09:16:04 -05:00
Jiajun Yao
805ce453dd
[Java] Remove auto-generated pom.xml files. ( #19475 )
2021-10-19 17:35:37 +08:00
matthewdeng
4674c78050
[Train] Rename Ray SGD v2 to Ray Train ( #19436 )
2021-10-18 22:27:46 -07:00
Guyang Song
46b4c7464d
runtime env eager install by default ( #19449 )
2021-10-19 11:31:14 +08:00
Clark Zinzow
ad03917b8f
[Datasets] Add support for slicing Arrow blocks that contain tensor columns. ( #19494 )
2021-10-18 20:07:06 -07:00
Simon Mo
6f2eb1f9fa
[Serve] Use ray core metrics for autoscaling ( #19038 )
2021-10-18 19:32:49 -07:00
Chen Shen
b38ebd368c
[Dataset][nighlyt-test] spend less money #19488
...
Reduce the epoch and ensure everything runs in the same datacenter.
2021-10-18 18:53:50 -07:00
Gagandeep Singh
0b82135d2d
Use 127.0.0.1 in win32 as node ip addr ( #19362 )
2021-10-18 15:51:15 -07:00
gjoliver
e9f66cc394
Reduce success criteria for a few learning tests. ( #19484 )
2021-10-18 15:44:38 -07:00
Ian Rodney
74db390d15
[Docker] Fix Rsync ( #19020 )
...
* rsync down
* Rsync up, but not delete
* test fixes
* Explicit rsync -e
* Better copy check
* quick comment
* Additional fix to rsync_up
2021-10-18 14:35:22 -07:00
Kai Fricke
6798bdbb5d
Revert "Revert "[RLlib](deps): Bump tensorflow from 2.5.0 to 2.6.0 in /python/requirements/rllib"" ( #19352 )
...
This reverts commit bde9e058da
.
2021-10-18 22:29:16 +01:00
Simon Mo
a081579f68
[Dashboard] Fix gRPC GCS healthcheck thread ( #19360 )
2021-10-18 13:18:06 -07:00
Eric Liang
1bb2b1fc49
[hotfix] Pin pyspark dep to 3.1.2
2021-10-18 13:10:06 -07:00
Jiajun Yao
4d9585773f
[Release] Remove release process doc ( #19312 )
2021-10-18 11:24:03 -07:00
Yi Cheng
f47f69d31e
[nightly] Add decision_tree_autoscaling_20_runs to nightly test
2021-10-18 11:19:40 -07:00
Kai Fricke
ad94eb03c6
[ci/release] wrap pip github installs in quotation marks to prevent comment errors ( #19464 )
2021-10-18 18:55:56 +01:00
mwtian
9742abb749
[Debugging] Print Python stack trace in addition to C++ stack trace, when Python worker crashes ( #19423 )
...
Why are these changes needed?
Right now the failure signal handler registered in Python worker is skipped on crashes like segfault, because C++ core worker overrides the failure signal handler here and does not call the previously registered handler. This prevents Python stack trace from being printed on crashes. The fix is to make the C++ fault signal handler to call the previous signal handler registered in Python. For example with the script below which segfaults,
import ray
ray.init()
@ray.remote
def f():
import ctypes;
ctypes.string_at(0)
ray.get(f.remote())
Ray currently only prints the following stack trace:
(pid=26693) *** SIGSEGV received at time=1634418743 ***
(pid=26693) PC: @ 0x7fff203d9552 (unknown) _platform_strlen
(pid=26693) [2021-10-16 14:12:23,331 E 26693 12194577] logging.cc:313: *** SIGSEGV received at time=1634418743 ***
(pid=26693) [2021-10-16 14:12:23,331 E 26693 12194577] logging.cc:313: PC: @ 0x7fff203d9552 (unknown) _platform_strlen
With this change, Python stack trace will be printed in addition to the stack trace above:
(pid=26693) Fatal Python error: Segmentation fault
(pid=26693)
(pid=26693) Stack (most recent call first):
(pid=26693) File "/Users/mwtian/opt/anaconda3/envs/ray/lib/python3.7/ctypes/__init__.py", line 505 in string_at
(pid=26693) File "stack.py", line 7 in f
(pid=26693) File "/Users/mwtian/work/ray-project/ray/python/ray/worker.py", line 425 in main_loop
(pid=26693) File "/Users/mwtian/work/ray-project/ray/python/ray/workers/default_worker.py", line 212 in <module>
This should make debugging crashes in Python worker easier, for users and Ray devs.
Also, try to initialize symbolizer in GCS, Raylet and core worker. This is a no-op on MacOS and some Linux environments (e.g. Ray on Ubuntu 20.04 already produces symbolized stack traces), but should make Ray more likely to have symbolized stack traces on other platforms.
2021-10-18 09:05:08 -07:00
Kai Fricke
eee05505b1
[ci/release] Add separate timeout parameter for prepare commands ( #19459 )
2021-10-18 16:29:25 +01:00
Kai Fricke
57fe405120
[ci/release] Bump long running release test timeouts to 6 minutes ( #19458 )
2021-10-18 16:27:53 +01:00
Chen Shen
9dba5e0ead
[dataset][nightly-test] fix pipeline ingest test ( #19437 )
2021-10-18 11:31:24 +01:00
Kai Fricke
6c6639a0d7
[ci/release] hotfix for undefined local variable ( #19460 )
2021-10-18 11:28:33 +01:00