Ian Rodney
857408874c
[Autoscaler][Azure] check if 'update' is available in ( #14787 )
2021-03-19 08:39:46 -07:00
Amog Kamsetty
47300d5a53
[SGD] Worker Startup Fault Tolerance ( #14724 )
2021-03-18 22:53:56 -07:00
Eric Liang
c30d5f445c
Nonblocking release for ray client to deflake tests ( #14782 )
...
* fix
* update
* fix
2021-03-18 21:49:36 -07:00
Ian Rodney
00aceaae37
[Client] Test Serialization in a platform independent way. ( #14786 )
2021-03-18 18:24:44 -07:00
Amog Kamsetty
7ee2e4185b
[Tune] PTL Fractional GPUs ( #14781 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-18 17:07:51 -07:00
Lixin Wei
9227a83b59
Print ERROR log of actor creation task ( #14764 )
2021-03-18 16:56:55 -07:00
Richard Liaw
ebc71339fe
[client] fix multi-threading bugs ( #14701 )
2021-03-18 16:25:55 -07:00
Dmitri Gekhtman
da56a863f9
[Kubernetes][autoscaler] Deep copy in K8s Node Provider to fix scaling issues ( #14773 )
2021-03-18 18:17:57 -05:00
Ian Rodney
0495d6af15
[autoscaler] fix azure config issues ( #14750 )
2021-03-18 16:00:25 -07:00
Yi Cheng
881a46e1d6
[core] RuntimeEnv GC in local node ( #14594 )
2021-03-18 14:55:11 -07:00
Ian Rodney
eb12033612
[Code Cleanup] Switch to use ray.util.get_node_ip_address() ( #14741 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-18 13:10:57 -07:00
Richard Liaw
1d033fb552
[client] Fix serialization of RayTaskError ( #14698 )
2021-03-18 12:26:33 -07:00
Richard Liaw
8201e4ea11
[client] fix refcounting for named actors ( #14753 )
...
* max-workers
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
* fix
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-18 12:20:29 -07:00
Eric Liang
ef249c98b1
[flaky test] Fix test_cli by disabling config cache for dashboard test ( #14755 )
2021-03-18 12:02:25 -07:00
SangBin Cho
351540e17e
[Test] Fix flaky object spilling test linux ( #14757 )
...
* Fix.
* done.
2021-03-18 11:37:09 -07:00
Edward Oakes
90f5ebac72
[serve] Add backend_state tests for updating backend config ( #14772 )
2021-03-18 12:58:39 -05:00
Edward Oakes
de598149d1
[serve] Add tests for backend_state versioning ( #14748 )
2021-03-18 11:08:45 -05:00
Ian Rodney
971855a353
[Serve] Disable Final Standalone Test on Windows ( #14761 )
2021-03-18 09:26:55 -05:00
Edward Oakes
91308b9b52
[serve] Refactor to add basic unit tests for BackendState ( #14740 )
2021-03-17 22:35:28 -05:00
Ian Rodney
50e95ad6dd
[Serve] Disable More test::standalone on windows ( #14751 )
2021-03-17 16:51:02 -07:00
Edward Oakes
34b5781ae0
[serve] Add basic support for a declarative deploy() API call ( #14720 )
2021-03-17 16:00:23 -05:00
Edward Oakes
f2013a0586
[serve] Skip test_standalone::test_connect on windows ( #14747 )
2021-03-17 13:50:34 -07:00
Lixin Wei
72d87093b9
[Core] Make Actor DEAD and Save Exceptions in GCS When Error Happens in Constructor ( #14211 )
2021-03-17 12:50:28 -07:00
Alex Wu
534846a1d2
[Autoscaler] Track failed nodes ( #14608 )
2021-03-17 12:49:31 -07:00
Siyuan (Ryans) Zhuang
6d346e74a6
cleanup python code ( #14691 )
...
* cleanup python code
2021-03-17 10:45:05 -07:00
Ian Rodney
10250d737f
[Autoscaler] Add tests around docker run options ( #14713 )
2021-03-17 10:13:51 -07:00
Edward Oakes
c781197755
[serve] Temporarily disable ray client test ( #14733 )
2021-03-17 08:48:05 -07:00
Edward Oakes
aab7ccc466
[serve] Deprecate client-based API in favor of process-wide singleton ( #14696 )
2021-03-17 09:39:54 -05:00
Siyuan (Ryans) Zhuang
f30ac73640
update cloudpickle to commit 6e0f571 ( #14693 )
2021-03-16 12:36:43 -07:00
Edward Oakes
5a45e3351f
add Serve service by default ( #14711 )
2021-03-16 10:34:30 -07:00
Eric Liang
b240f5f0c9
Incremental refactor of runtime_env for consistency ( #14632 )
2021-03-16 10:11:50 -07:00
Edward Oakes
ae2c20c1ac
[serve] Include required and available resources in slow startup message ( #14695 )
2021-03-15 21:32:07 -05:00
Ian Rodney
d251bb676d
[Autoscaler] Get_Head_Node should return an up-to-date
node ( #14579 )
2021-03-15 17:48:18 -07:00
Ian Rodney
4160f18e46
[Autoscaler] Remove Dangerous Default Values ( #14694 )
2021-03-15 17:47:40 -07:00
Xianyang Liu
c3d8ef1b49
[autoscaler] Auto detect memory resource + P0 bug fix in k8s ( #14567 )
...
* auto detect memory resources
* format
* fixes
* fixes
* fixes
* fixes
* fixes
* address comments and fix ci
* address comments
* address comments
* corrent comments
* fixes
2021-03-16 02:32:54 +02:00
Ezra-H
eda01f9f8c
[Collective] distributed cpu tests ( #14591 )
...
Co-authored-by: Hao Zhang <zhisbug@gmail.com>
2021-03-15 16:30:57 -07:00
Edward Oakes
f1a42c4744
[serve] [ray_client] Basic test for using Ray client with Serve ( #14163 )
2021-03-15 15:30:32 -07:00
Mark
14562a2b90
Fix the docstring for ray.cancel ( #14670 )
...
* Fix the docstring for ray.cancel
* Update python/ray/worker.py
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
2021-03-15 15:16:17 -07:00
Edward Oakes
9f1421e3a3
[metrics] Standardize histogram to Prometheus API ( #14627 )
2021-03-15 14:37:56 -05:00
architkulkarni
8b17ec7c6d
[Serve] Disable auto conda env setting if using Ray Client ( #14672 )
2021-03-15 14:36:46 -05:00
Edward Oakes
d90cd545d1
[serve] Deprecate system-level batching with warning, update the docs ( #14648 )
2021-03-15 13:47:01 -05:00
Cassidy Laidlaw
ef5adc1476
Include py.typed in MANIFEST.in to make sure it is packaged in wheels ( #14492 )
2021-03-15 10:43:12 -07:00
Edward Oakes
c409b5b63a
[metrics] Standardize Gauge to Prometheus interface ( #14626 )
2021-03-15 12:15:26 -05:00
Ian Rodney
63d77e0d4c
[Testing] Deflake test_object_spilling ( #14678 )
2021-03-15 10:12:53 -07:00
architkulkarni
dc2be14462
[Serve] Fix bug where create_backend
fails in (base) conda env ( #14633 )
2021-03-15 10:06:41 -05:00
Tao Wang
c572563e1e
[large scale]Add enable sharding option and disable sharding for gcs client ( #14600 )
2021-03-15 19:35:00 +08:00
Richard Liaw
c2aeccaf14
[tune] revert all mnist tests ( #14677 )
...
This reverts commit 3f557348a2
.
2021-03-14 23:58:13 -07:00
Eric Liang
3bdcca7ee5
Add basic concurrency test for ray client ( #14630 )
2021-03-13 11:24:57 -08:00
Edward Oakes
66be4801c6
Add deprecation warning to Counter.record() ( #14622 )
2021-03-12 17:04:28 -06:00
Eric Liang
b47036d014
Bump Ray client protocol version; fix dataclasses dependency for py 3.6 ( #14654 )
2021-03-12 14:58:34 -08:00