mwtian
70db5c5592
[GCS][Bootstrap n/n] Do not start Redis in GCS bootstrapping mode ( #21232 )
...
After this change in GCS bootstrapping mode, Redis no longer starts and `address` is treated as the GCS address of the Ray cluster.
Co-authored-by: Yi Cheng <chengyidna@gmail.com>
Co-authored-by: Yi Cheng <74173148+iycheng@users.noreply.github.com>
2022-01-04 23:06:44 -08:00
Yi Cheng
09421a4ca6
[2/gcs] Bootstrap dashboard for gcs ha ( #21179 )
...
This is part of gcs ha project. This PR try to bootstrap dashboard with gcs address instead of redis.
Co-authored-by: mwtian <81660174+mwtian@users.noreply.github.com>
2021-12-21 16:58:03 -08:00
iasoon
1c93beb490
[serve] use true nulls in snapshot ( #21062 )
2021-12-20 16:07:09 -08:00
iasoon
33059cff3d
[serve] support not exposing deployments over http ( #21042 )
2021-12-13 09:43:55 -08:00
Guyang Song
53630ee03b
Revert "Revert "[runtime env] redefine runtime env to protobuf"" and fix windows compiling ( #20692 )
...
- Fix windows compiling and revert https://github.com/ray-project/ray/pull/20641
- Seems the pr https://github.com/ray-project/ray/pull/20670 can solve the windows compiling issue.
2021-11-24 09:01:01 -08:00
Alex Wu
9388d28233
Revert "[runtime env] redefine runtime env to protobuf" ( #20641 )
...
Reverts #19511
Breaks windows compilation
2021-11-22 13:11:30 -08:00
Guyang Song
ad56b9b432
[runtime env] redefine runtime env to protobuf ( #19511 )
2021-11-20 16:54:42 +08:00
Edward Oakes
d26c9e67e8
[job submission] Add a message
to the JobStatus to return more detailed errors ( #20491 )
2021-11-18 10:15:23 -06:00
Yi Cheng
a4e187c0e7
[gcs] Update function table to use internal kv ( #20152 )
...
## Why are these changes needed?
This is a part of redis removal. This PR remove redis kv in function table.
rpush related code is not updated in this PR.
## Related issue number
2021-11-15 23:34:41 -08:00
Yi Cheng
e54d3117a4
[gcs] Update all redis kv usage in python except function table ( #20014 )
...
## Why are these changes needed?
This is part of redis removal project. In this PR all direct usage of redis got removed except function table.
Function table will be migrated in the next PR
## Related issue number
#19443
2021-11-10 20:24:53 -08:00
Edward Oakes
81f036d078
[job submission] Move job_manager to dashboard module, common parts to common.py ( #20209 )
2021-11-10 14:14:55 -08:00
Edward Oakes
b2ddea255d
[job submission] Add job submission ID + status to /api/snapshot ( #19994 )
2021-11-03 09:49:28 -05:00
Guyang Song
ab55b808c5
[runtime env] move worker env to runtime env in Java ( #19060 )
2021-10-11 17:25:09 +08:00
Edward Oakes
73b8936aa8
[runtime_env] Unify rpc::RuntimeEnv with serialized_runtime_env field ( #18641 )
2021-09-28 15:13:15 -05:00
Edward Oakes
7736cdd91d
[dashboard] Rename "new_dashboard" -> "dashboard" ( #18214 )
2021-09-15 11:17:15 -05:00
Tanmay Chordia
bf1176311f
[dashboard] add an endpoint to force kill an actor ( #18508 )
2021-09-13 20:03:15 -07:00
Edward Oakes
17dded543c
Support passing gcs_client to internal_kv ( #18235 )
2021-08-31 12:46:41 -05:00
Nikita Vemuri
a9c731edd3
[serve] Remove requirement to specify namespace for serve.start(detached=True) ( #17470 )
2021-08-25 10:39:32 -05:00
architkulkarni
97dd13be09
[Serve] [dashboard] Fix formatting bugs in cluster snapshot ( #17977 )
...
* show "unversioned" in actor metadata
* hash deployment names
* update test
* replace "Unversioned" with "None"
* bypass convert to camelCase for deployment names
* fix convert_case default to match previous setting
* lint
* replace deployment_name_hash with underscore
2021-08-24 12:06:26 -07:00
architkulkarni
5ed3f0ce35
[Serve] [Dashboard] Add end times and DELETED state for endpoints ( #17898 )
2021-08-19 11:10:42 -05:00
Clark Zinzow
d958457d07
[Core] Second pass at privatizing APIs. ( #17885 )
...
* gcs_utils
* resource_spec
* profiling
* ray_perf and ray_cluster_perf
* test_utils
2021-08-18 20:56:33 -07:00
architkulkarni
fcac416933
[Serve] [Dashboard] Add start times and replica tags to cluster snapshot ( #17749 )
2021-08-13 09:49:12 -07:00
architkulkarni
00f6b30684
[Serve] [Dashboard] Support nondetached and multiple Serve instances in cluster snapshot ( #17747 )
2021-08-11 22:26:54 -05:00
Jiao
e38db5875b
Add serve external kv store ( #17622 )
2021-08-11 12:06:14 -07:00
architkulkarni
0c2c99b951
[Dashboard] [Serve] Make serve import conditional ( #17713 )
2021-08-10 17:06:00 -07:00
architkulkarni
febe54f422
[serve] [dashboard] Change empty serve cluster snapshot from empty list to empty dict ( #17655 )
2021-08-10 13:35:00 -05:00
architkulkarni
6d975b821b
[Serve] [Dashboard] Initial PR for exporting Serve data to cluster snapshot ( #17489 )
2021-08-06 15:03:29 -07:00
fyrestone
57b9b1bb0f
[Dashboard] Use a dedicated RPC to check the GCS is alive ( #16330 )
...
* Dashboard check gcs is alive
* Fix dashboard hangs at exit
* ray health-check call GCS CheckAlive
* Minor fixes
Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-07-27 14:05:44 +08:00
fyrestone
4ca316a0f4
Move test_snapshot from test_dashboard.py to modules/snapshot/tests/test_snapshot.py ( #16306 )
...
Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-06-08 10:26:03 -07:00
Alex Wu
e1da31f149
[dashboard] Include ray session name in dashboard snapshot ( #16199 )
...
* .
* .
* .
Co-authored-by: Alex Wu <alex@anyscale.com>
2021-06-02 15:07:06 -07:00
Alex Wu
f080911d9b
[dashboard] include worker id in actor snapshot ( #15967 )
...
Co-authored-by: Alex Wu <alex@anyscale.com>
2021-05-21 09:26:37 -07:00
Alex Wu
cd2fc7792f
[dashboard] Snapshot of cluster state ( #15868 )
2021-05-20 08:10:32 -07:00