Commit graph

379 commits

Author SHA1 Message Date
Simon Mo
9b2a368c8c
[Runtime Env] Implement basic runtime env plugin mechanism (#19044) 2021-10-01 17:22:54 -07:00
Edward Oakes
8e5d48d668
[runtime_env] Remove deprecated override_environment_variables and worker_env fields (#18213) 2021-09-30 18:55:24 -05:00
Chu Xiangyang
505aa89d12
[Dashboard] Add start/end time for job (#18901) 2021-09-28 20:57:13 -07:00
Edward Oakes
73b8936aa8
[runtime_env] Unify rpc::RuntimeEnv with serialized_runtime_env field (#18641) 2021-09-28 15:13:15 -05:00
Eric Liang
11a2dfcaab
Improve unschedulable task warning messages by integrating with the autoscaler (#18724) 2021-09-24 12:19:58 -07:00
architkulkarni
fbf5f5d56b
[runtime env] [Serve] Fix error when uris field is None (#18874) 2021-09-24 14:07:17 -05:00
Chu Xiangyang
2220fe8a78
[Dashboard] Keep Job timestamp as millisecond (#18806)
* [Dashboard] Keep Job timestamp as millisecond

Current the `timestamp` is already millisecond, 13 digits long,
so no need to * 1000 in the dashboard UI.

* Fix format wih prettier

* use Number to convert timestamp
2021-09-24 10:31:54 -07:00
Qing Wang
6f1d3f94db
Publish actor state PENDING_CREATION for dashboard showing. (#18666) 2021-09-18 15:44:58 +08:00
Guyang Song
89ce8a3a02
support 'CustomFields' tooltip in dashboard (#18698) 2021-09-17 17:48:32 +08:00
Edward Oakes
7736cdd91d
[dashboard] Rename "new_dashboard" -> "dashboard" (#18214) 2021-09-15 11:17:15 -05:00
Tanmay Chordia
bf1176311f
[dashboard] add an endpoint to force kill an actor (#18508) 2021-09-13 20:03:15 -07:00
Edward Oakes
111a31d6a1
[runtime_env] Make Ray client server setup go through the runtime_env agent (#18478) 2021-09-13 14:16:35 -05:00
Edward Oakes
c482779da2
[runtime_env] Improve file-not-found msg in deletion (#18496) 2021-09-13 11:32:22 -05:00
Edward Oakes
2fcfea10b3
[runtime_env] Move URI deletion logic to the agent, remove util worker code (#18471) 2021-09-10 00:13:32 -07:00
Dominic Ming
97f71e15d4
[Dashboard] new dashboard event page for API Server event module (#18330) 2021-09-09 19:43:48 +08:00
Edward Oakes
f0555f88d6
[runtime_env] Move worker process startup logic to context (#18341) 2021-09-08 17:08:27 -05:00
Ian Rodney
c91e0eb065
[Dashboard] Increase Actor Snapshot Size (#18433) 2021-09-08 12:06:33 -07:00
Simon Mo
e61160d514
[Dashboard] Move gcs health check to a separate thread to avoid crashing due to excessive CPU usage. (#18236) 2021-09-03 14:23:56 -07:00
Edward Oakes
1f6705d35d
[runtime_env] Centralize runtime_env logic into ray._private.runtime_env submodule (#18310) 2021-09-03 10:19:00 -05:00
Edward Oakes
5d122cf7b7
[runtime_env] Move working dir setup to the agent (#18170) 2021-08-31 17:22:49 -05:00
Edward Oakes
17dded543c
Support passing gcs_client to internal_kv (#18235) 2021-08-31 12:46:41 -05:00
mwtian
26679d62c5
[Core][ObjectRef] Change default to not record call stack during ObjectRef creation (#18078) 2021-08-27 15:45:34 -07:00
Simon Mo
ca61e43e76
[dashboard] Fix GCS retry error message in dashboard_head (#18057) 2021-08-25 11:23:36 -05:00
Nikita Vemuri
a9c731edd3
[serve] Remove requirement to specify namespace for serve.start(detached=True) (#17470) 2021-08-25 10:39:32 -05:00
architkulkarni
97dd13be09
[Serve] [dashboard] Fix formatting bugs in cluster snapshot (#17977)
* show "unversioned" in actor metadata

* hash deployment names

* update test

* replace "Unversioned" with "None"

* bypass convert to camelCase for deployment names

* fix convert_case default to match previous setting

* lint

* replace deployment_name_hash with underscore
2021-08-24 12:06:26 -07:00
Edward Oakes
b969aa3c80
[dashboard] Don't start dashboard agent when missing dependencies (#17966) 2021-08-21 01:04:21 -07:00
architkulkarni
5ed3f0ce35
[Serve] [Dashboard] Add end times and DELETED state for endpoints (#17898) 2021-08-19 11:10:42 -05:00
Clark Zinzow
d958457d07
[Core] Second pass at privatizing APIs. (#17885)
* gcs_utils

* resource_spec

* profiling

* ray_perf and ray_cluster_perf

* test_utils
2021-08-18 20:56:33 -07:00
Simon Mo
b573864928
[CI] Add test owners (#17893) 2021-08-18 18:38:31 -07:00
architkulkarni
fcac416933
[Serve] [Dashboard] Add start times and replica tags to cluster snapshot (#17749) 2021-08-13 09:49:12 -07:00
Simon Mo
6879293b6b
[CI] Mark some tests exclusive (#17650) 2021-08-12 10:28:03 -07:00
architkulkarni
00f6b30684
[Serve] [Dashboard] Support nondetached and multiple Serve instances in cluster snapshot (#17747) 2021-08-11 22:26:54 -05:00
Jiao
e38db5875b
Add serve external kv store (#17622) 2021-08-11 12:06:14 -07:00
architkulkarni
0c2c99b951
[Dashboard] [Serve] Make serve import conditional (#17713) 2021-08-10 17:06:00 -07:00
architkulkarni
febe54f422
[serve] [dashboard] Change empty serve cluster snapshot from empty list to empty dict (#17655) 2021-08-10 13:35:00 -05:00
architkulkarni
6d975b821b
[Serve] [Dashboard] Initial PR for exporting Serve data to cluster snapshot (#17489) 2021-08-06 15:03:29 -07:00
architkulkarni
ac9a1a20df
[core] [runtime_env] Use per-env async lock in agent (#17542)
Co-authored-by: Ed Oakes <ed.nmi.oakes@gmail.com>
2021-08-06 11:11:37 -05:00
Amog Kamsetty
add6ceb3ec
[Dependencies] Fix missing dependency UX (#17420)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-08-05 20:18:42 -07:00
Ian Rodney
7b1c207be3
[Dashboard] Allow agent to bind to Wildcard address. (#17393) 2021-08-03 02:03:19 -07:00
Ian Rodney
b26ba7ba9e
[Dashboard] Allow Agent HTTP listening port to be specified. (#17392) 2021-08-02 02:09:50 -07:00
Edward Oakes
7007c6271d
[runtime_env] Gracefully fail tasks when an environment fails to be set up (#17249) 2021-07-28 15:25:02 -05:00
Simon Mo
4a4210a083
Support streaming output of runtime env setup to logger/driver (#17306) 2021-07-27 16:39:15 -07:00
fyrestone
57b9b1bb0f
[Dashboard] Use a dedicated RPC to check the GCS is alive (#16330)
* Dashboard check gcs is alive

* Fix dashboard hangs at exit

* ray health-check call GCS CheckAlive

* Minor fixes

Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-07-27 14:05:44 +08:00
Richard Liaw
597dc08dfe
Revert "Revert "[core] remove opencensus/prometheus_exporter dependencies"" (#17254)
* Revert "Revert "[core] remove opencensus/prometheus_exporter dependencies" (#17251)"

This reverts commit 7b44dd8ecb.

* Lint

* Fix more imports

Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-07-26 21:09:25 -07:00
architkulkarni
bcb3a6789b
[Core] [runtime env] Cache created runtime envs (#17342) 2021-07-26 14:37:40 -05:00
Simon Mo
7b44dd8ecb
Revert "[core] remove opencensus/prometheus_exporter dependencies" (#17251)
This reverts commit 64874e1877.
2021-07-21 16:57:47 -07:00
Kai Fricke
64874e1877
[core] remove opencensus/prometheus_exporter dependencies (#17182) 2021-07-21 12:57:31 +01:00
Kai Fricke
e881c6cff8
[core] remove aiohttp dependencies (#17181) 2021-07-21 07:18:19 +01:00
Amog Kamsetty
cb74053ee5
Retry remove gpustat dependency (#17115)
* remove gpustat

* move psutil imports
2021-07-19 11:14:10 -07:00
Amog Kamsetty
8dfd471823
Revert "Revert "[Dashboard][event] Basic event module (#16985)" (#17068)" (#17107)
This reverts commit c17e171f92.

Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-07-18 12:59:04 +08:00