348 commits

Author SHA1 Message Date
architkulkarni
00f6b30684
[Serve] [Dashboard] Support nondetached and multiple Serve instances in cluster snapshot (#17747) 2021-08-11 22:26:54 -05:00
Jiao
e38db5875b
Add serve external kv store (#17622) 2021-08-11 12:06:14 -07:00
architkulkarni
0c2c99b951
[Dashboard] [Serve] Make serve import conditional (#17713) 2021-08-10 17:06:00 -07:00
architkulkarni
febe54f422
[serve] [dashboard] Change empty serve cluster snapshot from empty list to empty dict (#17655) 2021-08-10 13:35:00 -05:00
architkulkarni
6d975b821b
[Serve] [Dashboard] Initial PR for exporting Serve data to cluster snapshot (#17489) 2021-08-06 15:03:29 -07:00
architkulkarni
ac9a1a20df
[core] [runtime_env] Use per-env async lock in agent (#17542)
Co-authored-by: Ed Oakes <ed.nmi.oakes@gmail.com>
2021-08-06 11:11:37 -05:00
Amog Kamsetty
add6ceb3ec
[Dependencies] Fix missing dependency UX (#17420)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-08-05 20:18:42 -07:00
Ian Rodney
7b1c207be3
[Dashboard] Allow agent to bind to Wildcard address. (#17393) 2021-08-03 02:03:19 -07:00
Ian Rodney
b26ba7ba9e
[Dashboard] Allow Agent HTTP listening port to be specified. (#17392) 2021-08-02 02:09:50 -07:00
Edward Oakes
7007c6271d
[runtime_env] Gracefully fail tasks when an environment fails to be set up (#17249) 2021-07-28 15:25:02 -05:00
Simon Mo
4a4210a083
Support streaming output of runtime env setup to logger/driver (#17306) 2021-07-27 16:39:15 -07:00
fyrestone
57b9b1bb0f
[Dashboard] Use a dedicated RPC to check the GCS is alive (#16330)
* Dashboard check gcs is alive

* Fix dashboard hangs at exit

* ray health-check call GCS CheckAlive

* Minor fixes

Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-07-27 14:05:44 +08:00
Richard Liaw
597dc08dfe
Revert "Revert "[core] remove opencensus/prometheus_exporter dependencies"" (#17254)
* Revert "Revert "[core] remove opencensus/prometheus_exporter dependencies" (#17251)"

This reverts commit 7b44dd8ecb.

* Lint

* Fix more imports

Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-07-26 21:09:25 -07:00
architkulkarni
bcb3a6789b
[Core] [runtime env] Cache created runtime envs (#17342) 2021-07-26 14:37:40 -05:00
Simon Mo
7b44dd8ecb
Revert "[core] remove opencensus/prometheus_exporter dependencies" (#17251)
This reverts commit 64874e1877.
2021-07-21 16:57:47 -07:00
Kai Fricke
64874e1877
[core] remove opencensus/prometheus_exporter dependencies (#17182) 2021-07-21 12:57:31 +01:00
Kai Fricke
e881c6cff8
[core] remove aiohttp dependencies (#17181) 2021-07-21 07:18:19 +01:00
Amog Kamsetty
cb74053ee5
Retry remove gpustat dependency (#17115)
* remove gpustat

* move psutil imports
2021-07-19 11:14:10 -07:00
Amog Kamsetty
8dfd471823
Revert "Revert "[Dashboard][event] Basic event module (#16985)" (#17068)" (#17107)
This reverts commit c17e171f92.

Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-07-18 12:59:04 +08:00
fyrestone
e2808a35cf
Dashboard job module uses attrs instead of pydantic for job description (#17116) 2021-07-16 22:26:00 +08:00
Amog Kamsetty
caa78a3cff
Revert "[Core] Remove gpustat from core dependencies (#17059)" (#17106)
This reverts commit 7ec18f671a.
2021-07-14 20:19:33 -07:00
Amog Kamsetty
c17e171f92
Revert "[Dashboard][event] Basic event module (#16985)" (#17068)
This reverts commit f1faa79a04.
2021-07-13 23:18:43 -07:00
Amog Kamsetty
7ec18f671a
[Core] Remove gpustat from core dependencies (#17059) 2021-07-13 21:22:02 -07:00
fyrestone
f1faa79a04
[Dashboard][event] Basic event module (#16985)
* Basic event module

* Fix comments

* Set the SCAN_EVENT_DIR_INTERVAL_SECONDS defaults to 2

* Fix lint

* Fix lint

* Clean code

* Try to fix flaky

* Fix test

* Disable event module by default

* Make monitor events task cancellable

* Fix error

Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-07-13 19:08:39 -07:00
Amog Kamsetty
a14342ce6f
Revert "[Dashboard][event] Basic event module (#16698)" (#17004)
This reverts commit 66ea099897.
2021-07-12 11:22:46 -07:00
fyrestone
66ea099897
[Dashboard][event] Basic event module (#16698)
* Basic event module

* Fix comments

* Set the SCAN_EVENT_DIR_INTERVAL_SECONDS defaults to 2

* Fix lint

* Fix lint

* Clean code

* Try to fix flaky

* Fix test

* Disable event module by default

Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-07-09 10:25:30 -07:00
Amog Kamsetty
39d60f62d2
[hotfix] fix material-ui version once more (#16901) 2021-07-06 13:57:34 -07:00
Simon Mo
b11b35aa45
hotfix material-ui version again (#16897) 2021-07-06 11:08:57 -07:00
Amog Kamsetty
d5ac5c45ea
[Dashboard] Pin material-ui/lab dependency (#16890) 2021-07-06 10:49:10 -07:00
architkulkarni
06dfd8dddb
Revert "[Dashboard][event] Basic event module (#16283)" (#16676)
This reverts commit 5afa53aa64.
2021-06-25 09:38:18 -07:00
SongGuyang
e74d9d3ded
[runtime env] Download runtime env(conda) in agent instead of setup_worker (#16525) 2021-06-25 19:39:05 +08:00
fyrestone
5afa53aa64
[Dashboard][event] Basic event module (#16283) 2021-06-25 13:59:02 +08:00
SongGuyang
874e947d6f
[runtime env] support create or delete runtime envs in agent (#15904) 2021-06-09 20:22:25 +08:00
fyrestone
4ca316a0f4
Move test_snapshot from test_dashboard.py to modules/snapshot/tests/test_snapshot.py (#16306)
Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-06-08 10:26:03 -07:00
fyrestone
dfadf33a94
[Dashboard] Reorganize dashboard modules - node (#16217) 2021-06-07 19:50:46 -07:00
Alex Wu
e1da31f149
[dashboard] Include ray session name in dashboard snapshot (#16199)
* .

* .

* .

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-06-02 15:07:06 -07:00
fyrestone
c53893cb13
[Dashboard] Reorganize dashboard modules - actor (#16170) 2021-06-02 06:58:30 -07:00
Simon Mo
677514b3ff
Revert "[Dashboard] Actor Table UI Optimize (#15802)" (#15981)
This reverts commit 43be599a9a.
2021-05-21 10:56:15 -07:00
Alex Wu
f080911d9b
[dashboard] include worker id in actor snapshot (#15967)
Co-authored-by: Alex Wu <alex@anyscale.com>
2021-05-21 09:26:37 -07:00
Dominic Ming
43be599a9a
[Dashboard] Actor Table UI Optimize (#15802) 2021-05-21 09:23:32 -07:00
Alex Wu
cd2fc7792f
[dashboard] Snapshot of cluster state (#15868) 2021-05-20 08:10:32 -07:00
Ian Rodney
7b1c5dbe0a
[Hotfix][Lint] Pin other ESlint Deps (#15816) 2021-05-14 09:18:43 -07:00
fyrestone
56c309416e
[Job submission] Basic job submission structure (#15103) 2021-05-12 15:08:20 +08:00
Ashwin Hegde
4d8ed6dd5c
#13890 [new-dashboard] add object store memory column (#15697) 2021-05-11 15:36:16 -05:00
Ian Rodney
90ce25cb35
[dashboard] Avoid global min_workers (#15660) 2021-05-10 15:47:51 -07:00
Ian Rodney
c50490ccef
[Lint] Pin Prettier to 2.3.0 (#15721) 2021-05-10 11:46:29 -07:00
Ian Rodney
11b5c6c702
[HotFix][Lint] Fix Lint because of Prettier update (#15720) 2021-05-10 09:51:41 -07:00
SongGuyang
b8ff86adb9
Add objectStore stats to dashboard API. (#15677) 2021-05-10 11:32:14 -05:00
Amog Kamsetty
ebc44c3d76
[CI] Upgrade flake8 to 3.9.1 (#15527)
* formatting

* format util

* format release

* format rllib/agents

* format rllib/env

* format rllib/execution

* format rllib/evaluation

* format rllib/examples

* format rllib/policy

* format rllib utils and tests

* format streaming

* more formatting

* update requirements files

* fix rllib type checking

* updates

* update

* fix circular import

* Update python/ray/tests/test_runtime_env.py

* noqa
2021-05-03 14:23:28 -07:00
Ian Rodney
546e5f6f13
[API] Remove non-API top Level function imports (#15440) 2021-04-27 12:33:59 -07:00