Commit graph

54 commits

Author SHA1 Message Date
Guyang Song
d1d5fe61c2
[Dashboard][Frontend] Worker table enhancement (#25934) 2022-06-21 14:09:48 +08:00
Guyang Song
e13cc4088a
[Dashboard] Don't sort node list by defult (#25884) 2022-06-20 11:35:12 +08:00
mwtian
f79b826f31
[Dashboard] avoid showing disk info when it is unavailable (#24992) 2022-05-24 17:13:47 -07:00
SangBin Cho
b9c30529d8
[Core/Observability 1/N] Add a "running" state to task status (#24651)
This PR adds 2 more states into TaskStatus

enum TaskStatus {
  // The task is scheduled properly and waiting for execution.
  // It includes time to deliver the task to the remote worker + queueing time
  // from the execution side.
  WAITING_FOR_EXECUTION = 5;
  // The task that is running.
  RUNNING = 6;
}
2022-05-16 05:39:05 -07:00
Jiajun Yao
628f886af4
Don't show usage stats prompt in dashboard if prompt is disabled (#24700) 2022-05-12 07:55:28 -07:00
Jiajun Yao
1daad65568
[Doc] Add doc for usage stats collection (#24522) 2022-05-10 17:18:49 -07:00
Jiajun Yao
3fb63847e2
Show usage stats prompt (#23822)
Show usage stats prompt when it's enabled.

Current UX are:

* The usage stats enabled or disabled message is shown every time in both terminal and dashboard.
* If users don't explicitly enable or disable usage stats, the first time they start a ray cluster interactively, they will be asked to confirm and will enable if no user action within 10s. If it's non-interactive, collection is enabled by default without confirmation.
* ray.init() doesn't collect usage stats
* Usage stats can be disabled via three approaches: 1. RAY_USAGE_STATS_ENABLED env var, 2. ray xxx --disable-usage-stats, 3. ray disable-usage-stats
2022-04-25 16:01:24 -07:00
Amog Kamsetty
1d11963618
[Dashboard] Specify @types/react resolution (#23794)
A new @types/react release has broken the dashboard build. Make sure to specify the older version under package resolutions.
2022-04-07 17:24:19 -07:00
mwtian
51feac9868
Clean up dev docs (#23407) 2022-03-22 23:22:56 -07:00
Yi Cheng
7d2237bc9f
[dashboard] Remove unused fields in dashboard actor table for better memory footprint (#21919) 2022-01-26 22:48:17 -08:00
Yao Yuan
422d20e945
[Dashboard] Fix NPE when there is no GPU on the node (#21650)
There is an NPE bug that causes browser crash when no GPU on the node.
We can add a condition to fix it.
2022-01-18 08:12:49 -08:00
Simon Mo
72ae22e82b
[CI] Fix frontend build issue (#20375) 2021-11-15 10:12:43 -08:00
Avnish Narayan
026bf01071
[RLlib] Upgrade gym version to 0.21 and deprecate pendulum-v0. (#19535)
* Fix QMix, SAC, and MADDPA too.

* Unpin gym and deprecate pendulum v0

Many tests in rllib depended on pendulum v0,
however in gym 0.21, pendulum v0 was deprecated
in favor of pendulum v1. This may change reward
thresholds, so will have to potentially rerun
all of the pendulum v1 benchmarks, or use another
environment in favor. The same applies to frozen
lake v0 and frozen lake v1

Lastly, all of the RLlib tests and have
been moved to python 3.7

* Add gym installation based on python version.

Pin python<= 3.6 to gym 0.19 due to install
issues with atari roms in gym 0.20

* Reformatting

* Fixing tests

* Move atari-py install conditional to req.txt

* migrate to new ale install method

* Fix QMix, SAC, and MADDPA too.

* Unpin gym and deprecate pendulum v0

Many tests in rllib depended on pendulum v0,
however in gym 0.21, pendulum v0 was deprecated
in favor of pendulum v1. This may change reward
thresholds, so will have to potentially rerun
all of the pendulum v1 benchmarks, or use another
environment in favor. The same applies to frozen
lake v0 and frozen lake v1

Lastly, all of the RLlib tests and have
been moved to python 3.7
* Add gym installation based on python version.

Pin python<= 3.6 to gym 0.19 due to install
issues with atari roms in gym 0.20

Move atari-py install conditional to req.txt

migrate to new ale install method

Make parametric_actions_cartpole return float32 actions/obs

Adding type conversions if obs/actions don't match space

Add utils to make elements match gym space dtypes

Co-authored-by: Jun Gong <jungong@anyscale.com>
Co-authored-by: sven1977 <svenmika1977@gmail.com>
2021-11-03 16:24:00 +01:00
Philipp Moritz
45f1ff0fa9
[Windows] Update react-scripts dependency for dashboard (#19489) 2021-10-20 17:57:30 -07:00
Chu Xiangyang
505aa89d12
[Dashboard] Add start/end time for job (#18901) 2021-09-28 20:57:13 -07:00
Chu Xiangyang
2220fe8a78
[Dashboard] Keep Job timestamp as millisecond (#18806)
* [Dashboard] Keep Job timestamp as millisecond

Current the `timestamp` is already millisecond, 13 digits long,
so no need to * 1000 in the dashboard UI.

* Fix format wih prettier

* use Number to convert timestamp
2021-09-24 10:31:54 -07:00
Guyang Song
89ce8a3a02
support 'CustomFields' tooltip in dashboard (#18698) 2021-09-17 17:48:32 +08:00
Dominic Ming
97f71e15d4
[Dashboard] new dashboard event page for API Server event module (#18330) 2021-09-09 19:43:48 +08:00
Amog Kamsetty
39d60f62d2
[hotfix] fix material-ui version once more (#16901) 2021-07-06 13:57:34 -07:00
Simon Mo
b11b35aa45
hotfix material-ui version again (#16897) 2021-07-06 11:08:57 -07:00
Amog Kamsetty
d5ac5c45ea
[Dashboard] Pin material-ui/lab dependency (#16890) 2021-07-06 10:49:10 -07:00
Simon Mo
677514b3ff
Revert "[Dashboard] Actor Table UI Optimize (#15802)" (#15981)
This reverts commit 43be599a9a.
2021-05-21 10:56:15 -07:00
Dominic Ming
43be599a9a
[Dashboard] Actor Table UI Optimize (#15802) 2021-05-21 09:23:32 -07:00
Ian Rodney
7b1c5dbe0a
[Hotfix][Lint] Pin other ESlint Deps (#15816) 2021-05-14 09:18:43 -07:00
Ashwin Hegde
4d8ed6dd5c
#13890 [new-dashboard] add object store memory column (#15697) 2021-05-11 15:36:16 -05:00
Ian Rodney
c50490ccef
[Lint] Pin Prettier to 2.3.0 (#15721) 2021-05-10 11:46:29 -07:00
Ian Rodney
11b5c6c702
[HotFix][Lint] Fix Lint because of Prettier update (#15720) 2021-05-10 09:51:41 -07:00
Dmitri Gekhtman
410f768046
[Kubernetes] [Dashboard] Remove disk data from dashboard when running on K8s. (#14676) 2021-04-05 17:16:20 -07:00
Eric Liang
9db000ff2c
Auto report object store memory usage; remove some deprecated code (#14260) 2021-03-01 13:19:44 -08:00
niole
488f63efe3
[Dashboard] Make requests sent by the dashboard reverse proxy compatible (#14012) 2021-02-24 18:31:59 -08:00
Kathryn Zhou
d6521be7ef
Export GPU metrics, CPU count, and additional Memory metrics to Prometheus (#14170) 2021-02-22 10:04:18 -08:00
Kathryn Zhou
f6b5e838fe
Add disk and network metrics to Prometheus and fix dashboard (#14144) 2021-02-17 10:27:14 -08:00
Dominic Ming
4b60c388ef
[Dashboard] fix new dashboard entrance and some table problem (#13790) 2021-01-30 10:42:16 +08:00
Dominic Ming
752da83bb7
[Dashboard] Add the new dashboard code and prompt users to try it (#11667) 2021-01-29 15:22:26 +08:00
Simon Mo
321bbe1ffb
[Dashboard] Fix GPU resource rendering issue (#13388) 2021-01-14 12:23:21 -08:00
Max Fitton
25f7bdc0d8
[Bugfix][Dashboard] Fix undefined logCount, errorCount UI crash (#13113) 2020-12-30 14:19:56 -06:00
Sumanth Ratna
b7404e7955
[dashboard] Resolve npm vulnerabilities (#12620)
* npm audit fix

* npm dedupe
2020-12-08 10:26:49 -08:00
Max Fitton
34b9c7449b
[Dashboard] Fix object store memory display. (#12664) 2020-12-07 21:40:49 -08:00
Max Fitton
a5c846c83b
[Dashboard][Bugfix] Filter dead nodes from Machine View (fixes duplicate node issue) (#12579) 2020-12-02 14:08:14 -08:00
Max Fitton
2708b3abbc
[Dashboard][Bug] Fix duplicate node total rows in dashboard (#12410)
* Fix duplicate node total rows in dashboard by changing the react key of the NodeTotalRow component from the node IP to the node ID (node IP can be duplicated in the case of docker).

* simplify a piece of test code and fix a flaky time out

* lint
2020-11-30 18:43:09 -08:00
Eric Liang
9b8218aabd
[docs] Move all /latest links to /master (#11897)
* use master link

* remae

* revert non-ray

* more

* mre
2020-11-10 10:53:28 -08:00
Max Fitton
d352feadf0
[Dashboard] Memory Page Loading Wheel (#11651)
* Switch memory view loading message over to a loading wheel to make UX less confusing.

* lint

Co-authored-by: Max Fitton <max@semprehealth.com>
2020-11-03 09:37:30 -08:00
Max Fitton
3202ff74c2
[Dashboard] Don't show GPU columns if no GPU in cluster (#11704) 2020-11-02 18:07:27 -06:00
Max Fitton
b4df42b027
[Dashboard] Make Infeasible Actor UX Less Scary (#11654)
* Update infeasible actor UI so that it only shows infeasible for an ActorClassGroup if at least one actor in the class is infeasible

* lint
2020-10-29 23:12:43 -07:00
Max Fitton
d6628cdbfb
[Dashboard] Fix null gpu utilization (#11650)
* update dashboard to work if GPU utilization field is missing from GPU payload

* lint

* lint
2020-10-29 23:11:50 -07:00
Max Fitton
caf3b04b27
[Dashboard] Turn on new dashboard by default pt 2 (#11510) 2020-10-23 15:52:14 -05:00
Max Fitton
0a9cc9cce5
Revert "remove .fake build files (#11478)" (#11488)
This reverts commit 3ed3dea004.
2020-10-19 18:48:32 -07:00
Max Fitton
3ed3dea004
remove .fake build files (#11478)
Co-authored-by: Max Fitton <max@semprehealth.com>
2020-10-19 15:36:47 -07:00
Max Fitton
cd9dcfca0d
[Dashboard] CPU/GPU usage details in actor pane (#11269) 2020-10-13 20:23:23 -05:00
Max Fitton
5a42ed1848
[Dashboard] Add support for new backend to existing front-end (#11013)
* Trying to commit on top of old code again

* address comment

Co-authored-by: Max Fitton <max@semprehealth.com>
2020-10-02 12:46:47 -07:00