ray/dashboard/modules/reporter
mwtian 65d7a610ab
[Core] Push message to driver when a Raylet dies (#25516)
Currently, when a Raylet dies, it is hard to figure out:

- whether a Raylet died at all in the cluster. Usually we have to check the nodes where a number of workers died and see whether the Raylet died as well.
- the reason for the Raylet's death.

With this PR, if a Raylet dies for a reason other than SIGTERM, the dashboard agent will report the failure along with the last 20 lines of the Raylet log.
2022-06-09 05:54:34 -07:00
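The reporting behavior described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the code in reporter_agent.py: the helper names `tail_lines` and `build_death_report`, the message wording, and the assumption that the Raylet's exit status is available as a `subprocess`-style return code (negative signal number when killed by a signal) are all hypothetical.

```python
import collections
import signal
from typing import List, Optional


def tail_lines(path: str, n: int = 20) -> List[str]:
    """Return the last n lines of a text file (hypothetical helper)."""
    with open(path, "r", errors="replace") as f:
        # A bounded deque keeps only the most recent n lines while streaming.
        return [line.rstrip("\n") for line in collections.deque(f, maxlen=n)]


def build_death_report(
    return_code: int, log_path: str, n: int = 20
) -> Optional[str]:
    """Build a failure message, or None if the Raylet exited via SIGTERM.

    Assumes the subprocess convention that a return code of -N means the
    process was terminated by signal N.
    """
    if return_code == -signal.SIGTERM:
        # SIGTERM is treated as an intentional shutdown: nothing to report.
        return None
    tail = "\n".join(tail_lines(log_path, n))
    return (
        f"Raylet exited unexpectedly (return code {return_code}). "
        f"Last {n} log lines:\n{tail}"
    )
```

In this sketch the caller would forward the returned string to the driver whenever it is not None; how that message is actually published is outside what the commit message states.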
..
tests [Core] Export additional metrics for workers and Raylet memory (#25418) 2022-06-06 10:58:14 -07:00
__init__.py [Dashboard] New dashboard skeleton (#9099) 2020-07-27 11:34:47 +08:00
reporter_agent.py [Core] Push message to driver when a Raylet dies (#25516) 2022-06-09 05:54:34 -07:00
reporter_consts.py [CI] Format Python code with Black (#21975) 2022-01-29 18:41:57 -08:00
reporter_head.py Clean up docstyle in python modules and add LINT rule (#25272) 2022-06-01 11:27:54 -07:00