hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-05 18:11:42 -05:00

Author	SHA1	Message	Date
SangBin Cho	39b9c44c8d	[State Observability] pre-alpha documentation (#26560 ) Adds Documentation for state APIs API reference	2022-07-26 05:49:28 -07:00
SangBin Cho	6552e096e6	[State Observability] Summary APIs (#25672 ) Task/actor/object summary Tasks: Group by the func name. In the future, we will also allow to group by task_group. Actors: Group by actor class name. In the future, we will also allow to group by actor_group. Object: Group by callsite. In the future, we will allow to group by reference type or task state.	2022-06-22 06:21:50 -07:00
Eric Liang	43aa2299e6	[api] Annotate as public / move ray-core APIs to _private and add enforcement rule (#25695 ) Enable checking of the ray core module, excluding serve, workflows, and tune, in ./ci/lint/check_api_annotations.py. This required moving many files to ray._private and associated fixes.	2022-06-21 15:13:29 -07:00
Stephanie Wang	abf2a70a29	[core] Add task and object reconstruction status to ray memory (#22317 ) Improve observability for general objects and lineage reconstruction by adding a "Status" field to `ray memory`. The value of the field can be: ``` // The task is waiting for its dependencies to be created. WAITING_FOR_DEPENDENCIES = 1; // All dependencies have been created and the task is scheduled to execute. SCHEDULED = 2; // The task finished successfully. FINISHED = 3; ``` In addition, tasks that failed or that needed to be re-executed due to lineage reconstruction will have a field listing the attempt number. Example output: ``` IP Address \| PID \| Type \| Call Site \| Status \| Size \| Reference Type \| Object Ref 192.168.4.22 \| 279475 \| Driver \| (task call) ... \| Attempt #2: FINISHED \| 10000254.0 B \| LOCAL_REFERENCE \| c2668a65bda616c1ffffffffffffffffffffffff0100000001000000 ```	2022-02-22 21:26:21 -08:00
Balaji Veeramani	7f1bacc7dc	[CI] Format Python code with Black (#21975 ) See #21316 and #21311 for the motivation behind these changes.	2022-01-29 18:41:57 -08:00
SangBin Cho	2010f13175	Fix dashboard test bug (#21742 ) Currently `wait_until_succeeded_without_exception` is used in the dashboard, and it returns True/False. Unfortunately, there are lots of code that doesn't assert on this method (which means things are not actually tested).	2022-01-24 11:38:51 -06:00
Edward Oakes	7736cdd91d	[dashboard] Rename "new_dashboard" -> "dashboard" (#18214 )	2021-09-15 11:17:15 -05:00
mwtian	26679d62c5	[Core][ObjectRef] Change default to not record call stack during ObjectRef creation (#18078 )	2021-08-27 15:45:34 -07:00
fyrestone	dfadf33a94	[Dashboard] Reorganize dashboard modules - node (#16217 )	2021-06-07 19:50:46 -07:00
Micah Yong	b3089b31f2	[RFC] Ray memory improvements: format and summary (#14520 ) * Better formatting when terminal size doesn't support tabular * Summary now displays size of reference types * Add unit conversion support (e.g. b, kb, mb, gb) * Format and test * Add ability to specify the number of sorted entries * Linting * Clean up group summary, move import defaultdict, comment num entries counter, n * Format and lint	2021-03-28 21:03:06 -07:00
Clark Zinzow	5a788474aa	[Core] First pass at privatizing non-public Python APIs. (#14607 ) * async_compat * utils * cluster_utils * compat * function_manager * import_thread * memory_monitor * monitor, log_monitor, ray_process_reaper * metrics_agent * parameter * prometheus_exporter * ray_logging * signature	2021-03-10 22:47:28 -08:00
SangBin Cho	a04ab9b472	[Core] Fix ray memory bug (#14452 ) * ray memory bug * Fix ray memory issue. * done.	2021-03-03 09:20:00 -08:00
SangBin Cho	09fd38ede1	[Multi node shuffle] More efficient ray memory --stats-only (#14423 ) * Done. * Fix all the issues.	2021-03-01 23:14:06 -08:00
Micah Yong	db0c16824c	[Dashboard][CLI] Ray memory parity with dashboard 2 (#13444 ) * Minor improvements in Ray Core Walkthrough as seen in https://github.com/ray-project/ray/issues/12472 * Define node_stats() to return NodeStats object from cluster * Add --group-by and --sort-by capabilities to ray memory script * Resolve merge conflict * Add helper functions for group by and sorting type in memory_utils.py * Reformat * Format * Compartmentalize memory script into get_memory_summary and get_store_stats_summary * Modify unit tests in test_mem_stat * Lint and format * Test cases for group_by sort_by * Lint and format * Fix actor handle failing test case * Update test_memstat.py * Resolve merge conflicts * Adjust ray memory output based on terminal size * Formatting and linting * Use constant for callsite length * Switch from OS to shutil for querying terminal size (official python support) * Linting and formatting * Lint and format * Resolve lint issue in walkthrough.rst * Revert to python 3.6 * Delete visitor.py It was accidentally included in most recent commit * Delete .eggs It was accidentally included in most recent commit * Resolve test_object_spilling.py test case * Add stats only argument * revert changes on this file * Remove package-lock.json * Add back npm installation * Sync package-lock.json * Linting and formatting * Sync with package-lock * Sync with package-lock pt 2 * Update documentation in https://docs.ray.io/en/master/memory-management.html * Add include_memory_info as argument for node_stats * Switch object ref and call site positions * Linting and formatting * Change from MiB to B * Change from stats-only to store-true * Add memory test case * Add memory test case * Lint and format * Correct test in memstat * Change line wrap and stats only to flags * Clarify --stats-only and --no-format in ray memory * --stats-only description modified Co-authored-by: Micah Yong <micahyong@Micahs-MacBook-Pro.local>	2021-03-01 09:27:22 -08:00
fyrestone	05ad4c7499	[Dashboard] Optimize dashboard datacenter (#11391 ) * Optimize dashboard datacenter * Fix tests * Fix tests * Fix * Fix CI * python/build-wheel-macos.sh Co-authored-by: 刘宝 <po.lb@antfin.com> Co-authored-by: Max Fitton <maxfitton@anyscale.com>	2020-10-27 23:49:31 -07:00
Max Fitton	ff6d412ad9	[Dashboard] Add API support for the logical view and machine view in new backend (#11012 ) * Add API support for the logical view and machine view, which lean on datacenter in common. * Update dashboard/datacenter.py Co-authored-by: fyrestone <fyrestone@outlook.com> * Update dashboard/modules/logical_view/logical_view_head.py Co-authored-by: fyrestone <fyrestone@outlook.com> * Address PR comments * lint * Add dashboard tests to CI build * Fix integration issues * lint Co-authored-by: Max Fitton <max@semprehealth.com> Co-authored-by: fyrestone <fyrestone@outlook.com>	2020-10-02 17:58:44 -07:00
Max Fitton	9a6d01ebf9	[Dashboard] Add utility functions for actor and memory APIs (#11011 ) * Add actor and memory utility functions needed by upcoming logical view and memory view APIs * Add a method to allow printing Dict custom class and add support for hot-reloading local dev environment. * Address PR comments * Add unit tests from test metrics to branch for new memory_utils module * Add note about sorting / grouping * lint Co-authored-by: Max Fitton <max@semprehealth.com>	2020-10-01 23:48:03 -07:00

17 commits