hiro/ray - Forgejo: Beyond coding. We Forge.

hiro/ray

mirror of https://github.com/vale981/ray synced 2025-03-06 10:31:39 -05:00

Author	SHA1	Message	Date
Simon Mo	efee158cec	[Serve] Use Async Handle for DAG Execution (#27411 )	2022-08-06 22:23:44 -07:00
zcin	64c550a2b1	Revert "[serve] Integrate and Document Bring-Your-Own Gradio Applications (#26403 )" (#27587 ) This reverts commit `8a9d994dd0`.	2022-08-06 21:38:55 -07:00
Sihan Wang	5fe586b881	[Serve/Doc] Add deployment migration guide (#27408 ) Co-authored-by: Archit Kulkarni <architkulkarni@users.noreply.github.com>	2022-08-05 14:28:48 -05:00
zcin	22db41c21a	[Serve][doc] Modify and Combine Tensorflow, Pytorch, Sklearn Tutorials (#26817 )	2022-08-05 11:55:31 -05:00
zcin	04c7ccacf1	[Serve][Doc] Moves Serve REST API and Serve CLI API into separate subpages (#26914 )	2022-08-05 11:51:53 -05:00
zcin	8a9d994dd0	[serve] Integrate and Document Bring-Your-Own Gradio Applications (#26403 ) Integration between Ray Serve and Gradio. Users of Gradio can wrap their Gradio app in a Serve deployment by using `GradioIngress`, and scale it up through more replicas or more CPU/GPU resources.	2022-08-05 11:31:00 -05:00
Archit Kulkarni	1714d0266b	[Doc] [Serve] Refresh code for "monitoring" for 2.0 (#27400 )	2022-08-04 20:10:12 -07:00
shrekris-anyscale	11abc89746	[Serve] [Docs] Use dashboard agent port in REST API documentation (#27450 )	2022-08-04 10:24:57 -07:00
Archit Kulkarni	9f0d8e364d	[Doc] Update Serve architecture doc for 2.0 (#26861 ) - Move autoscaling architecture from autoscaling page to architecture page - Update architecture page - Remove "Router" actor - Update description of ServeHandle - Update defaults about HTTPproxy (default one on each node -> default just one per cluster, on the head node) - Add note about fault tolerance in different failure scenarios - Assorted typos/usage nits Co-authored-by: shrekris-anyscale <92341594+shrekris-anyscale@users.noreply.github.com> Co-authored-by: Simon Mo <simon.mo@hey.com>	2022-08-03 14:30:33 -05:00
Archit Kulkarni	a12c04a2fe	[Serve] [Doc] Update key concepts for 2.0, remove deprecated APIs (#26965 ) Removes deprecated APIs: - serve.start() - get_handle() Rewrites the ServeHandle doc snippet to use the recommended workflow for ServeHandles (only access them from other deployments, pass Deployments in as input args to `.bind()`, which get resolved to ServeHandles at runtime) Co-authored-by: shrekris-anyscale <92341594+shrekris-anyscale@users.noreply.github.com>	2022-08-03 11:27:23 -05:00
Archit Kulkarni	e02b072939	[Doc] [Serve] Edit grammar/usage/organization for HTTP adapters page (#26969 ) Moves FastAPI into its own section instead of appearing in a duplicated note. Co-authored-by: simon-mo <simon.mo@hey.com>	2022-08-02 15:08:05 -05:00
shrekris-anyscale	324d8e4bca	[Serve] Serialize `user_config` with JSON instead of Pickle (#26235 )	2022-08-01 17:53:43 -07:00
shrekris-anyscale	cc84953da3	[Serve] [Docs] Update "Getting Started" documentation (#26745 )	2022-08-01 16:31:48 -07:00
shrekris-anyscale	510a0e038c	[Serve] Add `host` and `port` options to the Serve config file (#27026 ) The Serve CLI and REST API always sets the host to `0.0.0.0` and the port to Serve's default. This change adds `host` and `port` as top level options in the Serve config file, so users can manually set the host and port of their Serve application to different values. This change introduces a new Serve config file format: ```yaml import_path: ... runtime_env: ... host: ... port: ... deployments: ... ... ``` `host` and `port` are optional and can be omitted. A running Serve application's `host` and `port` cannot be changed. If a user tries to `serve deploy` a config file with different `host` and `port` options than an already-running Serve application, `serve deploy` will fail without making any changes to the application. The user must `serve shutdown` their application and restart it with `serve deploy` to change their `host` and `port`. Follow-Up Items * The following CLI commands should not start Serve automatically. They should check whether Serve is running and perform some sort of no-op if it's not. That would alleviate the concern that the user starts Serve by accident through a `GET` request and needs to deal with default `host`/`port` options. Corresponding docs should also be updated. * `serve status` * `serve config` * `serve shutdown`	2022-07-28 11:26:46 -05:00
Simon Mo	e5a8b1dd55	[Serve] Add API Annotations And Move to _private (#27058 )	2022-07-27 09:08:26 -07:00
Dmitri Gekhtman	a70ada7341	[kubernetes][docs] Implement landing page and getting started guide (#26912 ) Implements a landing page for the new KubeRay-based deployment guide. Implements a "Getting started" Jupyter notebook	2022-07-26 00:41:56 -07:00
Sihan Wang	8ecd928c34	[Serve] Make the checkpoint and recover only from GCS (#26753 )	2022-07-25 14:24:53 -07:00
Stephanie Wang	55a0f7bb2d	[core] ray.init defaults to an existing Ray instance if there is one (#26678 ) ray.init() will currently start a new Ray instance even if one is already existing, which is very confusing if you are a new user trying to go from local development to a cluster. This PR changes it so that, when no address is specified, we first try to find an existing Ray cluster that was created through `ray start`. If none is found, we will start a new one. This makes two changes to the ray.init() resolution order: 1. When `ray start` is called, the started cluster address was already written to a file called `/tmp/ray/ray_current_cluster`. For ray.init() and ray.init(address="auto"), we will first check this local file for an existing cluster address. The file is deleted on `ray stop`. If the file is empty, autodetect any running cluster (legacy behavior) if address="auto", or we will start a new local Ray instance if address=None. 2. When ray.init(address="local") is called, we will create a new local Ray instance, even if one is already existing. This behavior seems to be necessary mainly for `ray.client` use cases. This also surfaces the logs about which Ray instance we are connecting to. Previously these were hidden because we didn't set up the log until after connecting to Ray. So now Ray will log one of the following messages during ray.init: ``` (Connecting to existing Ray cluster at address: <IP>...) ...connection... (Started a local Ray cluster.\| Connected to Ray Cluster.)( View the dashboard at <URL>) ``` Note that this changes the dashboard URL to be printed with `ray.init()` instead of when the dashboard is first started. Co-authored-by: Eric Liang <ekhliang@gmail.com>	2022-07-23 11:27:22 -07:00
Sihan Wang	27f1532a15	[Serve] Promote graceful shutdown and health check (#26682 )	2022-07-21 17:37:10 -05:00
Sihan Wang	b606169cb5	[Serve] Promote autoscaling feature (#26393 ) 1. get rid of the private attribute 2. fix unit test 3. docs and workflows	2022-07-13 14:38:38 -05:00
ej	636105e8e2	[Docs] [Serve] Has a consistent landing page style (#26029 )	2022-07-08 11:58:21 -07:00
brucez-anyscale	f76d7b23f2	Revert "Revert "[Dashboard][Serve] Move Serve related endpoints to dashboard agent"" (#26336 )	2022-07-06 19:37:30 -07:00
Yi Cheng	12d147ff1f	Revert "[Dashboard][Serve] Move Serve related endpoints to dashboard agent (#26107 )" (#26333 ) This reverts commit `84166ccb04`.	2022-07-06 13:30:33 -07:00
brucez-anyscale	84166ccb04	[Dashboard][Serve] Move Serve related endpoints to dashboard agent (#26107 ) In Ray 2.0, we want to achieve api server HA. Originally serve endpoints are in head node. This pr moves serve endpoints to dashboard agents, so they will be HA due to multiple replica of dashboard agent.	2022-07-06 10:58:00 -07:00
Simon Mo	88a219c7f2	Revert "Revert "[AIR][Serve] Rename ModelWrapperDeployment -> PredictorDeployment"" (#26231 )	2022-07-05 13:26:49 -07:00
Archit Kulkarni	84be085a5a	[Doc] Fix typo in Serve doc (#26211 )	2022-06-29 16:15:26 -07:00
Stephanie Wang	c9be251b7a	Revert "[AIR][Serve] Rename ModelWrapperDeployment -> PredictorDeployment (#25962 )" (#26176 ) This reverts commit `68692b3464`.	2022-06-28 17:07:07 -07:00
Simon Mo	68692b3464	[AIR][Serve] Rename ModelWrapperDeployment -> PredictorDeployment (#25962 )	2022-06-28 10:26:10 -07:00
shrekris-anyscale	6092869ff3	[Serve] [Docs] Create end-to-end documentation example for Serve REST API and CLI (#25936 )	2022-06-24 14:44:39 -07:00
shrekris-anyscale	97a9a20f74	[Serve] [Docs] Add Serve REST API Schema to Serve API Docs (#25786 )	2022-06-24 14:06:26 -07:00
Sihan Wang	c0cf9b8098	[Serve][Doc] Autoscaling (#25646 ) - new section of doc for autoscaling (introduction of serve autoscaling and config parameter) - Remove the version requirement note inside the doc Co-authored-by: Simon Mo <simon.mo@hey.com> Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com> Co-authored-by: shrekris-anyscale <92341594+shrekris-anyscale@users.noreply.github.com> Co-authored-by: Archit Kulkarni <architkulkarni@users.noreply.github.com>	2022-06-22 15:32:18 -05:00
Eric Liang	43aa2299e6	[api] Annotate as public / move ray-core APIs to _private and add enforcement rule (#25695 ) Enable checking of the ray core module, excluding serve, workflows, and tune, in ./ci/lint/check_api_annotations.py. This required moving many files to ray._private and associated fixes.	2022-06-21 15:13:29 -07:00
Jiao	f6735f90c7	[Ray DAG] Move `dag` project folder out of `experimental` (#25532 )	2022-06-16 19:15:39 -07:00
shrekris-anyscale	d944f7469c	[Serve] [Docs] Remove references to namespaces in the Serve documentation (#25830 ) #25575 starts all Serve actors in the `"serve"` namespace. This change updates the Serve documentation to remove now-outdated explanations about namespaces and to specify that all Serve actors start in the `"serve"` namespace.	2022-06-16 10:50:49 -05:00
zcin	3f91cbd979	[serve][docs] Replaced term 'actor_init_options' with 'ray_actor_options' in documentation (#25808 ) Replaced the term `actor_init_options` with `ray_actor_options` in [this documentation section](https://docs.ray.io/en/releases-1.13.0/serve/performance.html#choosing-the-right-hardware) because `actor_init_options` is an outdated variable name. It's been changed to `ray_actor_options` in the [code](`2546fbf99d/python/ray/serve/deployment.py (L45)`).	2022-06-15 15:21:24 -05:00
shrekris-anyscale	3278763dd7	[Serve] Start all Serve actors in the `"serve"` namespace only (#25575 )	2022-06-13 10:31:28 -07:00
Sven Mika	ca10530a1a	[Serve; RLlib; Docs] Change terms in Serve+RLlib example (Trainer -> Algorithm). (#25700 )	2022-06-13 11:43:38 +02:00
Simon Mo	271c7d73ac	[AIR][Serve] Add support for multi-modal array input (#25609 )	2022-06-10 09:19:42 -07:00
Simon Mo	7471b1fa41	[Serve] [AIR] ModelWrapper improvements and docs (#25003 ) * batching collation code and tests * wip notebook for np and dataframe * finish content * reset ray-more-libs changes * add comments * run through * Apply suggestions from code review Co-authored-by: shrekris-anyscale <92341594+shrekris-anyscale@users.noreply.github.com> * rename package * lint * richard's comment Co-authored-by: shrekris-anyscale <92341594+shrekris-anyscale@users.noreply.github.com>	2022-06-07 08:53:10 -07:00
kimikuri	60f59bd804	[Serve] Fix misspell in Serve Doc User Guides. (#25494 )	2022-06-06 13:00:20 -07:00
Jiao	aa965ba0a9	[Deployment Graph] Add visualization cookbook (#25112 )	2022-06-06 11:05:58 -07:00
Sven Mika	b5bc2b93c3	[RLlib] Move all remaining algos into `algorithms` directory. (#25366 )	2022-06-04 07:35:24 +02:00
Yi Cheng	fd0f967d2e	Revert "[RLlib] Move (A/DD)?PPO and IMPALA algos to `algorithms` dir and rename policy and trainer classes. (#25346 )" (#25420 ) This reverts commit `e4ceae19ef`. Reverts #25346 linux://python/ray/tests:test_client_library_integration never fail before this PR. In the CI of the reverted PR, it also fails (https://buildkite.com/ray-project/ray-builders-pr/builds/34079#01812442-c541-4145-af22-2a012655c128). So high likely it's because of this PR. And test output failure seems related as well (https://buildkite.com/ray-project/ray-builders-branch/builds/7923#018125c2-4812-4ead-a42f-7fddb344105b)	2022-06-02 20:38:44 -07:00
Sihan Wang	3c9bd66485	[Serve][Doc] Add http endpoint for dag pattern doc (#25390 )	2022-06-02 09:01:37 -07:00
Sven Mika	e4ceae19ef	[RLlib] Move (A/DD)?PPO and IMPALA algos to `algorithms` dir and rename policy and trainer classes. (#25346 )	2022-06-02 16:47:05 +02:00
Yi Cheng	287892657b	Revert "[Serve][Doc] Add http endpoint for dag pattern doc (#25243 )" (#25388 ) This reverts commit `4ad75056eb`.	2022-06-02 02:40:09 +00:00
Sihan Wang	4ad75056eb	[Serve][Doc] Add http endpoint for dag pattern doc (#25243 )	2022-06-01 11:30:42 -07:00
Naka Masato	897cb5d778	[Serve][Doc] Update batch.md to fix typo(#25270 )	2022-05-31 15:04:18 -07:00
Sihan Wang	4de3ce5c25	[Serve][Doc] Add deploy graph about control_flow_based_on_user_inputs pattern doc (#24871 )	2022-05-25 15:38:23 -07:00
Sven Mika	09886d7ab8	[RLlib] Upgrade gym 0.23 (#24171 )	2022-05-23 08:18:44 +02:00

1 2 3 4 5

207 commits