Commit graph

4030 commits

Author SHA1 Message Date
Richard Liaw
c84073f3f4
[minor] improve warning message for Ray. #14949
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-29 15:17:32 -07:00
Akash Patel
208cde8d9b
Add support for Python 3.9 (#12613) 2021-03-29 11:57:06 -07:00
Edward Oakes
fe39c88a57
skip on windows (#14988) 2021-03-29 10:06:25 -07:00
Edward Oakes
e79d4cf6f5
[serve] Support setting deployment options via kwargs (#14935) 2021-03-29 11:14:27 -05:00
Amog Kamsetty
95ff342558
[Tune] Wandb API Key File Compatibility with Ray Client (#14942) 2021-03-29 15:39:54 +02:00
dependabot[bot]
68c82b6503
[tune](deps): Bump wandb from 0.10.19 to 0.10.23 in /python/requirements (#14964)
Bumps [wandb](https://github.com/wandb/client) from 0.10.19 to 0.10.23.

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-03-29 15:37:56 +02:00
Siyuan (Ryans) Zhuang
87c79553e9
[Core] Remove code paths that contains plasma store executable (#14950)
* remove plasma store executable & never used tests

* set default behavior

* fix tests
2021-03-28 21:22:14 -07:00
Micah Yong
b3089b31f2
[RFC] Ray memory improvements: format and summary (#14520)
* Better formatting when terminal size doesn't support tabular

* Summary now displays size of reference types

* Add unit conversion support (e.g. b, kb, mb, gb)

* Format and test

* Add ability to specify the number of sorted entries

* Linting

* Clean up group summary, move import defaultdict, comment num entries counter, n

* Format and lint
2021-03-28 21:03:06 -07:00
Dmitri Gekhtman
dcf41d868c
[autoscaler][Kubernetes] Fix non_terminated_nodes consistency (#14976)
* Verify pod termination

* deletion-timestamp

* get rid of extra constant
2021-03-28 14:52:12 -07:00
Frank Luan
cdbaf930ab
[metrics] Fix deserialization warnings for metrics.Counter (#14969) 2021-03-28 09:44:30 -05:00
Edward Oakes
fd4ed3acfe
[serve] Skip failing test_deploy tests on windows (#14957) 2021-03-26 13:51:54 -05:00
SangBin Cho
839cd1e0a2
[Core] Remove unnecessary redis connection (#14511)
* remove unnecessary stuff.

* test in progress.

* Fix tests.

* lint

* fix.

* Remove tests that were not working properly before.
2021-03-26 10:29:12 -07:00
Eric Liang
2157021fd3
Refactor object restoration path (#14821) 2021-03-25 22:46:50 -07:00
tchordia
4e66efc532
Update ARCHITECTURE.md (#14889)
update link
2021-03-25 12:30:35 -07:00
Edward Oakes
63594c5370
[serve] Rolling updates for redeployments (#14803) 2021-03-25 12:23:08 -05:00
Simon Mo
1fcca07856
[Serve] FastAPI Simple Class Based View (#14858) 2021-03-25 12:21:36 -05:00
Kai Fricke
b366500938
[tune] fix long running release test WIP (#14866)
- Use placement groups
- Introduce time between checks for failure testing
- Use gloo instead of nccl
2021-03-25 11:03:22 +01:00
Kai Fricke
84b3c3376b
[tune] document scalability best practices (k8s, scalability thresholds) (#14566)
Adds a new page and table to document current scalability thresholds in Ray Tune to the documentation.

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-25 09:54:14 +01:00
SangBin Cho
0004d87194
[Test] Refactor object spilling test (#14861)
* refactoring done.

* refactoring done.
2021-03-25 00:46:46 -07:00
architkulkarni
03afaed6e1
[Serve] [Doc] Create top-level page for Calling Endpoints from HTTP and from Python (#14904) 2021-03-24 20:29:24 -05:00
Dmitri Gekhtman
25ebefafc8
[autoscaler][aws][test] Validate current state of subnet-specification (#14859)
* This PR adds a test that validates that adding head_nodes and worker_nodes fields with subnet data to a multi-node-type config leads to a correct configuration of a security group.
2021-03-25 01:40:16 +02:00
Yi Cheng
f427801c10
Revert "[core] Fix worker type in python (#14823)" (#14910)
This reverts commit 9ccf291f4d.
2021-03-24 13:27:56 -07:00
Simon Mo
d57808d007
[Serve] Add support for handle.method_name.remote (#14831) 2021-03-24 12:10:14 -07:00
Edward Oakes
59e231818d
[serve] Add Deployment.delete() and un-skip relevant tests (#14898) 2021-03-24 13:40:30 -05:00
SangBin Cho
ec3cfef883
Revert "[Core] Implement long polling-based pubsub to reduce number of WaitForObjectEviction requests in flight. (#14638)" (#14905)
This reverts commit 35ec91c4e0.
2021-03-24 11:22:48 -07:00
Clark Zinzow
ed46d8bf45
[Core] Added ownership-based object directory metrics, fixed raylet metric bug. (#14855)
* Added ownership-based object directory metrics.

* Updated OBOD metric descriptions.

* Dump OBOD metrics in debug string.

* Added e2e tests for metrics.
2021-03-24 10:53:22 -07:00
Edward Oakes
ade5857aa3
[serve] Duplicate test_api for new declarative codepath (#14877) 2021-03-24 11:29:35 -05:00
SangBin Cho
35ec91c4e0
[Core] Implement long polling-based pubsub to reduce number of WaitForObjectEviction requests in flight. (#14638)
* in progress.

* IN progress.

* lint.

* Updated code

* lint.

* In progress of writing tets.

* Finished implementation. Need cleanup & refactoring.

* fixing tests...

* Finish the impl.

* Fix typo.

* impl done. Only cleanup left.

* done.

* Finished clean up.

* Fix issues.

* Add a stronger consistency check.

* Addressed code review.

* lint.

* done.

* Addressed more.

* addressed all reviews.

* Addressed code review.

* lint.

* Added unit tests to assert no leak.
2021-03-23 23:47:08 -07:00
Stephanie Wang
201ebc3f92
Revert "[core] Set a configurable max memory for fetched objects (#14817)" (#14887)
This reverts commit 8769953474.
2021-03-23 21:58:11 -07:00
Dmitri Gekhtman
10458564a3
[autoscaler][aws] Use subnets in only one VPC (#14868) 2021-03-23 21:16:25 -07:00
fyrestone
52cfa1cdd7
Fix load code from local (#12102) 2021-03-24 11:49:58 +08:00
Kai Fricke
898243d538
[tune] Limit maximum number of pending trials. Add convergence test. (#14835) 2021-03-23 18:19:41 -07:00
Ian Rodney
5d763b3f49
[Autoscaler][Docker] Mention nightly images in addition to nightly wheels (#14871) 2021-03-23 17:13:36 -07:00
Ian Rodney
3869c9d342
[Test] Fix Test-Dashboard (#14874) 2021-03-23 15:50:15 -07:00
Edward Oakes
36bce2d3af
[flaky_test][windows] Disable test_actor_resources::test_creating_more_actors_than_resources on Windows (#14879) 2021-03-23 15:45:10 -07:00
Stephanie Wang
8769953474
[core] Set a configurable max memory for fetched objects (#14817)
* Set threshold, tests

* comment

* move max to pull manager

* unit test

* fix plasma

* comment
2021-03-23 13:55:02 -07:00
Edward Oakes
7a1f85ab38
[serve] Add initial support for @serve.deployment syntax (#14869) 2021-03-23 15:07:43 -05:00
Ian Rodney
9775ad2f25
[Autoscaler] Correct typing for Event Handler execute_callback (#14718) 2021-03-23 12:57:49 -07:00
Ian Rodney
4fb4b96a4f
[Autoscaler] Make setup_commands in defaults.yaml Docker Compatible (#14804) 2021-03-23 12:52:15 -07:00
Xianyang Liu
a3eb64311e
[sgd] Fixes TrainingOperator wrap model incorrect (#14353)
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-03-23 12:19:32 -07:00
Ian Rodney
e0f31bf035
[Disable Tests] New Dashboard Tests (#14845) 2021-03-23 07:49:54 -07:00
Hao Chen
9053be0e63
Fix 'dictionary changed size during iteration' in _available_resources_per_node (#14839) 2021-03-23 17:44:05 +08:00
Sven Mika
f859ebb99f
[RLlib] Fix env rendering and recording options (for non-local mode; >0 workers; +evaluation-workers). (#14796) 2021-03-23 10:06:06 +01:00
Yi Cheng
9ccf291f4d
[core] Fix worker type in python (#14823)
* Fix

* format

* more

* format
2021-03-23 00:58:57 -07:00
Ian Rodney
95e8e8c049
[Dependencies] Install Ray Full in Autoscaler (#14851) 2021-03-22 20:57:16 -07:00
Ian Rodney
ce18427017
[HotFix] Fix Colorful mock for with_style (#14848) 2021-03-22 14:11:03 -07:00
Simon Mo
2f25c7d32c
[Core] Remove legacy code path for unhandled exception in asyncio (#14828) 2021-03-22 13:09:48 -07:00
SangBin Cho
87877cdfbf
[Test] Fix flaky object spilling test (#14722)
* start

* done.

* d

* d

* Push the fix.

* done.

* Enable test.
2021-03-22 12:51:47 -07:00
Simon Mo
21b5ab6739
[Serve] Raise error when endpoint fail to register (#14827) 2021-03-22 11:40:58 -07:00
andrew-rosenfeld-ts
d6933da423
[tune] fixing Trainable.__init__ docstring reference to build() #14824 (#14850)
Co-authored-by: Andrew Rosenfeld <rosnfeld@gmail.com>
2021-03-22 18:53:49 +01:00