Commit graph

4670 commits

Author SHA1 Message Date
SangBin Cho
d6b6356173
[Core] Properly call shutdown instead of deleting a reference (#17096)
* Properly call shutdown instead of deleting a reference

* Add unit tests

* Add test ray shutdown

* Formatting

* format2

* Revert main logic to see if windows issue still fail

* Skip tests for windows.

* formatting

* Try fixing flakiness

* Remove node removed code path

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2021-07-20 08:22:33 -07:00
Antoni Baum
5e9b680e39
[docs] Add LightGBM-Ray docs, update XGBoost-Ray docs (#17188) 2021-07-20 16:06:47 +01:00
Siyuan (Ryans) Zhuang
8efc04a8a6
[Core] Actor namespace (#17178)
* set actor namespace in Python on creation

* get actor with namespace in Python

* update message
2021-07-19 21:51:04 -07:00
matthewdeng
fef74aa94f
[sgd] add placement group support (#17037)
* [sgd] add placement group support

* add logic for removing placement group upon shutdown

* set placement group; add tests

* address comments - add timeout and improve error handling

* remove unused import

* mock SGD_PLACEMENT_GROUP_TIMEOUT_S
2021-07-19 21:50:37 -07:00
Siyuan (Ryans) Zhuang
9ca6bda3a1
[Workflow] Fix recovery storage mismatch issue (#17166)
* fix recovery path issue and add test

* add TODOs
2021-07-19 21:49:12 -07:00
dependabot[bot]
2de7b8f084
[tune](deps): Bump tune-sklearn in /python/requirements/tune (#17173)
Bumps [tune-sklearn](https://github.com/ray-project/tune-sklearn) from 0.3.0 to 0.4.0.
- [Release notes](https://github.com/ray-project/tune-sklearn/releases)
- [Commits](https://github.com/ray-project/tune-sklearn/compare/v0.3.0...v0.4.0)

---
updated-dependencies:
- dependency-name: tune-sklearn
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-07-19 19:42:54 -07:00
Amog Kamsetty
c9522e9a6f
Remove requests from Core Dependencies (#17066)
* remove requests

* update

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* update

* lint

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-19 19:38:29 -07:00
Eric Liang
fabba96fad
Re-merge large function def, skipping test failing on Windows (#17191) 2021-07-19 18:03:26 -07:00
Amog Kamsetty
777921b2e7
[dependencies] vendor colorama (#17183) 2021-07-19 16:29:29 -07:00
Eric Liang
d59da075a6
Re-merge TMPDIR support, but only for Linux. OSX requires RAY_TMPDIR (#17190) 2021-07-19 15:45:03 -07:00
Patrick Ames
34789b3e56
[autoscaler] Add support for custom EC2 instance network interfaces (#14080)
* [autoscaler] Add support for custom EC2 instance network interfaces.

* [autoscaler] Add unit tests for custom EC2 network interfaces and support for AWS node provider stubs.
2021-07-19 17:21:21 -04:00
Amog Kamsetty
cb74053ee5
Retry remove gpustat dependency (#17115)
* remove gpustat

* move psutil imports
2021-07-19 11:14:10 -07:00
Siyuan (Ryans) Zhuang
1fbfbfc55b
[Serializatioin] Bump pickle5 version (#17124) 2021-07-19 10:40:38 -07:00
Siyuan (Ryans) Zhuang
9b110f9344
[Workflow] Update API (#17165)
* actor_id same as workflow_id

* @workflow.actor -> @workflow.virtual_actor

* readonly decorator

* run/run_async for virtual actor

* get_or_create for virtual actor

* update doc

* run/run_async for steps

* update tests

* update comments
2021-07-19 10:19:46 -07:00
architkulkarni
4069686e0f
Revert "Improve error message for oversized function (#17133)" (#17184)
This reverts commit 3e53619d64.
2021-07-19 09:28:33 -07:00
dependabot[bot]
bda1a37e93
[tune](deps): Bump mlflow in /python/requirements/tune (#17168)
Bumps [mlflow](https://github.com/mlflow/mlflow) from 1.17.0 to 1.19.0.
- [Release notes](https://github.com/mlflow/mlflow/releases)
- [Changelog](https://github.com/mlflow/mlflow/blob/master/CHANGELOG.rst)
- [Commits](https://github.com/mlflow/mlflow/compare/v1.17.0...v1.19.0)

---
updated-dependencies:
- dependency-name: mlflow
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-07-17 16:23:10 -07:00
Eric Liang
3e53619d64
Improve error message for oversized function (#17133) 2021-07-17 11:04:05 -07:00
Philipp Moritz
c5c167300b
[Ray debugger] Set breakpoint() hook only for tasks and actors (#17164) 2021-07-17 10:27:51 -07:00
Simon Mo
9da49a7fb8
Revert "ray ignores TMPDIR variable" (#17161)
This reverts commit c27f43d9b8.
2021-07-16 20:28:38 -07:00
Eric Liang
94f17ec099
[RFC] API stability annotations (#17100) 2021-07-16 17:09:20 -07:00
Alex Wu
93c16346bf
[Dataset] imagenet nightly test (#17069) 2021-07-16 14:15:49 -07:00
Eric Liang
c27f43d9b8
ray ignores TMPDIR variable (#17130) 2021-07-16 13:23:44 -07:00
Clark Zinzow
8302b5a335
[Core] Reverts full dispatch queue iteration PRs. (#17127)
* Revert "[Core] iterate over entire dispatch queue instead of returning when worker unavailable (#16535)"

This reverts commit 54d66ac637.

* Revert "[Core] [runtime env] [Tests] Add C++ unit test for dispatch queue nonblocking behavior (#16751)"

This reverts commit 13a133817b.

* Revert failing runtime_env test.
2021-07-16 10:28:00 -07:00
Siyuan (Ryans) Zhuang
fd3742bb63
[Workflow] Update tests (#17147)
* update workflow tests

* use conftest

* use conftest

* use conftest
2021-07-16 09:25:40 -07:00
Eric Liang
40f1ee6e1b
Fix prefetch typo in Dataset (#17143) 2021-07-16 09:22:42 -07:00
fyrestone
e2808a35cf
Dashboard job module uses attrs instead of pydantic for job description (#17116) 2021-07-16 22:26:00 +08:00
Kai Fricke
49b72eec16
[tune] filter placement group resources if not in use (#16996) 2021-07-16 00:35:04 -07:00
Zhi Lin
6d9fb421c6
[tracing] Do not wrap when tracing is not enabled (#16607) 2021-07-16 00:27:54 -07:00
Eric Liang
f03b43c532
[dataset] Support callable classes to simplify state initialization (#17136) 2021-07-15 23:06:14 -07:00
SongGuyang
dcb1baabd7
[C++ API] support loading C++ dynamic libraries from code search path (#16828) 2021-07-16 13:02:45 +08:00
SongGuyang
a57de0e224
support build different python wheel in setup.py (#16998) 2021-07-16 13:01:48 +08:00
Eric Liang
e69987bc96
Improve dataset error message (#17129) 2021-07-15 20:58:15 -07:00
Edward Oakes
90a1667b29
[debugger] Clean up breakpoint state for dead jobs (#17095) 2021-07-15 22:20:09 -05:00
Ian Rodney
23d43919cd
[Client] Except TypeError when Deserializing (#17035)
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-07-15 19:10:21 -07:00
Amog Kamsetty
860addeafa
Remove pyspy dependency (#17061)
* remove pyspy

* fix

* update-for-posterity

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* update

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* Update setup.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-15 17:36:42 -07:00
Yi Cheng
138676295f
[core] Add bundle id as a label; (#16819)
* check

* up

* up

* up

* up

* up

* up

* format

* up

* up

* add test

* format

* up

* format

* up

* format

* up

* up

* up

* rollback

* uncomment

* format

* fix comments

* fix mac build
2021-07-15 16:05:42 -07:00
Amog Kamsetty
d607a894de
[Autoscaler] Remove jsonschema from core dependencies (#17052) 2021-07-15 13:56:44 -07:00
Siyuan (Ryans) Zhuang
dd1427548c
[Workflow] Readonly Virtual Actor (#16963)
* readonly virtual actor

* create and get actor

* add TODO

* mapping between actor ID and workflow ID

* update doc

* ensure storage serializable

* get_latest_output

* update storage

* tests
2021-07-15 13:44:51 -07:00
Amog Kamsetty
6ff4d1ddb1
[Datasets] to_torch implementation (#17113) 2021-07-15 13:02:07 -07:00
Stephanie Wang
bdaa96bf43
[core] Fix bugs in worker cleanup on driver exit (#17049)
* unit test

* cleanup test

* Don't kill workers when job finishes

* better test

* lint

* lint

* comment

* check
2021-07-15 12:53:51 -07:00
Qing Wang
d4635836ba
Port python API on get_current_actor_handle. (#17110)
* Port python API on get current actor handle.

* Address comment.
2021-07-15 11:22:46 -07:00
Eric Liang
3d764d7b4b
[data] Fix the ObjectRef type in the dataset docs (#17111)
* fix reft

* remove exp

* fix
2021-07-15 09:50:37 -07:00
architkulkarni
95a7c28ed5
[Core] [runtime env] Use global lock for conda install instead of per-env lock (#17101) 2021-07-15 11:33:30 -05:00
architkulkarni
8ece30246f
[Core] [runtime env] [Test] Partially deflake test_runtime_env_complicated by bumping 0.1s timeout to 0.5s (#17109) 2021-07-15 11:26:37 -05:00
Edward Oakes
58f62dbc52
[flaky test] Skip some flaky list_named_actors tests on windows (#17087) 2021-07-15 08:40:54 -07:00
Antoni Baum
f20311f194
[tune] ResourceChangingScheduler improvements (#17082) 2021-07-15 15:03:27 +01:00
Sven Mika
649580d735
[RLlib] Redo simplify multi agent config dict: Reverted b/c seemed to break test_typing (non RLlib test). (#17046) 2021-07-15 05:51:24 -04:00
Clark Zinzow
915c426515
[Dataset] Fix S3FileSystem subsystem initialization on deserialization. (#17103)
* Add S3FileSystem wrapper that initializes the S3 subsystem on deserialization, use it for file-based datasources.

* Use S3FileSystem wrapper for read_binary_files.
2021-07-14 23:32:48 -07:00
Eric Liang
38bddc3f2b
First cut at dataset documentation (#16956) 2021-07-14 23:27:13 -07:00
Chris K. W
bd9d7bbbaa
[client] Add support for protocol (ray://, local://, custom://) to ray.init (#16946) 2021-07-14 21:45:46 -07:00