Commit graph

8778 commits

Author SHA1 Message Date
Stephanie Wang
dad8db46e1
[core] Do not spill back tasks blocked on args to blocked nodes (#16488) 2021-07-20 17:13:02 -07:00
Jiao
f4f702c595
[Release] change default expiration to 2 days in order to prevent custodian kill it early morning (#17215)
Co-authored-by: Jiao Dong <jiaodong@anyscale.com>
2021-07-20 17:03:14 -07:00
Eric Liang
877076160e
[data] Enable zero-copy access to underlying Arrow tables (#17192) 2021-07-20 16:38:21 -07:00
Eric Liang
d6e91a5b46
Update PublicAPI annotations #17224 2021-07-20 16:37:53 -07:00
Clark Zinzow
09f32b68d3
[Datasets] Slice off S3 protocol from S3 URIs. (#17219)
* Ensure that S3 protocols are sliced off of S3 URIs.

* Use urllib to parse and trim URI to path.
2021-07-20 15:23:35 -07:00
Clark Zinzow
08a50bf3b7
[Datasets] Allow for Parquet metadata file to be missing. (#17217)
* Allow for Parquet metadata file to be missing.

* Remove for-else.
2021-07-20 15:20:26 -07:00
Jiao
7473f663ef
[Release] change replica to 100 to collect signals now (#17214)
Co-authored-by: Jiao Dong <jiaodong@anyscale.com>
2021-07-20 12:27:56 -07:00
Ian Rodney
db510f317a
[CODEOWNERS] Add Ray Client Proto (#17210)
* add ray client proto

* Add Ameer
2021-07-20 12:23:19 -07:00
Alex Wu
433382f102
Update CODEOWNERS for dashboard snapshot (#17196)
* Update CODEOWNERS for dashboard snapshot

* Update CODEOWNERS

* Update CODEOWNERS
2021-07-20 12:22:52 -07:00
Ian Rodney
e6bf0a8ea6
[autoscaler][docstring] Add Docstring for StandardAutoscaler ctor (#17213) 2021-07-20 12:19:54 -07:00
Sven Mika
5a313ba3d6
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 2021-07-20 14:58:13 -04:00
Patrick Ames
efed07023f
[autoscaler] Custom AWS network interface error condition tests and missing security group bug fix. (#17207) 2021-07-20 11:17:27 -07:00
Jialing He
492076806d
[object store] Assign the object owner in ray.put() (#16833) 2021-07-20 11:06:00 -07:00
Amog Kamsetty
4ece5247d6
[Datasets] to_torch no DataLoader (#17211) 2021-07-20 11:05:17 -07:00
Siyuan (Ryans) Zhuang
859cba7655
[Workflow] Remove namespace in workflow 2021-07-20 11:04:46 -07:00
Yi Cheng
8253064163
[workflow] workflow error handling (#17175) 2021-07-20 11:03:53 -07:00
Chen Shen
055a90374c
[Core] fix erase iterator while iterating over a map. (#17204) 2021-07-20 11:02:55 -07:00
Simon Mo
908aa2c7f3
Fix runtime env and dispatch queue take 2 (#17163) 2021-07-20 10:24:08 -07:00
Jiao
994ff3ce21
[Serve] Add initial large scale tests (#17026) 2021-07-20 08:56:29 -07:00
Kai Yang
f0c148b158
[Core] Simplify the code to read env variables in RayConfig (#16775)
* Simplify the code to read env variables in RayConfig

* simplify

* Correctly print config type

* Change to lower case

* fix template specialization

* lint
2021-07-20 08:40:16 -07:00
SangBin Cho
d6b6356173
[Core] Properly call shutdown instead of deleting a reference (#17096)
* Properly call shutdown instead of deleting a reference

* Add unit tests

* Add test ray shutdown

* Formatting

* format2

* Revert main logic to see if windows issue still fail

* Skip tests for windows.

* formatting

* Try fixing flakiness

* Remove node removed code path

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2021-07-20 08:22:33 -07:00
Antoni Baum
5e9b680e39
[docs] Add LightGBM-Ray docs, update XGBoost-Ray docs (#17188) 2021-07-20 16:06:47 +01:00
Siyuan (Ryans) Zhuang
8efc04a8a6
[Core] Actor namespace (#17178)
* set actor namespace in Python on creation

* get actor with namespace in Python

* update message
2021-07-19 21:51:04 -07:00
matthewdeng
fef74aa94f
[sgd] add placement group support (#17037)
* [sgd] add placement group support

* add logic for removing placement group upon shutdown

* set placement group; add tests

* address comments - add timeout and improve error handling

* remove unused import

* mock SGD_PLACEMENT_GROUP_TIMEOUT_S
2021-07-19 21:50:37 -07:00
Siyuan (Ryans) Zhuang
9ca6bda3a1
[Workflow] Fix recovery storage mismatch issue (#17166)
* fix recovery path issue and add test

* add TODOs
2021-07-19 21:49:12 -07:00
dependabot[bot]
2de7b8f084
[tune](deps): Bump tune-sklearn in /python/requirements/tune (#17173)
Bumps [tune-sklearn](https://github.com/ray-project/tune-sklearn) from 0.3.0 to 0.4.0.
- [Release notes](https://github.com/ray-project/tune-sklearn/releases)
- [Commits](https://github.com/ray-project/tune-sklearn/compare/v0.3.0...v0.4.0)

---
updated-dependencies:
- dependency-name: tune-sklearn
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-07-19 19:42:54 -07:00
Amog Kamsetty
c9522e9a6f
Remove requests from Core Dependencies (#17066)
* remove requests

* update

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* update

* lint

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-19 19:38:29 -07:00
Eric Liang
fabba96fad
Re-merge large function def, skipping test failing on Windows (#17191) 2021-07-19 18:03:26 -07:00
Chen Shen
fe9a6b669c
[nightly-test] add 4-nodes shuffle-data-loader test (#17155) 2021-07-19 17:46:22 -07:00
Chen Shen
b26fcd3fce
fix spill bug (#17187) 2021-07-19 17:44:12 -07:00
Amog Kamsetty
777921b2e7
[dependencies] vendor colorama (#17183) 2021-07-19 16:29:29 -07:00
Eric Liang
d59da075a6
Re-merge TMPDIR support, but only for Linux. OSX requires RAY_TMPDIR (#17190) 2021-07-19 15:45:03 -07:00
SangBin Cho
561dcbd99c
[Test] Fix the permission issue for Dask on Ray multi node sort #17189( #17189) 2021-07-19 14:42:39 -07:00
Patrick Ames
34789b3e56
[autoscaler] Add support for custom EC2 instance network interfaces (#14080)
* [autoscaler] Add support for custom EC2 instance network interfaces.

* [autoscaler] Add unit tests for custom EC2 network interfaces and support for AWS node provider stubs.
2021-07-19 17:21:21 -04:00
Chen Shen
80e013f342
[core] Fix SIGABRT on erase call (#17140) 2021-07-19 11:42:38 -07:00
SangBin Cho
bfc9e5c36f
[Logs] Clean core worker logs (#17033)
* Ready

* Formatting

* Fix

* addressed review.
2021-07-19 11:25:41 -07:00
Amog Kamsetty
cb74053ee5
Retry remove gpustat dependency (#17115)
* remove gpustat

* move psutil imports
2021-07-19 11:14:10 -07:00
Siyuan (Ryans) Zhuang
1fbfbfc55b
[Serializatioin] Bump pickle5 version (#17124) 2021-07-19 10:40:38 -07:00
Siyuan (Ryans) Zhuang
9b110f9344
[Workflow] Update API (#17165)
* actor_id same as workflow_id

* @workflow.actor -> @workflow.virtual_actor

* readonly decorator

* run/run_async for virtual actor

* get_or_create for virtual actor

* update doc

* run/run_async for steps

* update tests

* update comments
2021-07-19 10:19:46 -07:00
Sven Mika
18d173b172
[RLlib] Implement policy_maps (multi-agent case) in RolloutWorkers as LRU caches. (#17031) 2021-07-19 13:16:03 -04:00
Sven Mika
e0640ad0dc
[RLlib] Fix seeding for ES and ARS. (#16744) 2021-07-19 13:13:05 -04:00
architkulkarni
4069686e0f
Revert "Improve error message for oversized function (#17133)" (#17184)
This reverts commit 3e53619d64.
2021-07-19 09:28:33 -07:00
Qing Wang
195cdcf5b8
Fix memory leak in JNI. (#17177)
Co-authored-by: Qing Wang <jovany.wq@antgroup.com>
2021-07-19 14:06:30 +08:00
Amog Kamsetty
8dfd471823
Revert "Revert "[Dashboard][event] Basic event module (#16985)" (#17068)" (#17107)
This reverts commit c17e171f92.

Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-07-18 12:59:04 +08:00
dependabot[bot]
bda1a37e93
[tune](deps): Bump mlflow in /python/requirements/tune (#17168)
Bumps [mlflow](https://github.com/mlflow/mlflow) from 1.17.0 to 1.19.0.
- [Release notes](https://github.com/mlflow/mlflow/releases)
- [Changelog](https://github.com/mlflow/mlflow/blob/master/CHANGELOG.rst)
- [Commits](https://github.com/mlflow/mlflow/compare/v1.17.0...v1.19.0)

---
updated-dependencies:
- dependency-name: mlflow
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-07-17 16:23:10 -07:00
Eric Liang
3e53619d64
Improve error message for oversized function (#17133) 2021-07-17 11:04:05 -07:00
Philipp Moritz
c5c167300b
[Ray debugger] Set breakpoint() hook only for tasks and actors (#17164) 2021-07-17 10:27:51 -07:00
Simon Mo
9da49a7fb8
Revert "ray ignores TMPDIR variable" (#17161)
This reverts commit c27f43d9b8.
2021-07-16 20:28:38 -07:00
Eric Liang
94f17ec099
[RFC] API stability annotations (#17100) 2021-07-16 17:09:20 -07:00
Alex Wu
93c16346bf
[Dataset] imagenet nightly test (#17069) 2021-07-16 14:15:49 -07:00