Commit graph

1580 commits

Author SHA1 Message Date
Amog Kamsetty
f71cfca439 [SGD] Retry sgd.local_rank() (#18824)
* finish

* fix

* wip

* address comment

* update

* fix test

* fix failing test

* address comments

* fix test

* fix
2021-09-23 14:12:15 -07:00
Simon Mo
5e9cb232c7 [Serve] Doc: Mock ray.serve.generated package for doc building (#18767) 2021-09-20 15:42:53 -07:00
Eric Liang
85aaca8d45
Update the contribution guide / style guide (#18753) 2021-09-19 20:14:51 -07:00
xwjiang2010
5551cdac19
[Tune] Break from loop after warning msg is logged. (#18720) 2021-09-18 16:33:44 -07:00
Jiajun Yao
ffe7108eae
Fix cpp api doc (#18671) 2021-09-17 14:01:23 -07:00
Antoni Baum
7e95f330d5
[ci] Fix xgboost_ray install from git (#18640) 2021-09-15 18:07:15 +01:00
Antoni Baum
eeb67a42cc
pip install xgboost_ray -> xgboost_ray[default] (#18607)
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-09-15 14:45:56 +01:00
qicosmos
15881acffd
[C++ Worker]Update cpp worker doc (#18537) 2021-09-15 14:11:17 +08:00
Eric Liang
d1f348cd9d
[RFC] Split the list of libraries into ML vs production 2021-09-14 16:32:07 -07:00
matthewdeng
380a653787
[SGD] update SGDv2 user guide docs (#18270)
* [SGD] update SGDv2 user guide docs

* Update doc/source/raysgd/v2/user_guide.rst

Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>

* add new line

* update docs

* fix header line length

* lint

* lint

* lint

* lint

* fix remaining lint issues

* Update doc/source/raysgd/v2/user_guide.rst

Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>

* Update doc/source/raysgd/v2/user_guide.rst

Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>

* address comments

* address comments

* add TODO for iterator API

* Update doc/source/raysgd/v2/user_guide.rst

Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>

* address comments

* address comments

* add tune doc

* restructure table of contents

* add examples; rename example files to include example suffix

* add quick start, porting code

* address comments

Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2021-09-14 09:07:25 -07:00
Yi Cheng
7d1f408de9
[workflow] Move experimental/workflow to workflow (#18521) 2021-09-13 17:45:18 -07:00
Stephanie Wang
284dee493e
[core][usability] Disambiguate ObjectLostErrors for better understandability (#18292)
* Define error types, throw error for ObjectReleased

* x

* Disambiguate OBJECT_UNRECONSTRUCTABLE and OBJECT_LOST

* OwnerDiedError

* fix test

* x

* ObjectReconstructionFailed

* ObjectReconstructionFailed

* x

* x

* print owner addr

* str

* doc

* rename

* x
2021-09-13 16:16:17 -07:00
Eric Liang
53a2a47655
Polish workflows doc, add semantics and best practices for sub-workflows (#18525) 2021-09-12 11:08:06 -07:00
Yi Cheng
370473fc5f
[workflow] Update documentation (#18522) 2021-09-11 13:40:09 -07:00
Yi Cheng
965c55fe1b
[workflow] set max retry to 3 (#18477) 2021-09-09 23:03:24 -07:00
Eric Liang
4d2065352b
Increase dataset read parallelism by default (#18420) 2021-09-09 15:07:49 -07:00
Kai Fricke
395976c8a1
[tune] Never block for results (#18391)
* [tune] Never block for results

* Fix tests

* Block in tests

* Add comment to test
2021-09-09 12:08:00 -07:00
mwtian
26fd10c9e8
[CI] Add clang-tidy to lint (#18124)
* clang-tidy

* fix

* fix script

* test clang compiler

* fix clang-tidy rules

* Fix windows and other issues.

* Fix

* Improve information when running check-git-clang-tidy-output.sh on different OS
2021-09-09 00:41:53 -07:00
Clark Zinzow
b30c41759d
[Datasets] Adds tensor column support (tensors-in-tables) via Pandas/Arrow extension types/arrays. (#18301) 2021-09-08 10:09:01 -07:00
Eric Liang
cbdafa0b63
[doc] Fix various workflow doc bugs (#18357) 2021-09-06 01:39:08 -07:00
Richard Liaw
0594deafdf
[tune] allow users to configure bootstrap for docker syncer (#17786) 2021-09-05 22:04:31 -07:00
Richard Liaw
93f7976215
[docs/deps] Clean up dependency ux/docs #18360
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-09-05 22:03:32 -07:00
Eric Liang
c4199a8054
Add more workflow comparisons (#18347) 2021-09-03 19:26:33 -07:00
xwjiang2010
01adf030ec
[Tune] Raise Error when there are insufficient resources. (#17957) 2021-09-03 10:49:54 -07:00
Eric Liang
7dcae690b9
Mark datasets as still in alpha for now (#18321) 2021-09-02 17:07:33 -07:00
SangBin Cho
9b9eae1e86
Change misleading documentation from the placement group (#18257)
* Modify a doc

* completed
2021-09-02 16:40:48 -07:00
Antoni Baum
4c95ea6d0a
[client] Improve Ray Client connection timeout information (#18281)
* Improve Ray Client connection timeout information

* fix lint issue.

Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>
2021-09-02 16:34:11 +03:00
matthewdeng
a3123b6860
[SGD] v2 Horovod backend (#18047)
* [SGD] add Horovod backend

* address comments: set CUDA_VISIBLE_DEVICES, refactor code

* fix gpu test

* fix lint/test import

* address comments, add example cluster config

* delay horovod imports
2021-08-31 12:54:59 -07:00
Wesley Gifford
6133a561e9
Dataset from modin (#18122) 2021-08-31 11:19:35 -07:00
xwjiang2010
63f00843f3
[Tune] Inform users of the setup needed for uploading results to cloud. (#18220) 2021-08-31 10:27:50 -07:00
Ryan L. Melvin
c081c68de7
[tune] Conditional search space example using hyperopt (#18130)
Co-authored-by: Ryan Melvin <rmelvin@uabmc.edu>
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
2021-08-31 17:06:22 +02:00
Tim Hopper
fd2a8a6b9c
[docs] Fix broken urls (#18206) 2021-08-30 17:24:06 -07:00
Amog Kamsetty
3b77840c1b
PyTorch Lightning Updates (#17876) 2021-08-27 23:15:51 -07:00
Eric Liang
95b5ad12ba
Initial version of workflow documentation (#18138) 2021-08-27 16:20:48 -07:00
Clark Zinzow
c0598de82a
[Datasets] Port write APIs to use file-based datasources. (#18135) 2021-08-27 15:24:54 -07:00
Clark Zinzow
aee7ba2510
[Datasets] Add from_numpy() and to_numpy() APIs (#18146) 2021-08-27 13:33:11 -07:00
architkulkarni
ea4f54f8ef
[Serve] [doc] Add model URI to deployment example (#18085) 2021-08-26 11:37:32 -05:00
Eric Liang
e1f69ceb5e
Add documentation for DatasetPipeline.from_iterable (#18106) 2021-08-25 22:31:23 -07:00
Eric Liang
451c501bbd
[doc] Add API stability annotations to developer guide (#18052) 2021-08-25 13:34:43 -07:00
Sasha Sobol
fcb044d47c
[autoscaler] make 0 default min/max workers for head node (#17757)
* make 0 default min/max workers for head node

* fix helm charts, test, defaults for head

* fix test, docs

* make 0 default min/max workers for head node

* fix helm charts, test, defaults for head

* fix test, docs

* comments. logging

* better wording (logs)

Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>

* fix logging message

* fix max workers in raycluster.yaml

* use default values of 0 for min/max workders in a helm chart

* add missing line back

Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>
2021-08-25 14:56:20 -04:00
Nikita Vemuri
a9c731edd3
[serve] Remove requirement to specify namespace for serve.start(detached=True) (#17470) 2021-08-25 10:39:32 -05:00
gjoliver
9c7456933c
[RLlib] Small parallel iterator doc fix. (#18043) 2021-08-24 19:57:58 +02:00
Antoni Baum
88163c4755
[docs] Add a TPU example to the docs (#17959)
* Add a TPU example to the docs

* Add a line about TPU API

* Add link to TPU pods

* Clarify
2021-08-24 10:08:26 -07:00
architkulkarni
f9de05d44c
[Doc] fix incorrect conda example in runtime env doc (#17972) 2021-08-23 13:44:04 -07:00
Clark Zinzow
5ca28b1cc8
[Core] Update Bazel (to 3.4.1), gRPC, boringssl, and absl as a precursor to gRPC streaming PR. (#17903)
* Update Bazel (to 3.4.1), gRPC, boringssl, absl.

* Always reinstall Bazel if needing to upgrade to a new Bazel version.

* Add patch for properly detecting Windows Python headers when building gRPC.

* Add minimum Bazel version check.

* Update docs with new Bazel version.
2021-08-21 11:33:11 -07:00
77loopin
c6b24fcb5d
[RayClient] Add the guide for k8s Ingress (#17736)
Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>
Co-authored-by: seungjaebaek <seungjaebaek@linecorp.com>
2021-08-20 18:31:03 -07:00
Edward Oakes
30541025e5
[serve] Remove deprecated APIs from code & docs (#17754) 2021-08-20 11:59:45 -05:00
architkulkarni
36c26578a7
[runtime env] [test] Add nightly test to verify Ray wheel URLs are valid (#17938) 2021-08-19 15:48:37 -07:00
Clark Zinzow
d958457d07
[Core] Second pass at privatizing APIs. (#17885)
* gcs_utils

* resource_spec

* profiling

* ray_perf and ray_cluster_perf

* test_utils
2021-08-18 20:56:33 -07:00
architkulkarni
4c6a695dab
[Doc] Runtime env docstring fix monospace formatting (#17929) 2021-08-18 20:53:41 -07:00