Commit graph

1218 commits

Author SHA1 Message Date
Siyuan (Ryans) Zhuang
d1e9887be2
[Serialization] New custom serialization API (#13291)
* new serialization API with doc & test

* add more notes

* refine notes

* doc
2021-01-14 13:15:31 -08:00
PENG Zhenghao
e63da54931
[docs] Add more guideline on using ray in slurm cluster (#12819)
Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>
Co-authored-by: PENG Zhenghao <pengzh@ie.cuhk.edu.hk>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-01-14 12:17:53 -08:00
Amog Kamsetty
560299972c
Revert "Enable Ray client server by default (#13350)" (#13429)
This reverts commit 912d0cbbf9.
2021-01-13 21:28:54 -08:00
Simon Mo
44acbdd82a
[Serve] [Doc] Improve batching doc (#13389)
2021-01-13 14:39:42 -08:00
Eric Liang
912d0cbbf9
Enable Ray client server by default (#13350)
* update

* fix

* fix test

* update
2021-01-12 21:31:01 -08:00
Simon Mo
8e0a2f669b
[Doc] Remove trailing whitespaces (#13390)
2021-01-12 20:35:38 -08:00
architkulkarni
e560933f9c
[Serve] Add dependency management support for driver not running in a conda env (#13269)
2021-01-12 09:57:15 -08:00
Kai Fricke
518427627b
[tune] buffer trainable results (#13236)
* Working prototype

* Pass buffer length, fix tests

* Don't buffer per default

* Dispatch and process save in one go, added tests

* Fix tests

* Pass adaptive seconds to train_buffered, stop result processing after STOP decision

* Fix tests, add release test

* Update tests

* Added detailed logs for slow operations

* Update python/ray/tune/trial_runner.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Apply suggestions from code review

* Revert tests and go back to old tuning loop

* nit

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-01-12 18:52:47 +01:00
Dmitri Gekhtman
7166949194
[Kubernetes][Docs] GPU usage (#13325)
* gpu-note

* gpu-note

* More info

* lint?

* Update doc/source/cluster/kubernetes.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/cluster/kubernetes.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/cluster/kubernetes.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/cluster/kubernetes.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* GKE->Kubernetes

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-01-11 21:36:31 -08:00
Edwin Goh
a5ddc27bab
Fix typo in Tune Docs (Checkpointing) (#13348)
See issue #13299
2021-01-11 20:27:18 -08:00
Amog Kamsetty
0452a3a435
[Tune] Rename MLFlow to MLflow (#13301)
2021-01-11 17:36:55 -08:00
Sven Mika
9dd9f72111
[RLlib] Add more detailed Documentation on Model building API (#13261)
2021-01-09 12:38:29 +01:00
Michael Luo
67229bf350
[RLlib] SlateQ Documentation (#13266)
2021-01-09 11:21:51 +01:00
Barak Michener
eb6f403b97
[ray_client]: first draft of documentation (#13216)
2021-01-08 15:38:36 -08:00
Kai Fricke
97211a6170
[Tune] Fix tune serve integration example (#13233)
2021-01-06 17:02:04 +01:00
Simon Mo
39813ff6b0
[Serve] HTTPOptions for deployment modes (#13142)
2021-01-05 16:41:52 -08:00
Eric Liang
dfb326d4b5
Surface object store spilling statistics in ray memory (#13124)
2021-01-04 17:35:39 -08:00
Amog Kamsetty
15e86581bd
[XGboost] Update Documentation (#13017)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-01-04 17:21:04 -08:00
Dmitri Gekhtman
31453621ef
[kubernetes][docs][minor] Kubernetes version warning (#13161)
2021-01-04 10:29:17 -06:00
architkulkarni
a95275bdd9
[Serve] [Doc] Add existing web server integration ServeHandle tutorial (#13127)
2021-01-04 10:28:34 -06:00
Ameer Haj Ali
61c3b6d3bf
[docs] Small fix in C++ documentation. (#13154)
* prepare for head node

* move command runner interface outside _private

* remove space

* Eric

* flake

* min_workers in multi node type

* fixing edge cases

* eric not idle

* fix target_workers to consider min_workers of node types

* idle timeout

* minor

* minor fix

* test

* lint

* eric v2

* eric 3

* min_workers constraint before bin packing

* Update resource_demand_scheduler.py

* Revert "Update resource_demand_scheduler.py"

This reverts commit 818a63a2c86d8437b3ef21c5035d701c1d1127b5.

* reducing diff

* make get_nodes_to_launch return a dict

* merge

* weird merge fix

* auto fill instance types for AWS

* Alex/Eric

* Update doc/source/cluster/autoscaling.rst

* merge autofill and input from user

* logger.exception

* make the yaml use the default autofill

* docs Eric

* remove test_autoscaler_yaml from windows tests

* lets try changing the test a bit

* return test

* lets see

* edward

* Limit max launch concurrency

* commenting frac TODO

* move to resource demand scheduler

* use STATUS UP TO DATE

* Eric

* make logger of gc freed refs debug instead of info

* add cluster name to docker mount prefix directory

* grrR

* fix tests

* moving docker directory to sdk

* move the import to prevent circular dependency

* small fix

* ian

* fix max launch concurrency bug to assume failing nodes as pending and consider only load_metric's connected nodes as running

* small fix

* deflake test_joblib

* lint

* placement groups bypass

* remove space

* Eric

* first commit

* lint

* example

* documentation

* hmm

* file path fix

* fix test

* some format issue in docs

* modified docs

Co-authored-by: Ameer Haj Ali <ameerhajali@ameers-mbp.lan>
Co-authored-by: Alex Wu <alex@anyscale.io>
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
Co-authored-by: Eric Liang <ekhliang@gmail.com>
Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
Co-authored-by: root <root@ip-172-31-56-188.us-west-2.compute.internal>
2021-01-02 11:47:06 -08:00
Ameer Haj Ali
27cbac576d
[docs] Minor change to formatting C++ docs. (#13151)
2021-01-01 19:43:59 -08:00
Ameer Haj Ali
710615c228
[docs] Documentation + example for the C++ language API (#13138)
2021-01-01 18:18:41 -08:00
Sven Mika
9eba1871bb
[RLlib] Support easy use_attention=True flag for using the GTrXL model. (#11698)
2021-01-01 14:06:23 -05:00
Dmitri Gekhtman
4ca64549e2
[docs][kubernetes][minor] Update K8s examples in docs (#13129)
2020-12-31 16:25:38 -06:00
Sven Mika
391cdfae8c
[RLlib] Trajectory view API docs. (#12718)
2020-12-30 17:32:21 -08:00
Sumanth Ratna
59e9b80903
[Doc] Fix Sphinx.add_stylesheet deprecation (#13067)
2020-12-29 16:35:40 -08:00
architkulkarni
032a6546d5
Serve metrics docs (#13096)
2020-12-29 14:03:34 -06:00
Lavanya Shukla
350917958c
[docs] fix wandb url (#13094)
2020-12-28 17:19:17 -08:00
Eric Liang
836c5d5a91
Deprecate experimental / dynamic resources (#13019)
2020-12-28 11:52:36 -08:00
architkulkarni
9a0218fb89
[Serve] [Doc] Front page update (#13032)
2020-12-28 10:19:36 -08:00
Antoni Baum
a4f2dd2138
[Tune] Add integer loguniform support (#12994)
* Add integer quantization and loguniform support

* Fix hyperopt qloguniform not being np.log'd first

* Add tests, __init__

* Try to fix tests, better exceptions

* Tweak docstrings

* Type checks in SearchSpaceTest

* Update docs

* Lint, tests

* Update doc/source/tune/api_docs/search_space.rst

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2020-12-23 09:27:16 -08:00
Simon Mo
bc68260144
[Serve] Handle Bug Fixes (#12971)
2020-12-22 19:13:16 -08:00
Eric Liang
81d3cbaa77
Add "beta" documentation for enabling object spilling manually (#13047)
2020-12-22 16:08:41 -08:00
architkulkarni
8b4b4bf0a2
[Serve] Migrate from Flask.Request to Starlette Request (#12852)
2020-12-21 15:34:15 -06:00
Philipp Moritz
ec9ad4a56b
Documentation for Ray debugger stepping (#12845)
2020-12-20 00:43:27 -08:00
Amog Kamsetty
5d3c9c8861
[Tune] Mlflow Integration (#12840)
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-19 00:40:02 -08:00
Edward Oakes
3521e74f3a
[serve] Support for imported backends (#12923)
2020-12-18 15:49:24 -06:00
Kai Fricke
3d72000826
[tune] Add points_to_evaluate to BasicVariantGenerator (#12916)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-17 19:16:03 -08:00
SangBin Cho
de7848231c
[Doc] Fix placement group doc (#12875)
2020-12-15 10:36:51 -08:00
Max Fitton
e077bc4206
[Release] Bump master to 1.2.0 for 1.1.0 release (#12856)
2020-12-15 09:40:26 -08:00
Gekho457
8cebe5cbe9
[docs][autoscaler][k8s][minor] quotes #12866
2020-12-14 18:24:13 -08:00
Gekho457
44f5be04ca
[autoscaler][k8s][doc][minor] Fix typo in k8s doc. (#12865)
2020-12-14 17:30:43 -08:00
architkulkarni
231518e86f
[Serve] Support basic Starlette response types (#12811)
2020-12-14 17:03:56 -06:00
Gekho457
11ce1dc743
Ray cluster CRD and example CR + multi-ray-cluster operator (#12098)
2020-12-14 10:26:01 -06:00
Kai Fricke
5f04ade6ef
[tune] add more stoppers and stopper documentation (#12750)
* Add new stoppers & docs

* Add tests for maximum iteration stopper and trial plateau stopper

* Update python/ray/tune/stopper.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/tune/api_docs/stoppers.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/tune/api_docs/stoppers.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Apply suggestions from code review

* Apply suggestions from code review

* Update python/ray/tune/stopper.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-12 01:47:19 -08:00
Kai Fricke
905652cdd6
[tune] migrate xgboost callback api (#12745)
* Migrate to new-style xgboost callbacks

* Fix flaky progress reporter test

* Fix import error

* Take last value (not first)
2020-12-12 01:42:20 -08:00
Eric Liang
4ad4463be6
Add comments to clarify purpose of new scheduler queues (#12730)
* update

* clarify

* update
2020-12-11 11:53:09 -08:00
Sumanth Ratna
8d1ad25545
[docs] Add troubleshooting section to installation page (#12659)
* Add troubleshooting section to installation docs

* Set fix instructions lang to bash

* Update doc/source/installation.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-10 18:56:56 -08:00
Ian Rodney
9b3ef2f340
[docs] Fix Docker links (#12702)
* switch autoscaler -> ray-ml

* add more tables
2020-12-10 18:08:48 -08:00