Commit graph

8966 commits

Author SHA1 Message Date
Amog Kamsetty
add6ceb3ec
[Dependencies] Fix missing dependency UX (#17420)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-08-05 20:18:42 -07:00
Amog Kamsetty
14b02c3341
Add ray.data symlink to setup-dev.py (#17624) 2021-08-05 19:51:15 -07:00
Chen Shen
920a4e3d56
[core] Improve fatal message for fallback allocation (#17595) 2021-08-05 17:58:45 -07:00
Chen Shen
0fd3f761b9
[ci][rfc] build debug wheels and run python test on debug build (#17399)
* enable debug mode

* add

* :upload debug wheels

* upload debug wheels

* add

* fix bug

* add dbg

* Update python/setup.py

Co-authored-by: Simon Mo <simon.mo@hey.com>

* skip windows

Co-authored-by: Simon Mo <simon.mo@hey.com>
2021-08-05 17:58:19 -07:00
Chen Shen
4ff35d43b3
[object store refactor 3/n] introduce object_store (#17332)
refactor-allocator

add object_store
2021-08-05 17:36:27 -07:00
SangBin Cho
8bc9286296
Remove an unused profile event code from object manager. (#17529)
* Remove an unused profile event code from object manager.

* Addressed code review.

* Temporarily skip a test

* lint
2021-08-05 17:13:16 -07:00
SangBin Cho
d59d6ad653
[RFC][Usability] Improve general Ray stacktrace including adding Actor repr (#17389)
* 1. Added a label to the stack trace. 2. Remove ray code from user stacktrace. Improve stacktrace message.

* Add a test to the build

* Fix the issue

* Addressed code review.

* Addressed code review and debugging

* fix

* Try fixing tests.

* Fixed the issue.

* Fixed a bug for real. Tests need to be re-written

* Try one test.

* Formatting

* Addressed code review.

* Addressed the last code review.
2021-08-05 17:12:24 -07:00
SangBin Cho
99b26b476d
Fix flaky windows reconstruction test (#17564) 2021-08-05 17:10:54 -07:00
Amog Kamsetty
e4cf26ea6e
[SGD] v2 Prototype sgd.report() implementation (#17536)
* finish session

* finish

* formatting

* tests

* wip

* remove pdb

* remove import

* add tests

* raise from None

* Address comments

* Exception

* remove from None

* fix test

* address comments

* Update python/ray/util/sgd/v2/constants.py

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>

* add tests for session

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>
2021-08-05 16:03:21 -07:00
SangBin Cho
381ffdb6d0
Revert "[gcs] Fix actor killing race condition (#17456)" (#17599)
This reverts commit 521457b51b.
2021-08-05 15:54:03 -07:00
Edward Oakes
839ceba6db
[serve] Replace "backend" with "deployment" in metrics & logging (#17434) 2021-08-05 17:37:21 -05:00
architkulkarni
05b0da94b7
[core] [runtime env] [test] Add runtime env release test (#17507) 2021-08-05 16:06:20 -05:00
Simon Mo
c2ebb67c1d
[Doc] Fold TOC index by default (#17617)
* [Doc] Fold TOC index by default

* pin
2021-08-05 14:02:05 -07:00
architkulkarni
e84ae6caa5
[Core] [runtime env] Avoid spurious worker startup (#17422) 2021-08-05 15:46:23 -05:00
SangBin Cho
667851f0ad
Prototype done. (#17603) 2021-08-05 13:32:44 -07:00
Eric Liang
8ff3fce4ba
Add a warning if the number of queued tasks to an actor exceeds 5k (#17581) 2021-08-05 12:03:48 -07:00
Kai Fricke
86fcf076d6
[release] use job timeout with ray/anyscale connect (#17614) 2021-08-05 17:53:14 +01:00
Amog Kamsetty
be238e159d
[Tune] Update docs for with_parameters (#17441)
* with_parameters_doc

* update docstring

* address comments
2021-08-05 08:48:34 -07:00
Sven Mika
3b447265d8
[RLlib] Fix Trainer.add_policy for num_workers>0 (self play example scripts). (#17566) 2021-08-05 11:41:18 -04:00
Sven Mika
0eb0e0ff58
[RLlib] Add support for multi-GPU to DDPG. (#17586) 2021-08-05 11:39:51 -04:00
Sven Mika
3013d9b341
[RLlib] Fix "Cannot convert a symbolic Tensor (default_policy/strided_slice_3:0) to a numpy array. This error may indicate that you're trying to pass a Tensor to a NumPy call, which is not supported" (#17587) 2021-08-05 11:39:15 -04:00
architkulkarni
3ae5229b44
[core] Skip adding "script directory" to workers' sys.path when in interactive shell (#17556) 2021-08-05 10:05:19 -05:00
SongGuyang
79bec61e12
[event] support WithField option in RAY_EVENT api (#17476) 2021-08-05 20:45:55 +08:00
qicosmos
f1f7d4a085
[C++ Worker]Add some APIs for task call part one (#16499) 2021-08-05 17:25:36 +08:00
Amog Kamsetty
d3155bc1a8
increase timeout (#17580) 2021-08-05 10:20:46 +01:00
matthewdeng
46c1db1aa7
[release] increase golden notebook test timeout (#17601) 2021-08-05 10:00:38 +01:00
architkulkarni
8c1317067d
move variable updates from middle of loop to end (#17591) 2021-08-05 09:53:01 +01:00
SongGuyang
93094e567b
[C++ API] C++ API doc update (#17538) 2021-08-05 16:24:07 +08:00
Siyuan (Ryans) Zhuang
ffe5b45cc1
[workflow] Enable test (#17585) 2021-08-04 21:18:50 -07:00
matthewdeng
1eca6ac154
[SGD] v2 alpha: Tensorflow Backend (#17532)
* [SGD] Implement Tensorflow Backend

* addres comments

* address comments

* format
2021-08-04 16:49:50 -07:00
Eric Liang
6db63990af
Don't capture child tasks in placement groups by default (#17527) 2021-08-04 16:09:45 -07:00
Chen Shen
53a0c74413
[nightly-test] fix non_streaming_shuffle_1tb_5000_partitions 2021-08-04 16:06:53 -07:00
Eric Liang
d4f9d3620e
Move ray.data out of experimental (#17560) 2021-08-04 13:31:10 -07:00
architkulkarni
63708468df
[runtime env] [Doc] Runtime env doc and messaging improvements (#17547) 2021-08-04 12:28:42 -07:00
Siyuan (Ryans) Zhuang
e3c09b0af1
[Workflow] Fix nested virtual actor (#17565)
* fix nested actor

* fix nested actor serialization

* one more example

* update exception message
2021-08-04 10:46:45 -07:00
SangBin Cho
3d13781e67
[Test ]Unflake raylet signal test (#17563) 2021-08-04 10:38:59 -07:00
Yi Cheng
521457b51b
[gcs] Fix actor killing race condition (#17456) 2021-08-04 10:37:56 -07:00
Lixin Wei
a2b0d2f99f
[Core] Add Back Pressure to GCS's gRPC Server (#17427) 2021-08-04 10:36:39 -07:00
Kai Fricke
5d56a8aac5
[RLlib] Fix ignoring "sample_collector" config key (#17460) 2021-08-04 10:27:35 -04:00
Navneet Nandan
208d997414
Added support for the imagePullSecrets in helm chart (#17520) 2021-08-04 09:45:39 -04:00
SongGuyang
3e42f54910
Support copyright format for c++ files (#14348) 2021-08-04 17:19:38 +08:00
Sven Mika
b539560429
[RLlib] Remove SampleBatch._get_slice_indices() deprecated logging spam (moderate). (#17567) 2021-08-04 04:21:32 -04:00
Chen Shen
8393b9add8
[Test] increase timeout for autoscaling shuffle (#17545) 2021-08-03 23:16:00 -07:00
Eric Liang
cb48f3a712
Be more conservative in warning about too many workers (#17531) 2021-08-03 22:30:18 -07:00
Chris K. W
a33cbec12a
[client][docs] update docs for new client support in init (#17333)
* start

* check formatting

* undo changes from base branch

* Client builder API docs

* indent

* 8

* minor fixes

* absolute path to runtime env docs

* fix runtime_env link

* Update worker.init docs

* drop clientbuilder docs, link to 1.4.1 docs instead. Specify local:// behavior when address passed

* add debug info for ray.init("local")

* local:// attaches a driver directly

* update ray.init return wording

* remote init.connect() from example

* drop local:// docs, add section on when to use ray client

* link to 1.4.1 docs in code example instead of mentioning clientbuilder

* fix backticks, doc mentions of ray.util.connect

* remove ray.util.connect mentions from examples and comments

* update tune example

* wording

* localhost:<port> also works if you're on the head node

* add quotes

* drop mentions of ray client from ray.init docstring

* local->remote

* fix section ref

* update ray start output

* fix section link

* try to fix doc again

* fix link wording

* drop local:// from docs and special handling from code

* update ray start message

* lint

* doc lint

* remove local:// codepath

* remove 'internal_config'

* Update doc/source/cluster/ray-client.rst

Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>

* doc suggestion

* Update doc/source/cluster/ray-client.rst

Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>
2021-08-04 05:31:44 +03:00
James Mishra
6240d22060
Validate Redis addresses before making the client (#17481) 2021-08-03 16:56:53 -07:00
Siyuan (Ryans) Zhuang
bef519b373
[Workflow] Simplify storage and bug fix (#17453)
* simplify storage

* bug fix

* use a key-value like naming

* update workflow API

* fix s3

* add test
2021-08-03 16:38:54 -07:00
Sven Mika
5107d16ae5
[RLlib] Add @Deprecated decorator to simplify/unify deprecation of classes, methods, functions. (#17530) 2021-08-03 18:30:02 -04:00
Alex Wu
fb3be7a7dd
Fix dataset inference nightly test (#17549)
Co-authored-by: Alex Wu <alex@anyscale.com>
2021-08-03 14:17:11 -07:00
Ian Rodney
f3acae6eb6
[Autoscaler] Sync Files before Starting Docker (#17361) 2021-08-03 13:25:08 -07:00