9186 commits

Author SHA1 Message Date
Tao Wang
5990b60f8b
[Core]Cache named actor in local in case of getting them from GCS frequently. (#17339)
* [Core]Cache named actor in local in case of getting them from GCS frequently

* lint

* fix nullptr

* typo

* add namespace to cache

* lint

* lock, reference and others

* lint

* fix comments and add test

* lint

* lint

* optimize test

* add necessary fields in pub for caching

* add removing test

* fix test
2021-08-09 14:01:57 +08:00
SangBin Cho
1bcab9a7bb
[Object Spilling] Better error message for nightly test debugging (#17645)
* Fix

* Addressed code review.

* Addressed code review.
2021-08-08 20:44:49 -07:00
Hao Chen
0858f0e4f2
Change core worker C++ namespace to ray::core (#17610) 2021-08-08 23:34:25 +08:00
Simon Mo
c315596ed2
[Buildkite] Migrate macOS wheel builds (#16913) 2021-08-07 21:54:34 -07:00
SangBin Cho
654718902f
Fix (#17660) 2021-08-07 18:07:27 -07:00
Qing Wang
4cc34588db
[Core] Support ConcurrentGroup part1 (#16795)
* Core change and Java change.

* Fix void call.

* Address comments and fix cases.

* Fix asyncio
2021-08-07 22:41:33 +08:00
Kai Yang
9b3c0ad35b
Fix “argument type mismatch” when an exception occurs in chained tasks (#17636) 2021-08-07 17:47:43 +08:00
Tricia Fu
c415c26644
[serve] Update FastAPI documentation to make it runnable (#17589) 2021-08-06 17:46:19 -05:00
architkulkarni
f4c70be7f7
[Serve] Add replica tag to request counter and error counter (#17613) 2021-08-06 15:35:34 -07:00
architkulkarni
6d975b821b
[Serve] [Dashboard] Initial PR for exporting Serve data to cluster snapshot (#17489) 2021-08-06 15:03:29 -07:00
SangBin Cho
4616e8a03c
Fix wrong invariant pubsub (#17620)
* ip

* loose check failure

* Fix the bug properly.

* Fix comments.
2021-08-06 14:14:54 -07:00
Edward Oakes
57b190c987
[serve] Remove logic to automatically infer conda env name (#17639) 2021-08-06 13:27:23 -05:00
architkulkarni
b173b33934
[tests] Add runtime envs release test to nightly build script (#17638) 2021-08-06 13:18:25 -05:00
liuyang-my
12bd904594
[Serve] Define BackendConfig protobuf and adapt it in Java (#17201) 2021-08-06 09:50:45 -07:00
architkulkarni
ac9a1a20df
[core] [runtime_env] Use per-env async lock in agent (#17542)
Co-authored-by: Ed Oakes <ed.nmi.oakes@gmail.com>
2021-08-06 11:11:37 -05:00
Kai Fricke
2b520bafc5
[release/alert] less results (#17637) 2021-08-06 10:26:07 +01:00
Kai Fricke
bd2404e496
[release/rllib] fix learning test script (#17635) 2021-08-06 10:07:59 +01:00
Zhi Lin
82123123c4
[object store] Java API for Assign the object owner in Ray.put() (#17237)
Co-authored-by: Qing Wang <kingchin1218@126.com>
Co-authored-by: Kai Yang <kfstorm@outlook.com>
2021-08-06 15:26:59 +08:00
Amog Kamsetty
f0cca063ad
[SGD v2] Reduce time for HF smoke test (#17623)
* reduce

* switch back model

* Update python/ray/util/sgd/v2/BUILD
2021-08-05 21:04:34 -07:00
Stephanie Wang
a06d71477f
[core] Do not spill back tasks blocked on args to blocked nodes (#17550)
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2021-08-05 20:43:32 -07:00
Amog Kamsetty
add6ceb3ec
[Dependencies] Fix missing dependency UX (#17420)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-08-05 20:18:42 -07:00
Amog Kamsetty
14b02c3341
Add ray.data symlink to setup-dev.py (#17624) 2021-08-05 19:51:15 -07:00
Chen Shen
920a4e3d56
[core] Improve fatal message for fallback allocation (#17595) 2021-08-05 17:58:45 -07:00
Chen Shen
0fd3f761b9
[ci][rfc] build debug wheels and run python test on debug build (#17399)
* enable debug mode

* add

* upload debug wheels

* upload debug wheels

* add

* fix bug

* add dbg

* Update python/setup.py

Co-authored-by: Simon Mo <simon.mo@hey.com>

* skip windows

Co-authored-by: Simon Mo <simon.mo@hey.com>
2021-08-05 17:58:19 -07:00
Chen Shen
4ff35d43b3
[object store refactor 3/n] introduce object_store (#17332)
refactor-allocator

add object_store
2021-08-05 17:36:27 -07:00
SangBin Cho
8bc9286296
Remove an unused profile event code from object manager. (#17529)
* Remove an unused profile event code from object manager.

* Addressed code review.

* Temporarily skip a test

* lint
2021-08-05 17:13:16 -07:00
SangBin Cho
d59d6ad653
[RFC][Usability] Improve general Ray stacktrace including adding Actor repr (#17389)
* 1. Added a label to the stack trace. 2. Remove ray code from user stacktrace. Improve stacktrace message.

* Add a test to the build

* Fix the issue

* Addressed code review.

* Addressed code review and debugging

* fix

* Try fixing tests.

* Fixed the issue.

* Fixed a bug for real. Tests need to be re-written

* Try one test.

* Formatting

* Addressed code review.

* Addressed the last code review.
2021-08-05 17:12:24 -07:00
SangBin Cho
99b26b476d
Fix flaky windows reconstruction test (#17564) 2021-08-05 17:10:54 -07:00
Amog Kamsetty
e4cf26ea6e
[SGD] v2 Prototype sgd.report() implementation (#17536)
* finish session

* finish

* formatting

* tests

* wip

* remove pdb

* remove import

* add tests

* raise from None

* Address comments

* Exception

* remove from None

* fix test

* address comments

* Update python/ray/util/sgd/v2/constants.py

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>

* add tests for session

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>
2021-08-05 16:03:21 -07:00
SangBin Cho
381ffdb6d0
Revert "[gcs] Fix actor killing race condition (#17456)" (#17599)
This reverts commit 521457b51b.
2021-08-05 15:54:03 -07:00
Edward Oakes
839ceba6db
[serve] Replace "backend" with "deployment" in metrics & logging (#17434) 2021-08-05 17:37:21 -05:00
architkulkarni
05b0da94b7
[core] [runtime env] [test] Add runtime env release test (#17507) 2021-08-05 16:06:20 -05:00
Simon Mo
c2ebb67c1d
[Doc] Fold TOC index by default (#17617)
* [Doc] Fold TOC index by default

* pin
2021-08-05 14:02:05 -07:00
architkulkarni
e84ae6caa5
[Core] [runtime env] Avoid spurious worker startup (#17422) 2021-08-05 15:46:23 -05:00
SangBin Cho
667851f0ad
Prototype done. (#17603) 2021-08-05 13:32:44 -07:00
Eric Liang
8ff3fce4ba
Add a warning if the number of queued tasks to an actor exceeds 5k (#17581) 2021-08-05 12:03:48 -07:00
Kai Fricke
86fcf076d6
[release] use job timeout with ray/anyscale connect (#17614) 2021-08-05 17:53:14 +01:00
Amog Kamsetty
be238e159d
[Tune] Update docs for with_parameters (#17441)
* with_parameters_doc

* update docstring

* address comments
2021-08-05 08:48:34 -07:00
Sven Mika
3b447265d8
[RLlib] Fix Trainer.add_policy for num_workers>0 (self play example scripts). (#17566) 2021-08-05 11:41:18 -04:00
Sven Mika
0eb0e0ff58
[RLlib] Add support for multi-GPU to DDPG. (#17586) 2021-08-05 11:39:51 -04:00
Sven Mika
3013d9b341
[RLlib] Fix "Cannot convert a symbolic Tensor (default_policy/strided_slice_3:0) to a numpy array. This error may indicate that you're trying to pass a Tensor to a NumPy call, which is not supported" (#17587) 2021-08-05 11:39:15 -04:00
architkulkarni
3ae5229b44
[core] Skip adding "script directory" to workers' sys.path when in interactive shell (#17556) 2021-08-05 10:05:19 -05:00
SongGuyang
79bec61e12
[event] support WithField option in RAY_EVENT api (#17476) 2021-08-05 20:45:55 +08:00
qicosmos
f1f7d4a085
[C++ Worker]Add some APIs for task call part one (#16499) 2021-08-05 17:25:36 +08:00
Amog Kamsetty
d3155bc1a8
increase timeout (#17580) 2021-08-05 10:20:46 +01:00
matthewdeng
46c1db1aa7
[release] increase golden notebook test timeout (#17601) 2021-08-05 10:00:38 +01:00
architkulkarni
8c1317067d
move variable updates from middle of loop to end (#17591) 2021-08-05 09:53:01 +01:00
SongGuyang
93094e567b
[C++ API] C++ API doc update (#17538) 2021-08-05 16:24:07 +08:00
Siyuan (Ryans) Zhuang
ffe5b45cc1
[workflow] Enable test (#17585) 2021-08-04 21:18:50 -07:00
matthewdeng
1eca6ac154
[SGD] v2 alpha: Tensorflow Backend (#17532)
* [SGD] Implement Tensorflow Backend

* address comments

* address comments

* format
2021-08-04 16:49:50 -07:00