Commit graph

8966 commits

Author SHA1 Message Date
Yi Cheng
29352e7fa3
[workflow] Fix some usability issues (#17284) 2021-07-23 11:39:49 -07:00
Edward Oakes
cc38ffb0a0
[debugger] Add doc entry for the --ray-debugger-external flag (#17294) 2021-07-23 12:00:30 -05:00
Eric Liang
df7fe8dd6d
[data] Cleanup Block type by dropping Generic[T] (#17276)
* wip

* update

* update

* quotes
2021-07-23 09:23:06 -07:00
Lixin Wei
ded239205f
[Core] Close RPC Server After GcsHearbeatManager (#17238) 2021-07-23 09:12:13 -07:00
Dmitri Gekhtman
e701ded54f
[autoscaler] Tweaks to support remote (K8s) operators (#17194)
* node provider hooks

* disable node updaters

* pending means not completed

* draft wip

* add flag to autoscaler initialization

* Explain

* terminate unhealthy nodes

* fix, add event summarizer message

* Revert node provider

* remove hooks from autoscaler.py

* avert indent apocalypse

* wip

* copy-node-termination-logic

* Added a test

* Finish tests

* test cleanup

* Move disable node updaters to config yaml

* fix

* Drop arg
2021-07-23 11:30:18 -04:00
Edward Oakes
811eb4b092
[debugger] Enable attaching to breakpoints on remote nodes (off by default) (#17275) 2021-07-23 09:37:40 -05:00
Siyuan (Ryans) Zhuang
57b2328e7b
[workflow] Virtual actor writer - Part I (#17256)
* update readonly virtual actor

use signature module

refactoring workflow

new execution interface

advance progress of a workflow

update storage

last_step_of_workflow

prevent setting dynamic output of "output.json" in workflow directory

use alternative exception

* fix

* fix comments

* better step names

* add TODO

* fix comments

* log errors when retry

* fix storage test
2021-07-22 22:53:04 -07:00
lantian-xu
daf37b7621
[Streaming] Fix illegal cast when rollbacking. (#17195)
Co-authored-by: yz54123 <57480840+yz54123@users.noreply.github.com>
2021-07-23 13:08:34 +08:00
Clark Zinzow
1ab4f0def7
[Datasets] Port read_binary_files to Datasource API. (#17225) 2021-07-22 19:03:10 -07:00
Yi Cheng
5f4d9085d2
[workflow] workflow ci enable (#17255)
* Enable workflow tests

* update

* Fix one bug
2021-07-22 17:59:24 -07:00
Simon Mo
b9b79cd5f4
[Runtime Env] Support per task/actor uri override job_config (#17252) 2021-07-22 16:37:43 -07:00
Edward Oakes
f6375cbb7c
[core] Fix bazel test sizes for C++ unit tests (#17272) 2021-07-22 17:38:56 -05:00
mwtian
b8e71f641c
[Build] Ray Docker image for Python 3.9. (#16571) 2021-07-22 13:38:57 -07:00
Richard Liaw
2ce73ce843
[docs] Revert #16919 and fix documentation build (#17270) 2021-07-22 13:03:38 -07:00
Dmitri Gekhtman
e5925daed3
fix (#17268) 2021-07-22 15:22:52 -04:00
Clark Zinzow
cff7596ea1
[Core] Update locality protocol comment. (#17267) 2021-07-22 11:43:01 -07:00
Simon Mo
aaf8afb78d
[Runtime Env] Add a test for working_dir inheritance (#17245) 2021-07-22 10:48:25 -07:00
Chen Shen
c691f73d87
[core][usability] fix noisy push related log (#17250) 2021-07-22 09:33:08 -07:00
Chen Shen
7736d06399
[core][easy] remove unused code in buffer_pool 2021-07-22 09:31:20 -07:00
Yi Cheng
760b11263a
[workflow] Workflow manager API (#17226) 2021-07-22 09:30:52 -07:00
Sven Mika
7bc4376466
[RLlib] Example script: Simple league-based self-play w/ open spiel env (markov soccer or connect-4). (#17077) 2021-07-22 10:59:13 -04:00
Richard Liaw
a78a2263e5
[RLlib] Fix reverted RockPaperScissors Pettingzoo example (#16896) 2021-07-22 10:55:07 -04:00
xwjiang2010
f3a31a3b94
[tune] Add test for flatten_dict. (#17241)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-21 22:01:01 -07:00
Alexis DUBURCQ
362f7b7c56
[RLlib] Do not deepcopy input dict for efficiency and consistency with similar methods. (#15709)
Co-authored-by: Alexis Duburcq <alexis.duburcq@wandercraft.eu>
2021-07-21 20:09:41 -07:00
Chen Shen
70ab8aa1d4
Revert "[core] Do not spill back tasks blocked on args to blocked nodes (#16488)" (#17247)
This reverts commit dad8db46e1.
2021-07-21 19:41:35 -07:00
Vince Jankovics
05c9dfbbda
[RLlib] CV2 to Skimage dependency change (#16841) 2021-07-21 22:24:18 -04:00
Clark Zinzow
05a7102104
[Datasets] Port read_parquet to Datasource API. (#17230)
* Port read_parquet to Datasource API.

* Update to new block representation.

* Remove unused _parse_paths.

* Support column selection.

* Formatting.

* Add column selection test.
2021-07-21 17:39:39 -07:00
Chen Shen
edb80d6122
[core][rfc] Fix race condition between write chunk and abort object. (#17234)
* fix

* address comments

* sang's comment
2021-07-21 17:39:06 -07:00
Simon Mo
7b44dd8ecb
Revert "[core] remove opencensus/prometheus_exporter dependencies" (#17251)
This reverts commit 64874e1877.
2021-07-21 16:57:47 -07:00
Julius Frost
0b1b6222bc
[rllib] Add merge_trainer_config arguments to trainer template (#17160) 2021-07-21 15:43:06 -07:00
Yi Cheng
5accfa662c
[workflow] Test for better coverage (#17233)
* update

* workflow.init

* update

* update

* update tests

* check

* up

* update

* update

* check

* merge

* fix tests

* update

* add tests

* up

* format

* add space

* Update test_storage.py

Co-authored-by: Siyuan <suquark@gmail.com>
2021-07-21 14:48:36 -07:00
Antoni Baum
2e37826458
[tune] Function API support for ResourceChangingScheduler (#17150)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-21 14:14:12 -07:00
Siyuan (Ryans) Zhuang
a550eb3e97
[Workflow] Update workflow storage interface (#17222)
* workflow.init

* update tests

* add tests
2021-07-21 11:51:01 -07:00
Clark Zinzow
b5194ca9f9
Add imports to docs examples to make the code more runnable. (#17240) 2021-07-21 11:18:45 -07:00
chenk008
afd59be8ca
[Core] Add worker resource limit (#17179)
* add resource restricted

* fix test

* lint

* lint
2021-07-21 22:00:34 +08:00
Kai Fricke
64874e1877
[core] remove opencensus/prometheus_exporter dependencies (#17182) 2021-07-21 12:57:31 +01:00
Simon Mo
250f0c24e0
[Runtime Env] Refactor local dev mode for linking ray packages (#17227) 2021-07-21 00:48:58 -07:00
Kai Fricke
e881c6cff8
[core] remove aiohttp dependencies (#17181) 2021-07-21 07:18:19 +01:00
Stephanie Wang
dad8db46e1
[core] Do not spill back tasks blocked on args to blocked nodes (#16488) 2021-07-20 17:13:02 -07:00
Jiao
f4f702c595
[Release] change default expiration to 2 days in order to prevent custodian kill it early morning (#17215)
Co-authored-by: Jiao Dong <jiaodong@anyscale.com>
2021-07-20 17:03:14 -07:00
Eric Liang
877076160e
[data] Enable zero-copy access to underlying Arrow tables (#17192) 2021-07-20 16:38:21 -07:00
Eric Liang
d6e91a5b46
Update PublicAPI annotations #17224 2021-07-20 16:37:53 -07:00
Clark Zinzow
09f32b68d3
[Datasets] Slice off S3 protocol from S3 URIs. (#17219)
* Ensure that S3 protocols are sliced off of S3 URIs.

* Use urllib to parse and trim URI to path.
2021-07-20 15:23:35 -07:00
Clark Zinzow
08a50bf3b7
[Datasets] Allow for Parquet metadata file to be missing. (#17217)
* Allow for Parquet metadata file to be missing.

* Remove for-else.
2021-07-20 15:20:26 -07:00
Jiao
7473f663ef
[Release] change replica to 100 to collect signals now (#17214)
Co-authored-by: Jiao Dong <jiaodong@anyscale.com>
2021-07-20 12:27:56 -07:00
Ian Rodney
db510f317a
[CODEOWNERS] Add Ray Client Proto (#17210)
* add ray client proto

* Add Ameer
2021-07-20 12:23:19 -07:00
Alex Wu
433382f102
Update CODEOWNERS for dashboard snapshot (#17196)
* Update CODEOWNERS for dashboard snapshot

* Update CODEOWNERS

* Update CODEOWNERS
2021-07-20 12:22:52 -07:00
Ian Rodney
e6bf0a8ea6
[autoscaler][docstring] Add Docstring for StandardAutoscaler ctor (#17213) 2021-07-20 12:19:54 -07:00
Sven Mika
5a313ba3d6
[RLlib] Refactor: All tf static graph code should reside inside Policy class. (#17169) 2021-07-20 14:58:13 -04:00
Patrick Ames
efed07023f
[autoscaler] Custom AWS network interface error condition tests and missing security group bug fix. (#17207) 2021-07-20 11:17:27 -07:00