Commit graph

8303 commits

Author SHA1 Message Date
Qing Wang
d6d27e9d34
[Java] Enable concurrent calls in local mode. (#14896)
* Enable concurrent calls in local mode.

* Fix submitting actor tasks before actor creation task executed.

Co-authored-by: Qing Wang <jovany.wq@antgroup.com>
2021-06-10 23:21:11 +08:00
Tao Wang
9741bc00c9
[Core]Limit job error message size (#16336) 2021-06-10 19:28:00 +08:00
Sven Mika
3d4dc60e2e
[RLlib] CQL iteration count fixes: Remove dummy buffer and unnecessary store op from exec_plan. (#16332) 2021-06-10 07:49:17 +02:00
SangBin Cho
c8a5d7ba85
[TEST] Additional data processing nightly test (#16078)
* in progress

* in progress

* almost done

* Lint

* almost done

* All tests are available now

* Change the test a little more stressful

* Modify paramter to make tests a little more stressful
2021-06-09 22:38:53 -07:00
Eric Liang
d390344a8f
Enable plasma fallback allocations by default (#16244) 2021-06-09 22:05:52 -07:00
SongGuyang
67761a4fc5
[C++ worker] add node-ip-address params to C++ worker (#16253) 2021-06-10 11:10:56 +08:00
architkulkarni
7d029f8e71
[Doc] [Core] [runtime env] Add runtime env doc (#16290) 2021-06-09 20:02:16 -05:00
Siyuan (Ryans) Zhuang
8aee4e5634
[Workflow] Workflow API extension (#16276) 2021-06-09 14:55:01 -07:00
Chen Shen
5fe03667b9
[RFC] add ray.util.get_locations() to look up objects' location. (#16130)
* Implement GetLocationFromOwner at CoreWorker that looks up the locations
for a list of object ids

* plumbing GetLocationAPI to CoWorker

* introduce primary_node_id in refcounter

* add python tests

* address comments

* fix linit

* remove C++ tests

* more tests

* add more tests

* linter

* lint

* lint

* address comments

* fix merge issue

* nits
2021-06-09 11:30:42 -07:00
Eric Liang
6c7147dc97
Fix active RPC tracking in event tracker 2021-06-09 10:53:30 -07:00
matthewdeng
138b273136
[rllib] Add tests for examples using ray client (#16271)
* [rllib] add tests for examples using ray client

* rename test_client to test_ray_client
2021-06-09 10:39:14 -07:00
SongGuyang
874e947d6f
[runtime env] support create or delete runtime envs in agent (#15904) 2021-06-09 20:22:25 +08:00
SangBin Cho
1795e73cf2
Revert "Batch the AddSpilledURLs RPC (#16303)" (#16331)
This reverts commit deda35fb4a.
2021-06-09 00:33:57 -07:00
SangBin Cho
d9227d8506
[Pubsub] Pubsub module command batch part 1 (#16167)
* Basic command batch implemented.

* working.

* fix bugs.

* Improve a protobuf message.

* Update description of protobuf.

* Addressed code review.
2021-06-09 00:27:06 -07:00
Ian Rodney
c2f5ca399f
[Cleanup] Use Constant instead of "RAY_ADDRESS" in code (#16257) 2021-06-08 22:53:56 -07:00
Tao Wang
1c94906efc
[Test][Tiny]Check argv in right way (#16325) 2021-06-09 13:18:27 +08:00
Kai Yang
81be461ba2
[Core] Limit starting workers with maximum_startup_concurrency per worker type (#16214) 2021-06-09 13:11:53 +08:00
Simon Mo
4c0069edc2
[CI] Fix determine_tests_to_run logic (#16320)
We recently had two master breakages due to determine_tests_to_run
script bug.
https://github.com/ray-project/ray/pull/16120
https://github.com/ray-project/ray/pull/15981

This PR fix both of them.
2021-06-08 19:59:03 -07:00
Kathryn Zhou
2394ab2d2e
Update versioning for tracing in Ray docs (#16041)
Co-authored-by: Kathryn Zhou <kathrynzhou@kathryns-mbp.lan>
2021-06-08 19:23:19 -07:00
Dmitri Gekhtman
41b2e569fb
[autoscaler] Don't rsync cluster state with local node provider (#16281) 2021-06-08 12:27:06 -07:00
Eric Liang
deda35fb4a
Batch the AddSpilledURLs RPC (#16303) 2021-06-08 12:10:35 -07:00
Alex Wu
ae1cb12221
Revert "[GCS] Bookkeeping normal task resources in GCS (#16185)" (#16315)
This reverts commit f2384a9743.
2021-06-08 11:02:28 -07:00
fyrestone
4ca316a0f4
Move test_snapshot from test_dashboard.py to modules/snapshot/tests/test_snapshot.py (#16306)
Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-06-08 10:26:03 -07:00
Amog Kamsetty
de4045703d
[SGD] Fix SGD Client CI (#16301) 2021-06-08 10:08:14 -07:00
Simon Mo
9afb6f1ada
Revert "[CI] macOS Build to buildkite (#16135)" (#16312)
This reverts commit 113556463d.
2021-06-08 09:33:03 -07:00
Sven Mika
4b8dadccbd
[RLlib] Fix PR 16162: Having added sleep to _NextValueNotReady causes TD3 tests to become flakey. (#16309) 2021-06-08 07:27:02 -07:00
Chong-Li
f2384a9743
[GCS] Bookkeeping normal task resources in GCS (#16185) 2021-06-08 19:58:15 +08:00
Clark Zinzow
ca68bf1e93
[Release] Update release test configs for 1.4 release. (#16292)
* Updated scalability envelope tests for 1.4.

* Update data processing release test for 1.4.
2021-06-08 00:15:25 -07:00
Lixin Wei
870a0c16a3
[Logging] Change std::exit to std::_Exit (#16280)
* change abort to exit

* change to std::_Exit
2021-06-08 00:14:17 -07:00
Chris K. W
c8e3ed9eec
[core] Use function_actor_manager.lock when deserializing (#16278)
* use function_actor_manager.lock when deserializing

* add comment and todo

* better comment

* fix comment
2021-06-08 00:13:42 -07:00
mwtian
c2a2a6f7c3
Make it easier to run asan and wheel release tests (#16242) 2021-06-07 22:54:22 -07:00
Simon Mo
113556463d
[CI] macOS Build to buildkite (#16135) 2021-06-07 21:33:00 -07:00
fyrestone
dfadf33a94
[Dashboard] Reorganize dashboard modules - node (#16217) 2021-06-07 19:50:46 -07:00
Travis Addair
7802ff66d4
[docker] Updated GPU Dockerfiles to CUDA 11.2 (#16269) 2021-06-07 16:15:19 -07:00
Alex Wu
6f5064b7ef
Use pytest not unittest (#16265)
* .

* done

* done

* .

* .

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-06-07 12:26:56 -07:00
Alex Wu
9f8f108e3f
[deflek] Split test failure into test failure 4 (#16264)
* .

* .

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-06-07 11:54:55 -07:00
Lixin Wei
75196cf7f4
[scheduler] Clean up TaskRequest (#16288) 2021-06-07 11:38:34 -07:00
Edward Oakes
418dd1e8b9
fix serve start namespace issue and add test (#16291) 2021-06-07 11:30:31 -07:00
Siyuan (Ryans) Zhuang
480e5e822e
Inital workflow API implementation (#16174) 2021-06-07 10:00:15 -07:00
SangBin Cho
f867c27eda
[Object spilling] Fix race condition that deletes files at the wrong timing. (#16153)
* Error fix.

* remove debug code

* Add unit test

* Fix a test failure
2021-06-07 09:56:55 -07:00
SangBin Cho
3572d0837e
[Test] Dask on ray sort nightly (#16213)
* Make dask on ray sort works

* lint

* revert unrelated change
2021-06-06 15:58:48 -07:00
SangBin Cho
03c33cf443
add a streaming shuffl etest (#16258) 2021-06-06 15:58:14 -07:00
Eric Liang
1d8cb2d19e
Add event stats documentation, fix misc race condition (#16236)
* update

* stats

* udpate

* fix
2021-06-06 12:44:30 -07:00
architkulkarni
b88163f010
[Core] [runtime env] Fix injection of ray[default] (#16275) 2021-06-05 17:32:50 -05:00
architkulkarni
b3a0b97737
Revert "[Core] [runtime env] Inject ray[default] into pip dependencies (#16268)" (#16274)
This reverts commit e5fad4bc2d.
2021-06-05 21:26:19 +03:00
Eric Liang
ca861ee47f
update (#16270) 2021-06-05 11:16:01 -07:00
Dmitri Gekhtman
7d1e7a0d4f
[autoscaler] Fix local node provider (#16202)
* Don't override resources for local node provider.

* Wip

* Local node provider prep logic

* ../python/ray/autoscaler/local/defaults.yaml

* wip

* Fix example-full

* defaults comment

* wip

* head type max workers

* sync-state

* No docker

* Fix

* external head ip option

* wip

* move external_ip out of tags

* Update examples

* Update comment

* Skip local defaults

* Config test

* Test external ip

* Change ray start commands to what they were before

* missing yamls

* Fix test

* Remove scary Docker

* Fixes

* Extra test

* address comments

* fixes pre-single-node-type-attempy

* rewrite comment a bit

* One type

* fix

* get rid of pdb

* no placeholders

* fix

* worker nodes and head node optional during launch

* fix

* fix again

* config comment fixes

* mock -> aws, not local

* Update python/ray/autoscaler/_private/local/config.py

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>

* second pop fixed

* Explanatory comments for config logic

* deprecation comments

* Update python/ray/autoscaler/_private/local/config.py

Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>

* update test

* fix

* More descriptive name for local provider check

* Remove external-ip from example minimal and add a more detailed doc string.

* Make clearer the equivalence between a ray restart and non-empty ray-start commands

* extra comment

* Update python/ray/autoscaler/_private/local/node_provider.py

* Update python/ray/autoscaler/_private/commands.py

* Update python/ray/autoscaler/_private/commands.py

* Update python/ray/autoscaler/_private/util.py

* lint

* Update python/ray/autoscaler/_private/local/node_provider.py

Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>
2021-06-05 19:29:19 +03:00
Dmitri Gekhtman
e58ba66681
[gcp][doc][minor] project_id is required (#16266) 2021-06-05 01:00:11 -07:00
Chris K. W
2e11ac678f
[autoscaler] Additional Autoscaler Metrics (#16198) 2021-06-04 23:19:17 -07:00
architkulkarni
e5fad4bc2d
[Core] [runtime env] Inject ray[default] into pip dependencies (#16268)
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
2021-06-05 00:22:33 -05:00