Commit graph

5176 commits

Author SHA1 Message Date
mwtian
49a57aa477
[Scheduling] Report resource demand for infeasible 1-CPU tasks (#19000) 2021-09-30 22:03:02 -07:00
Jiajun Yao
d64872dd67
Fix python mutable default argument anti-pattern (#19028) 2021-10-01 13:05:02 +09:00
Edward Oakes
8e5d48d668
[runtime_env] Remove deprecated override_environment_variables and worker_env fields (#18213) 2021-09-30 18:55:24 -05:00
Jiajun Yao
81b052f222
[core] Fix port collision between metrics agent port and metrics export port (#19016) 2021-09-30 16:15:42 -07:00
Ian Rodney
02d1f659ba
[Workflows] Use RAY_ADDRESS in Tests (#19012) 2021-09-30 13:05:51 -07:00
Chris K. W
61d058fe66
[client] skip test_wrapped_actor_creation on windows (#19013)
* skip test_wrapped_actor_creation on windows

* rerun windows ci
2021-09-30 13:04:43 -07:00
Frank Luan
732af42ae9
[Sort benchmark] Two-stage reduce (#17055)
* [WIP] Sorting benchmark

* Separate num_mappers and num_reducers

* Add tests

* Fix tests

* Tracing

* Separate num_mappers and num_reducers

* Two-stage reduce

* Back pressure to avoid excessive spilling

* Make merger_concurrency an option

* Fix tests

* Tweaks

* Remote writers

* Format

* WIP

* Address comments

* Fix tests and address comments

* Lint

* Fix mount points for testing

* Simplify code path

* Address comments
2021-09-30 12:39:11 -07:00
architkulkarni
0f0b161ea1
Revert "Revert "[Serve] [doc] Improve runtime env doc"" (#18943)
* Revert "Revert "[Serve] [doc] Improve runtime env doc (#18782)" (#18935)"

This reverts commit e4f4c79252.
2021-09-30 13:28:44 -05:00
Clark Zinzow
e384a6c91f
(TaskPool) Cancel all transformation tasks when one task fails or when SIGINT is received. (#18991) 2021-09-30 10:56:30 -07:00
gjoliver
e61f2c72d7
Upgrade bazel version to 4.2.1 (#18996) 2021-09-30 10:50:54 -07:00
mwtian
d12e35ce53
[Object manager] don't abort entire pull request on race condition in concurrent chunk receive (#18955) 2021-09-30 10:19:54 -07:00
Simon Mo
910553c3bb
[Core] Add private method to retrieve current task queue length (#18964) 2021-09-30 09:20:04 -07:00
Amog Kamsetty
98ac3f601c
[SGD] v1 to v2 Migration Guide (#18887)
* wip

* add guide

* fix test

* address comments

* add to docs

* fix

* remove markdown

* add warning to all pages

* formatting

* fix

* links

* Update doc/source/raysgd/v2/migration-guide.rst

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>

* Update doc/source/raysgd/v2/migration-guide.rst

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>

* Update doc/source/raysgd/v2/migration-guide.rst

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>

* Update doc/source/raysgd/v2/migration-guide.rst

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>

* Update doc/source/raysgd/v2/migration-guide.rst

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>

* address comments

* address comments

* fix

* address comments

Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>
2021-09-30 09:15:21 -07:00
architkulkarni
bf6e50813c
[runtime env] Parse local pip/conda requirements files locally upon task/actor definition (#18988) 2021-09-30 09:47:15 -05:00
Chris K. W
291fd36dee
_ray_trace_ctx fix follow-up (#18950)
* sanity check

* add test case

* fix assert

* refactor

* check kwargs instead of _kwargs

* format
2021-09-29 23:53:04 -07:00
Clark Zinzow
74b5d3d8f7
[Datasets] Minimize truncation on balanced splits. (#18953)
* Minimize truncation on balanced splits.

* Refactor into subroutines.

* Feedback and fixes.
2021-09-29 21:57:08 -07:00
Alex Wu
5709c6501b
[dataset][usability] Dataset dependencies (#18346) 2021-09-29 17:29:31 -07:00
Clark Zinzow
73a6cda812
Handle empty datasets properly in most Dataset transformations. (#18983) 2021-09-29 17:27:03 -07:00
Eric Liang
aa985e1a9c
Fix false positive error message from autoscaler events (#18981) 2021-09-29 15:51:18 -07:00
Antoni Baum
573c66a755
[GCP] Update GCP TPU config (#18634)
* [autoscaler] Update GCP TPU config

* Preemptible by default

* Remove libtpu link from head node

* Workaround
2021-09-29 12:41:26 -07:00
Jiajun Yao
ed9118393c
Listen to 127.0.0.1 by default on mac osx (#18904) 2021-09-29 11:40:19 -07:00
Eric Liang
3665c99896
Deflake test_failure_2.py::test_warning_for_infeasible_zero_cpu_actor 2021-09-29 11:39:16 -07:00
Dmitri Gekhtman
944309c017
Revert "[nightly] Deflaky nightly test many_nodes_actor_test (#18582)" (#18954)
* Revert "[nightly] Deflaky nightly test many_nodes_actor_test (#18582)"

This reverts commit fc6a739e4b.

* move to large test

Co-authored-by: Yi Cheng <chengyidna@gmail.com>
2021-09-29 11:02:14 -04:00
Chong-Li
42744f29ee
[GCS] Make Gcs-based actor scheduler's bookkeeping consistent (#18546)
* Make Gcs-based scheduler's bookkeeping consistent

* Remove this from lambda function

* Fix lambda function

* Trigger SchedulePendingActors

* Test for acquiring/releasing resources

* Reorganize structure

* Avoid overloading post

* Fix gcs_actor_manager_test

* Fix post counter and rename some func

* Fix unique_ptr

* Fix unique_ptr

* Fix book lint error

* Lint

Co-authored-by: Chong-Li <lc300133@antgroup.com>
2021-09-29 05:53:34 -07:00
matthewdeng
91a5f67261
[SGD] add share_cuda_visible_devices config flag (#18958) 2021-09-29 00:21:46 -07:00
Eric Liang
4d763d3ffd
Increase metrics fetch timeout in autoscaler for large clusters 2021-09-28 15:24:44 -07:00
Edward Oakes
73b8936aa8
[runtime_env] Unify rpc::RuntimeEnv with serialized_runtime_env field (#18641) 2021-09-28 15:13:15 -05:00
Edward Oakes
96054953cc
[serve] Remove python_methods logic and raise an error dynamically instead (#18927) 2021-09-28 09:51:46 -07:00
Chris K. W
191af472ac
[client] remove ray_trace_ctx from kwargs if tracing disabled (#18926) 2021-09-28 09:47:43 -07:00
Ian Rodney
0d3544588e
[AutoRun] Fix Auto-Run for Client (#18457)
* remove old version

* auto init first attempt

* arg for fn decorator

* default to True

* ray.method should not autostart

* comments

* no auto init on global state fns

* tiny test fix

* quick tests

* respond to comments

* explain func

* fix comments

* forgot to save

* fix again

* fix reconnect tests

* fix medium tests

* fix workflows test

* Better fix for workflows
2021-09-28 08:00:26 -07:00
Richard Liaw
227aa9e89b
[tune] change delimiter for results (#16573)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-09-28 10:03:00 +01:00
Kai Fricke
6be87a3453
[tune] fix tune list-trials metric (#18914) 2021-09-28 09:59:32 +01:00
Yi Cheng
e4f4c79252
Revert "[Serve] [doc] Improve runtime env doc (#18782)" (#18935)
This reverts commit d4d71985d5.
2021-09-27 21:52:13 -07:00
Eric Liang
caf34a452c
Unify ArrowTensorType tables and Tensor blocks (#18867) 2021-09-27 16:24:09 -07:00
Maxim Egorushkin
be0133da1d
[Autoscaler][GCP] Allow Google Compute Engine instance templates. (#18620)
Co-authored-by: Maxim Egorushkin <maxim.egorushkin@gmail.com>
Co-authored-by: Ian <ian.rodney@gmail.com>
2021-09-27 16:08:41 -07:00
architkulkarni
d4d71985d5
[Serve] [doc] Improve runtime env doc (#18782) 2021-09-27 16:12:03 -05:00
Antoni Baum
72cc0c9bda
[SGDv2] Add Tune-Cifar-PyTorch-PBT example (#18860)
* [SGDv2] Add Tune-Cifar-PyTorch-PBT example

* Update python/ray/util/sgd/v2/BUILD

* Lint

* Update example

* Update docs
2021-09-27 09:22:40 -07:00
Jernej Makovsek
d6758ff92a
[tune] Fix HEBOSearch installation docs (#18861)
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
2021-09-27 09:06:14 +01:00
Chen Shen
aaae8c122b
Fix pid None in logs (#18898) 2021-09-25 17:07:32 -07:00
Chris K. W
0669e651b9
[client] Bump client proto version (#18866) 2021-09-24 14:35:59 -07:00
Eric Liang
11a2dfcaab
Improve unschedulable task warning messages by integrating with the autoscaler (#18724) 2021-09-24 12:19:58 -07:00
architkulkarni
fbf5f5d56b
[runtime env] [Serve] Fix error when uris field is None (#18874) 2021-09-24 14:07:17 -05:00
dependabot[bot]
40aec916d7
[RLlib](deps): Bump pyglet from 1.5.0 to 1.5.18 in /python/requirements/rllib (#16687)
* [RLlib](deps): Bump pyglet in /python/requirements/rllib

Bumps [pyglet](https://github.com/pyglet/pyglet) from 1.5.0 to 1.5.18.
- [Release notes](https://github.com/pyglet/pyglet/releases)
- [Changelog](https://github.com/pyglet/pyglet/blob/v1.5.18/RELEASE_NOTES)
- [Commits](https://github.com/pyglet/pyglet/compare/v1.5.0...v1.5.18)

---
updated-dependencies:
- dependency-name: pyglet
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

* fix

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
Co-authored-by: sven1977 <svenmika1977@gmail.com>
2021-09-24 19:32:55 +02:00
Guyang Song
337005d5a5
[C++ API][hotfix] fix C++ worker dynamic library loading issue on macOS (#18877)
* fix C++ worker in macox

* fix
2021-09-24 23:39:00 +08:00
Simon Mo
565131a854
[Serve] Support http_location=FixedNumber (#18731) 2021-09-23 15:59:12 -07:00
Simon Mo
5aa1e08633
[Serve] Exit run_forever when actor shutdown (#18820) 2021-09-23 15:17:31 -07:00
Yi Cheng
b5ccee6ad3
Skip failed actor test (#18815) 2021-09-23 11:02:02 -07:00
Kai Fricke
2d46e0e14b
[tune] Fix Analysis.dataframe() documentation and enable passing of mode=None (#18850) 2021-09-23 18:27:54 +01:00
Stephanie Wang
7b1e594412
[core] Fix bug in ref counting protocol for nested objects (#18821)
* Fix assertion crash

* test, lint

* todo

* tests

* protocol

* test

* fix

* lint

* header

* recursive

* note

* forward test

* lock

* lint

* unneeded check
2021-09-23 09:45:12 -07:00
Alex Wu
5d57eed598
[Workflow] Serialization cleanup (#18328)
* notes

* notes

* .

* seems to work?

* .

* seems to work

* needs tests

* needs tests

* parallelize uploads

* fixed

* fixed

* .

* dumb test

* .

* .

* fix festsg

* .

* works

* .:

* .

* .

* .

* Update common.py

* .

* almost removed special case for inputs

* lint

* lint

* .

* handle edge case

* .

* .

* lint

* needs dedupe

* needs dedupe

* still need to not leak cache

* still need to not leak cache

* probably fails edge cases?

* probably fails edge cases?

* works?

* cleanup

* passes test?

* ???

* done?

* may work?

* may work?

* .

* .

* Revert "."

This reverts commit 6aee40630637783d1756e226861b518668112337.

* Revert "."

This reverts commit 040a0e59e731d1f4e3b85ca2153474fc97963ae8.

* Revert "may work?"

This reverts commit fc26b54627c3c72dfdbaf0e79ba89d7503db4a94.

* Revert "may work?"

This reverts commit 85f48bb11a5c1764ef2cf3701ec41eb948fc7fc1.

* Revert "done?"

This reverts commit 573f4e0cb98417494b30c7a36987391d9bb8d064.

* passs tests

* lint

* cleanup

* bug fix

* bug fix

* print

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-09-23 09:18:59 -07:00