Commit graph

9210 commits

Author SHA1 Message Date
Sasha Sobol
fcb044d47c
[autoscaler] make 0 default min/max workers for head node (#17757)
* make 0 default min/max workers for head node

* fix helm charts, test, defaults for head

* fix test, docs

* make 0 default min/max workers for head node

* fix helm charts, test, defaults for head

* fix test, docs

* comments. logging

* better wording (logs)

Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>

* fix logging message

* fix max workers in raycluster.yaml

* use default values of 0 for min/max workders in a helm chart

* add missing line back

Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>
2021-08-25 14:56:20 -04:00
Jiao
b52c873027
[serve] Use list_deployments in benchmark (#18050) 2021-08-25 12:26:46 -05:00
Simon Mo
7c01ea7d2f
[Serve][Hotfix] Skip get_deployment on windows (#18029) 2021-08-25 11:27:31 -05:00
Simon Mo
ca61e43e76
[dashboard] Fix GCS retry error message in dashboard_head (#18057) 2021-08-25 11:23:36 -05:00
Edward Oakes
18ae99d8d8
[codeowners] Add @kombuchafox as code owner for dashboard snapshot (#18051) 2021-08-25 10:56:11 -05:00
Nikita Vemuri
a9c731edd3
[serve] Remove requirement to specify namespace for serve.start(detached=True) (#17470) 2021-08-25 10:39:32 -05:00
Kai Fricke
4c3276644e
[release] After buildkite ask step, use RAY_TEST_REPO pipeline (#18074) 2021-08-25 15:58:38 +02:00
Sven Mika
b6aa8223bc
[RLlib] Fix final_scale's default value to 0.02 (see OrnsteinUhlenbeck exploration). (#18070) 2021-08-25 14:22:09 +02:00
Guyang Song
16502cc438
[Event] support multi-thread context copy (#17919) 2021-08-25 14:03:20 +08:00
Tao Wang
0b5f5890f7
[Named Actor] Throw RayException when getting named actor timed out (#17998)
* [Named Actor]throw RayException when getting named actor timed out

* lint

* correct the message

* lint

* nice catch
2021-08-25 13:50:53 +08:00
SangBin Cho
43da68e657
Fix a nightly dask on ray test (#18060) 2021-08-24 22:15:34 -07:00
Chen Shen
08193b8e0a
fix more windows failures (#18058) 2021-08-24 21:19:45 -07:00
Eric Liang
5571d2436d
Deprecate MLDataset (#18030) 2021-08-24 13:04:11 -07:00
Chen Shen
69fd5fb678
[CI] increase object spill test timeout for mac (#18036) 2021-08-24 13:03:50 -07:00
Jiajun Yao
25ef452b15
[Core] Fix typo in local_mode_task_submitter.cc (#18046) 2021-08-24 13:03:05 -07:00
Sven Mika
9883505e84
[RLlib] Add [LSTM=True + multi-GPU]-tests to nightly RLlib testing suite (for all algos supporting RNNs, except R2D2, RNNSAC, and DDPPO). (#18017) 2021-08-24 21:55:27 +02:00
architkulkarni
97dd13be09
[Serve] [dashboard] Fix formatting bugs in cluster snapshot (#17977)
* show "unversioned" in actor metadata

* hash deployment names

* update test

* replace "Unversioned" with "None"

* bypass convert to camelCase for deployment names

* fix convert_case default to match previous setting

* lint

* replace deployment_name_hash with underscore
2021-08-24 12:06:26 -07:00
gjoliver
9c7456933c
[RLlib] Small parallel iterator doc fix. (#18043) 2021-08-24 19:57:58 +02:00
Yi Cheng
995d3cb487
Update id_specification.md (#18035) 2021-08-24 10:49:56 -07:00
xwjiang2010
5d68657246
[Tune] Sanitize trial checkpoint filename. (#17985) 2021-08-24 10:08:36 -07:00
Antoni Baum
88163c4755
[docs] Add a TPU example to the docs (#17959)
* Add a TPU example to the docs

* Add a line about TPU API

* Add link to TPU pods

* Clarify
2021-08-24 10:08:26 -07:00
Chen Shen
3a04cb0d73
fix windows test failures (#18022) 2021-08-24 09:28:51 -07:00
Sven Mika
ff7740e508
[RLlib] Fix the git-bisect helper/debugging script. (#18040) 2021-08-24 18:08:29 +02:00
Antoni Baum
1f8ce1ede8
[tune] Explicitly instantiate skopt categorical spaces (#18005) 2021-08-24 17:11:21 +02:00
Alex Wu
6e3dd7b3cf
Revert "[Core]make thread of client manager in gcs server configurable (#17978)" (#18041)
This reverts commit f0edbf0d30.
2021-08-24 07:57:59 -07:00
Qing Wang
7c1f14ddd8
Do not connect in constructor to avoid potential risk. (#17916)
* Do not connect in ctor.

* Fix lint.

Co-authored-by: Qing Wang <jovany.wq@antgroup.com>
2021-08-24 16:41:30 +08:00
Kai Fricke
ff68251f89
[release] Add python 3.9 to fix-docker-latest.sh (#18037) 2021-08-24 10:07:03 +02:00
Chen Shen
7e3e0d1535
[Test] Add C++ tsan test (#17875) 2021-08-24 00:57:32 -07:00
wanxing
abb46de4dc
[object store refactor 5/n] Add eviction policy tests (#17984)
* add eviction policy tests

* fix object_lifecycle_manager_test build

* make IsObjectExists private
2021-08-24 00:50:28 -07:00
Kai Fricke
d058f98546
[RLlib] Add GPU tests to CI (run per-PR). (#17891)
Co-authored-by: simon-mo <simon.mo@hey.com>
2021-08-24 09:20:45 +02:00
Tao Wang
f0edbf0d30
[Core]make thread of client manager in gcs server configurable (#17978) 2021-08-24 11:27:35 +08:00
dependabot[bot]
15adedc72c
[tune](deps): Bump sigopt in /python/requirements/tune (#17996)
Bumps [sigopt](https://sigopt.com/) from 7.4.0 to 7.5.0.

---
updated-dependencies:
- dependency-name: sigopt
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-08-23 15:18:59 -07:00
dependabot[bot]
f97a292867
[tune](deps): Bump dask[complete] in /python/requirements/tune (#17997)
Bumps [dask[complete]](https://github.com/dask/dask) from 2021.06.1 to 2021.8.1.
- [Release notes](https://github.com/dask/dask/releases)
- [Changelog](https://github.com/dask/dask/blob/main/docs/release-procedure.md)
- [Commits](https://github.com/dask/dask/compare/2021.06.1...2021.08.1)

---
updated-dependencies:
- dependency-name: dask[complete]
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-08-23 15:18:37 -07:00
Yi Cheng
5849f80e41
[Core] Fix typo of actor repr (#18011) 2021-08-23 14:33:51 -07:00
Edward Oakes
1a61082ed4
[serve] Remove deprecated endpoints code (#17989) 2021-08-23 13:53:09 -07:00
architkulkarni
f9de05d44c
[Doc] fix incorrect conda example in runtime env doc (#17972) 2021-08-23 13:44:04 -07:00
Amog Kamsetty
4c384df526
fix wheel links (#17973) 2021-08-23 13:43:34 -07:00
Chen Shen
0f894e9cbd
revert ebs cold start (#18010) 2021-08-23 13:40:31 -07:00
chenk008
b9978dd02b
[Core] revert: revert Unified worker starter (#18008) 2021-08-23 13:34:32 -07:00
Yi Cheng
fd71bde9b4
[client] Allow multiple client connections from one driver (#17942) 2021-08-23 13:01:58 -07:00
Sven Mika
d47ed62083
[Testing RLlib] Increase timeout for nightly multi-GPU learning tests. (#17987) 2021-08-23 10:37:50 +02:00
Kai Fricke
fca8af88d2
[release] Fix e2e environment variable passing from pipeline (#18000) 2021-08-23 09:26:37 +02:00
Chen Shen
e369ecab43
Fix EBS cold start in Mac (#18001) 2021-08-22 20:03:59 -07:00
Dmitri Gekhtman
13d5d0f9ef
[autoscaler][hotfix] Update node list after terminating unhealthy nodes (#17992)
* Update nodes; update test.

* consistency

* lint
2021-08-22 18:22:10 -04:00
Clark Zinzow
5ca28b1cc8
[Core] Update Bazel (to 3.4.1), gRPC, boringssl, and absl as a precursor to gRPC streaming PR. (#17903)
* Update Bazel (to 3.4.1), gRPC, boringssl, absl.

* Always reinstall Bazel if needing to upgrade to a new Bazel version.

* Add patch for properly detecting Windows Python headers when building gRPC.

* Add minimum Bazel version check.

* Update docs with new Bazel version.
2021-08-21 11:33:11 -07:00
Sven Mika
494ddd98c1
[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. (#17928) 2021-08-21 17:05:48 +02:00
Edward Oakes
b969aa3c80
[dashboard] Don't start dashboard agent when missing dependencies (#17966) 2021-08-21 01:04:21 -07:00
77loopin
c6b24fcb5d
[RayClient] Add the guide for k8s Ingress (#17736)
Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>
Co-authored-by: seungjaebaek <seungjaebaek@linecorp.com>
2021-08-20 18:31:03 -07:00
Eric Liang
58e35a21b4
Add split_at_indices() (#17990) 2021-08-20 15:35:22 -07:00
Lixin Wei
05502da271
Add dispatch proxy to event tracker (#17983) 2021-08-20 15:32:10 -07:00