Commit graph

9 commits

Author SHA1 Message Date
Dmitri Gekhtman
fc4ac71deb
[minor] Fix legacy OSS operator test (#23540)
A legacy K8s test fails due to incorrect usage of @ray.method, which only started raising errors after the Ray 1.12.0 branch cut.
This PR removes the use of @ray.method in the test.

Some context in #23271 and #23471

In addition, I noticed some of the tests were flaky due to out-of-memory issues. For that reason, I've doubled the memory requests and limits in the legacy operator's example files.

I've also added CPU limits in an example file that was missing them -- it makes the most sense for consistency with Ray's resource model to use CPU limits in K8s configs.

Finally, I added an extra note to the instructions for running the tests.
2022-04-18 17:47:42 -07:00
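
The resource changes described in #23540 can be pictured with a minimal sketch. It assumes limits are simply set equal to requests, and every quantity below is illustrative; the commit message does not state the actual values. The helper names double_memory and with_limits are hypothetical and do not come from the Ray codebase.

```python
def double_memory(quantity: str) -> str:
    """Double a Kubernetes memory quantity written with an Mi or Gi suffix."""
    for suffix in ("Mi", "Gi"):
        if quantity.endswith(suffix):
            return f"{2 * int(quantity[:-len(suffix)])}{suffix}"
    raise ValueError(f"unsupported quantity: {quantity}")


def with_limits(requests: dict) -> dict:
    """Build a container resources block whose limits mirror its requests."""
    return {"requests": dict(requests), "limits": dict(requests)}


# Illustrative starting point, not the values from the example files.
old_requests = {"cpu": "1", "memory": "512Mi"}

# Double only the memory figure, then mirror requests into limits so that
# both CPU and memory limits are present.
new_requests = {**old_requests, "memory": double_memory(old_requests["memory"])}
resources = with_limits(new_requests)
```

Setting a CPU limit alongside the request gives the container a fixed CPU allotment, which is what the commit means by consistency with Ray's resource model.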
Dmitri Gekhtman
f51566e622
Prep K8s operator for the Ray 1.11.0 release. (#22264)
For consistency and safety, we set an explicit port, 6379, in all default and example configs for Ray on K8s.
Documentation is updated to recommend matching Ray versions in the operator and the Ray cluster.
2022-02-09 18:59:50 -08:00
Sasha Sobol
fcb044d47c
[autoscaler] make 0 default min/max workers for head node (#17757)
* make 0 default min/max workers for head node

* fix helm charts, test, defaults for head

* fix test, docs

* comments. logging

* better wording (logs)

Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>

* fix logging message

* fix max workers in raycluster.yaml

* use default values of 0 for min/max workers in a helm chart

* add missing line back

Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>
2021-08-25 14:56:20 -04:00
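
As a rough illustration of the defaulting behaviour named in #17757, the sketch below fills in min_workers and max_workers as 0 for the head node type when they are missing. The function fill_head_defaults and the node type names are hypothetical; this is not the autoscaler's actual code, only a guess at the shape of the logic.

```python
def fill_head_defaults(node_types: dict, head_type: str) -> dict:
    """Return a copy of node_types where the head type defaults to 0 workers."""
    filled = {name: dict(cfg) for name, cfg in node_types.items()}
    head = filled[head_type]
    # Only fill in values the user did not set explicitly.
    head.setdefault("min_workers", 0)
    head.setdefault("max_workers", 0)
    return filled


# Hypothetical node types; only the worker type sets its own bounds.
node_types = {
    "head-node": {"resources": {"CPU": 1}},
    "worker-node": {"resources": {"CPU": 1}, "min_workers": 1, "max_workers": 3},
}
filled = fill_head_defaults(node_types, "head-node")
```

The input mapping is copied rather than mutated, so the user's original config stays unchanged.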
Holden Karau
b9dae93bfa
Add ephemeral-storage: 1Gi requests but no limits. (#17854)
* Add ephemeral-storage: 1Gi requests but no limits. This is useful when scheduling in a storage-constrained environment, since Ray assumes it has ephemeral storage to use.

* Add ephemeral-storage: 1Gi to deploy/charts/ray/templates/operator_cluster_scoped.yaml and deploy/charts/ray/templates/operator_namespaced.yaml
2021-08-17 21:10:39 -04:00
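
The "requests but no limits" choice in #17854 can be sketched as follows. The helper add_ephemeral_storage_request and the starting CPU and memory values are hypothetical; only the ephemeral-storage key and the 1Gi figure come from the commit message.

```python
def add_ephemeral_storage_request(resources: dict, size: str = "1Gi") -> dict:
    """Return a copy with an ephemeral-storage request and no matching limit."""
    updated = {key: dict(value) for key, value in resources.items()}
    updated.setdefault("requests", {})["ephemeral-storage"] = size
    return updated


# Illustrative container resources before the change.
pod_resources = {
    "requests": {"cpu": "1", "memory": "1Gi"},
    "limits": {"cpu": "1", "memory": "1Gi"},
}
patched = add_ephemeral_storage_request(pod_resources)
```

With a request but no limit, the scheduler only places the pod on a node with at least that much ephemeral storage available, and no per-container storage limit is added.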
Holden Karau
e0f8e18173
Make the ray logs visible (#17810) 2021-08-15 17:16:55 -04:00
crdnb
113ed2a07c
[kubernetes] Add CPU limit so the Ray Helm chart works in environments that require resource limits to be set (#16701) 2021-06-30 13:31:55 -07:00
Travis Addair
050a076de9
[k8s] Refactored k8s operator to use kopf for controller logic (#15787)
Co-authored-by: Dmitri Gekhtman <dmitri.m.gekhtman@gmail.com>
2021-06-01 12:00:55 -07:00
Dmitri Gekhtman
27c2f570f1
[kubernetes] pin the K8s config yamls to ray:latest instead of ray1.3 (#15988) 2021-06-01 19:12:35 +03:00
Dmitri Gekhtman
95c3d88cac
[autoscaler][kubernetes] Helm chart (#15614) 2021-05-17 16:55:10 -07:00