ray/python
Dmitri Gekhtman 8971422d8f
[autoscaler] Use drain node api in autoscaler before terminating nodes (#20013)
* wip

* Draft

* Use bytest for node id

* remove stray helm change

* fix autoscaler init arg

* don't forget to instantiate new load metrics dict

* remove extraneous diff

* Timeout, comments, function signature.

* typo

* another comment

* tweak

* docstring

* shorter timeout

* Use a better error code

* missing self

* Dedent example

* Add drain node prometheus metric.

* comment

* Update tests part 1: test_autoscaler.py

* Update tests part 2: test_resource_demand_scheduler

* lint

* Update tests part 3: test_autoscaling_policy

* Unit tests for new Prometheus metric and DrainNode error handling.

* comment

* removed unused function

* Try adding ability to mock out process termination to fake node provider

* Add integration test.

* fix

* fix

* lint

* Improve log message

* fix

* Simplify test

* Fix doc example

* remove unused dict

* Mock out process termination in a subclass

* Add add doc string and comment explaining prune active ips.

* Comment: wtf is use_node_id_as_ip

* one more comment

* more explanation

* period

* tweak
2021-11-11 08:31:40 -08:00
..
ray [autoscaler] Use drain node api in autoscaler before terminating nodes (#20013) 2021-11-11 08:31:40 -08:00
requirements [CI] Pin shortuuid to fix CI (#20153) 2021-11-08 12:08:32 -08:00
asv.conf.json [docs] Move all /latest links to /master (#11897) 2020-11-10 10:53:28 -08:00
build-wheel-macos-arm64.sh [CI] [macOS] avoid installing latest setuptools (#20064) 2021-11-04 21:35:03 -07:00
build-wheel-macos.sh [CI] [macOS] avoid installing latest setuptools (#20064) 2021-11-04 21:35:03 -07:00
build-wheel-manylinux2014.sh [dashboard] Rename "new_dashboard" -> "dashboard" (#18214) 2021-09-15 11:17:15 -05:00
build-wheel-windows.sh [Dashboard] Include the dashboard in Windows wheels (#19575) 2021-10-22 17:57:36 -07:00
MANIFEST.in [Build] Another attempt at building Python 3.9 MacOS wheels (#16347) 2021-06-10 10:20:30 -07:00
README-building-wheels.md [build] Build wheels with manylinux2014 (#11621) 2020-11-03 19:36:32 -08:00
requirements.txt [serve] Add Google Cloud Storage as a backend (#20104) 2021-11-10 19:45:19 -08:00
requirements_linters.txt [Lint] Add flake8-bugbear (#19053) 2021-10-03 23:24:11 -07:00
requirements_ml_docker.txt [Deps] Bump tensorflow on Docker image and add Codeowners (#20041) 2021-11-05 00:58:34 -07:00
setup.py [autoscaler] AWS Autoscaler CloudWatch Integration (#18619) 2021-11-09 11:48:55 -08:00