ray/python
Ameer Haj Ali a4dbb271bd
[hotfix][autoscaler] Request resources refactor2 (#12661)
* prepare for head node

* move command runner interface outside _private

* remove space

* Eric

* flake

* min_workers in multi node type

* fixing edge cases

* eric not idle

* fix target_workers to consider min_workers of node types

* idle timeout

* minor

* minor fix

* test

* lint

* eric v2

* eric 3

* min_workers constraint before bin packing

* Update resource_demand_scheduler.py

* Revert "Update resource_demand_scheduler.py"

This reverts commit 818a63a2c86d8437b3ef21c5035d701c1d1127b5.

* reducing diff

* make get_nodes_to_launch return a dict

* merge

* weird merge fix

* auto fill instance types for AWS

* Alex/Eric

* Update doc/source/cluster/autoscaling.rst

* merge autofill and input from user

* logger.exception

* make the yaml use the default autofill

* docs Eric

* remove test_autoscaler_yaml from windows tests

* lets try changing the test a bit

* return test

* lets see

* edward

* Limit max launch concurrency

* commenting frac TODO

* move to resource demand scheduler

* use STATUS UP TO DATE

* Eric

* make logger of gc freed refs debug instead of info

* add cluster name to docker mount prefix directory

* grrR

* fix tests

* moving docker directory to sdk

* move the import to prevent circular dependency

* smallf fix

* ian

* fix max launch concurrency bug to assume failing nodes as pending and consider only load_metric's connected nodes as running

* small fix

* request_resources -> min workers

* test fixes

* add race condition tests

* Eric

* fixes

* semi final

* semi final

* lint

* lint

Co-authored-by: Ameer Haj Ali <ameerhajali@ameers-mbp.lan>
Co-authored-by: Alex Wu <alex@anyscale.io>
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
Co-authored-by: Eric Liang <ekhliang@gmail.com>
Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
2020-12-08 18:41:30 -08:00
..
ray [hotfix][autoscaler] Request resources refactor2 (#12661) 2020-12-08 18:41:30 -08:00
asv.conf.json [docs] Move all /latest links to /master (#11897) 2020-11-10 10:53:28 -08:00
build-wheel-macos.sh [Dashboard] Optimize dashboard datacenter (#11391) 2020-10-27 23:49:31 -07:00
build-wheel-manylinux2014.sh fix linux wheel build (#9896) 2020-11-17 15:49:42 +08:00
build-wheel-windows.sh Add shellcheck support (#8574) 2020-07-30 18:39:28 -05:00
MANIFEST.in [autoscaler] Replace cluster yaml validation with json schema v… (#7261) 2020-03-10 18:58:55 -07:00
README-building-wheels.md [build] Build wheels with manylinux2014 (#11621) 2020-11-03 19:36:32 -08:00
requirements.txt Ray debugger stepping between tasks (#12075) 2020-12-06 21:50:18 -08:00
requirements_linters.txt [CI] Add mypy to ci (#11117) 2020-10-05 11:20:45 -07:00
requirements_ml_docker.txt [docker] Fix GPU support for tensorflow (#10779) 2020-09-17 10:56:58 -07:00
requirements_rllib.txt [RLlib] Fix most remaining RLlib algos for running with trajectory view API. (#12366) 2020-12-01 17:41:10 -08:00
requirements_tune.txt fix (#12528) 2020-12-01 09:58:12 -08:00
setup.py Fix not cp jars (#12456) 2020-11-30 13:53:09 +08:00