Tao Wang
e1075c0a82
[GCS]Fill resource fields when re-report heartbeat after gcs restarted ( #12097 )
2020-11-25 11:07:02 +08:00
Edward Oakes
dae137b919
Don't allow 'optional' files in setup.py ( #12359 )
2020-11-24 17:41:58 -06:00
Eric Liang
5895554555
[autoscaler] Raise node "start" deadline to 900s, make configurable ( #12316 )
2020-11-24 12:16:01 -08:00
Edward Oakes
4ada3e4c99
[serve] Incremental change towards async control loop for replica startup ( #12281 )
2020-11-24 13:06:08 -06:00
roireshef
888357d251
added address resolution fix for running in docker containers ( #11944 )
...
* added address resolution fix for running in docker containers
* added address resolution fix for running in docker containers (java)
* Update RayNativeRuntime.java
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-11-24 10:34:56 -08:00
Edward Oakes
be0fa7b8b4
Properly specify kubectl-rsync.sh in setup.py ( #12356 )
2020-11-24 12:13:29 -06:00
Gekho457
e66ddab190
[autoscaler/k8s] Handle unavailable k8s API ( #12283 )
2020-11-24 11:13:15 -06:00
Kai Fricke
b94bfdfa99
[tune] use default anonymous metric _metric if at least a mode is set ( #12159 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-23 20:09:33 -08:00
Edward Oakes
135f2e0602
[serve] Only use typed BackendConfig in controller ( #12275 )
2020-11-23 20:56:04 -06:00
Edward Oakes
be2187f942
Remove use_legacy_config tests ( #12272 )
2020-11-23 20:30:50 -06:00
Edward Oakes
1b3c48b922
Remove unused _post_get_hooks ( #12270 )
2020-11-23 20:30:30 -06:00
Edward Oakes
a93ed4583d
Remove unused print_failed_task ( #12267 )
2020-11-23 16:33:10 -06:00
Richard Liaw
e59fe65d3d
[tune] Fix logging for dockersyncer ( #12196 )
2020-11-23 14:29:41 -08:00
Edward Oakes
4e43650851
Remove deprecated f._submit API ( #12268 )
2020-11-23 16:20:22 -06:00
Sumanth Ratna
6346ea0be6
[tune] Allow None values in TBX logger ( #12262 )
2020-11-23 12:57:23 -08:00
viotemp1
4c4f189f97
[tune] nevergrad add points_to_evaluate ( #12207 )
2020-11-23 12:51:04 -08:00
Ian Rodney
1cf95cb081
[Serve] Remove old report-queue-code ( #12211 )
2020-11-23 12:42:05 -08:00
Ian Rodney
7f1f16d99e
[Core] Force get_actor(name)'s name to be non-empty string ( #12218 )
2020-11-23 12:38:16 -08:00
Richard Liaw
40428c9b05
[tune] Make tf distributed testing smaller ( #12173 )
2020-11-23 12:15:10 -08:00
Edward Oakes
c99c376d66
Remove unused logging job id field ( #12266 )
2020-11-23 13:00:15 -06:00
Kaushik B
643c07e451
[tune] update param & install hints for Search Algos ( #12242 )
...
* fix(tune): change objective_name param to metric for AxSearch
* fix(tune): install hints for search algos
2020-11-22 15:39:41 -08:00
SangBin Cho
eb89c567e2
[Metrics] Skip prom file test on Windows. ( #12234 )
2020-11-22 00:32:18 -08:00
Ameer Haj Ali
2a9499a1bd
[autoscaler] cover autofilling resources with try except. ( #12222 )
...
Co-authored-by: Ameer Haj Ali <ameerhajali@ameers-mbp.lan>
Co-authored-by: Alex Wu <alex@anyscale.io>
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
Co-authored-by: Eric Liang <ekhliang@gmail.com>
Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
2020-11-21 13:04:24 -08:00
Kai Fricke
608d0378c4
[tune] Add test for infinite trials ( #12156 )
2020-11-21 12:54:01 -08:00
Eric Liang
839517743d
Support ray.* in remote functions for Ray client ( #12177 )
2020-11-20 13:28:46 -08:00
Richard Liaw
48042be8bb
[tune] Avoid dependency on Kubernetes ( #12188 )
...
* fix-kubernetes
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
* kub
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-20 13:01:20 -08:00
Simon Mo
d200f620ee
Deflake test_router ( #12175 )
2020-11-19 18:37:46 -08:00
dHannasch
4b2c5daf45
State which IP addresses are failing to match. ( #11957 )
...
* State which IP addresses are failing to match.
* Use f-string.
* action item?
* I could swear this passed with length 80 before
* wait, this is how it wants f-strings
* reword
* action item
* f
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
* f
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
* f
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
2020-11-19 17:25:25 -08:00
Eric Liang
e72abcd0aa
Enable even more new scheduler tests ( #12096 )
2020-11-19 16:47:18 -08:00
Kai Fricke
f1ace386db
[tune] detect docker and kubernetes syncers ( #12108 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-19 12:17:17 -08:00
SangBin Cho
7d67af6c2a
[Metrics] Add stats to measure process startup time + scheduling stats. ( #12100 )
...
* Add new stats.
* Fix issues.
2020-11-19 11:04:26 -08:00
Kai Fricke
6999075c75
[tune] Add seed parameter to BOHB ( #12160 )
2020-11-19 10:27:16 -08:00
Philipp Moritz
ff82af1588
Clean up requirements.txt ( #12136 )
2020-11-19 09:27:09 -08:00
Xianyang Liu
9481ecd180
[data] MLDataset based on ParallelIterator ( #11849 )
2020-11-19 00:33:37 -08:00
Barak Michener
2fe1321c3f
[ray_client] __getattr__ for the API Import interface ( #12089 )
...
* move all things that import real-ray into the server folder
* change the import line and have a __getattr__-able API stub
* formatting
* remove unused (duplicated) util file
* Remove module methods (but leave comment on why)
2020-11-18 22:42:02 -08:00
Ian Rodney
a74f1885db
Revert "[CLI] Fix ray commands when RAY_ADDRESS used ( #11989 )" ( #12135 )
...
* Revert "[CLI] Fix ray commands when RAY_ADDRESS used (#11989 )"
This reverts commit d23d326560.
* only check environment for CLI commands
* use new fns
* fixing docs
* rename and return "auto"
* Update python/ray/_private/services.py
Co-authored-by: Eric Liang <ekhliang@gmail.com>
* Update services.py
* Update services.py
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-11-18 22:41:10 -08:00
dHannasch
5bc4976550
More informative error message if ray start fails to connect to Redis ( #11880 )
...
* Chain original redis.ConnectionError. More importantly, print out the address so people don't have to dig out --logging-level debug to get the number wait_for_redis_to_start() already knows.
Check the Redis password.
* f
2020-11-18 19:28:10 -08:00
Richard Liaw
0d388c4d31
[autoscaler] remove unnecessary print output ( #12131 )
2020-11-18 18:33:48 -08:00
Richard Liaw
2bb6db5e64
[tune] temporary revert of verbosity changes ( #12132 )
2020-11-18 18:27:41 -08:00
Ameer Haj Ali
4717fcd9c0
[autoscaler] give max_workers precedence over min_workers in resource demand scheduler ( #12106 )
2020-11-18 16:24:48 -08:00
Ameer Haj Ali
d826452e0b
[autoscaler] fix max_workers bug in resource_demand_scheduler by counting the head node ( #12123 )
2020-11-18 15:24:38 -08:00
Ian Rodney
e086ddc18f
[core] Add Recursive task cancelation ( #11923 )
2020-11-18 15:18:40 -08:00
Alex Wu
e9c9ba9c9f
[New Scheduler] Don't start tasks if the owner is dead ( #12050 )
2020-11-18 11:34:19 -08:00
Ameer Haj Ali
eef624750c
[ray client] ray wait() implementation ( #12072 )
2020-11-18 11:33:57 -08:00
Kai Fricke
2b60c5774b
[tune] cache checkpoint serialization ( #12064 )
2020-11-18 09:03:53 -08:00
Ian Rodney
d23d326560
[CLI] Fix ray commands when RAY_ADDRESS used ( #11989 )
...
* [CLI] Fix ray commands when RAY_ADDRESS used
* Eric's suggestion
2020-11-17 23:44:59 -08:00
Philipp Moritz
b96516e9d3
[core] Remove google dependency ( #12085 )
2020-11-17 19:01:00 -08:00
fangfengbin
f400333841
[Placement Group]Placement Group supports gcs failover(Part2) ( #12003 )
...
* add testcase
* fix ut
* fix review comment
* fix review comment
* fix review comments
* fix ut bug
* add part code
* add part code
* add part code
* add testcase
* add part code
* fix ut bug
* fix ut timeout bug
* fix ut bug
Co-authored-by: 灵洵 <fengbin.ffb@antgroup.com>
2020-11-18 10:59:26 +08:00
Simon Mo
c476037c97
[Core] Async API should raise on all RayError ( #12043 )
...
Before this PR we raised only RayTaskError, which means errors
like RayActorError (actor died) were not propagated and thrown at
`await object_ref`. This PR fixes that.
2020-11-17 17:20:30 -08:00
Stephanie Wang
f6bdd5ab17
[New Scheduler] Spillback from the queue of tasks assigned to the local node ( #12084 )
2020-11-17 16:13:59 -08:00