Amog Kamsetty
1d3941e41a
[Tests] Skip failing windows tests ( #13495 )
...
* skip failing windows tests
* skip more
* remove
* updates
2021-01-15 20:51:33 -08:00
Eric Liang
ee6332dbb0
Bump dev branch to 2.0 to avoid endless version bump toil ( #13497 )
...
* wip
* fix
* fix
2021-01-15 17:41:17 -08:00
Barak Michener
68e3a0e0e1
[ray_client]: fix wrong reference in server_pickler ( #13474 )
...
Change-Id: Ie3d219541b1875e986e72e3ae73ece145c715acf
2021-01-15 15:49:38 -08:00
Eric Liang
4aeb0ea550
Return version info from Ray client connect, to allow for discovering version mismatches
2021-01-15 14:27:26 -08:00
Ian Rodney
0ec9ddabc1
[docker/dashboard] Fix ray dashboard ( #12899 )
2021-01-15 10:03:01 -08:00
Barak Michener
84e110a949
[ray_client]: Support runtime_context as metadata ( #13428 )
2021-01-14 14:37:00 -08:00
Clark Zinzow
9a658b568f
[Core] Ownership-based Object Directory: Consolidate location table and reference table. ( #13220 )
...
* Added owned object reference before Plasma put on Create() + Seal() path.
* Consolidated location table and reference table in reference counter.
* Restore type in definition.
* Clean up owned reference on failed Seal().
* Added RemoveOwnedObject test for reference counter.
* Guard against ref going out of scope before location RPCs.
* Add 'owner must have ref in scope' precondition to documentation for object location methods.
* Move to separate Create() + Seal() methods for existing objects.
* Clearer distinction between Create() and Seal() methods.
* Make it clear that references will normally be cleaned up by reference counting.
2021-01-14 13:48:10 -08:00
Siyuan (Ryans) Zhuang
d1e9887be2
[Serialization] New custom serialization API ( #13291 )
...
* new serialization API with doc & test
* add more notes
* refine notes
* doc
2021-01-14 13:15:31 -08:00
Amog Kamsetty
07e97fe4c2
[xgb] re-enable xgboost_ray tests ( #13416 )
...
* re-enable
* fix
* update xgb_ray version
2021-01-14 22:14:44 +01:00
Edward Oakes
7ba87b8abe
Fix getting runtime context dict in driver ( #13417 )
2021-01-14 14:41:53 -06:00
Ian Rodney
411e37ce3f
[serve] Properly obey SERVE_LOG_DEBUG=0 ( #13460 )
2021-01-14 12:24:22 -08:00
Micah Yong
c89ebdd94a
[Core][CLI] ray status
and ray memory
no longer starts a new job ( #13391 )
...
* Access memory info in ray memory via GlobalStateAccessor rather than calling ray.init()
* Modify ray status cli so that it doesn't start a new job via ray.init()
* Remove local test file
* Access memory info in ray memory via GlobalStateAccessor rather than calling ray.init()
* Modify ray status cli so that it doesn't start a new job via ray.init()
* Remove local test file
* Make status and error args required in commands.py#debug.status
* Remove unnecessary imports
* Access memory info in ray memory via GlobalStateAccessor rather than calling ray.init()
* Modify ray status cli so that it doesn't start a new job via ray.init()
* Remove local test file
* Access memory info in ray memory via GlobalStateAccessor rather than calling ray.init()
* Modify ray status cli so that it doesn't start a new job via ray.init()
* Remove local test file
* Make status and error args required in commands.py#debug.status
* Remove unnecessary imports
* Job 38482.1 should now pass
* Resolve merge conflict
2021-01-14 10:12:16 -08:00
Dmitri Gekhtman
2d772a5a6d
[kubernetes][minor] Operator garbage collection fix ( #13392 )
2021-01-14 10:40:15 -06:00
Barak Michener
9c6d892eec
[ray_client]: fix exceptions raised while executing on the server on behalf of the client ( #13424 )
2021-01-14 10:38:01 -06:00
Ameer Haj Ali
2f7ba25efb
[joblib] joblib strikes again but this time on windows ( #13212 )
2021-01-14 10:36:52 -06:00
fangfengbin
4a6c53da46
[Core]Fix raylet scheduling bug ( #13452 )
...
* [Core]Fix raylet scheduling bug
* fix lint error
* fix lint error
Co-authored-by: 灵洵 <fengbin.ffb@antgroup.com>
2021-01-14 14:50:32 +01:00
Amog Kamsetty
560299972c
Revert "Enable Ray client server by default ( #13350 )" ( #13429 )
...
This reverts commit 912d0cbbf9
.
2021-01-13 21:28:54 -08:00
dHannasch
ad015cb7df
Split out the part of get_node_ip_address for which the docstring is correct ( #12796 )
2021-01-14 11:32:56 +08:00
Amog Kamsetty
3f42e6bafe
[Tune] Pin Transitive Dependencies ( #13358 )
2021-01-13 19:10:21 -08:00
Eric Liang
602c103eae
Make request_resources() use internal kv instead of redis pub sub ( #13410 )
2021-01-13 17:30:43 -08:00
Edward Oakes
9ef48b16b6
[serve] Pull out goal management logic into AsyncGoalManager class ( #13341 )
2021-01-13 18:35:25 -06:00
Edward Oakes
c6fc7124d1
[tune] Fix f-string in error message ( #13423 )
2021-01-13 18:34:21 -06:00
Simon Mo
15501a4151
Fix Serve release test ( #13385 )
2021-01-13 15:06:23 -08:00
Dmitri Gekhtman
1968b2f9d8
[autoscaler/k8s] [CI] Kubernetes test ray up, exec, down ( #12514 )
2021-01-13 15:03:56 -08:00
Eric Liang
6de5711690
Plumb retries update ( #13411 )
2021-01-13 13:49:57 -08:00
Barak Michener
8f48c64507
[ray_client]: Fix multiple attempts at checking connection ( #13422 )
2021-01-13 13:36:01 -08:00
Barak Michener
0b22341bc9
[ray_client]: Wait for ready and retry on ray.connect() ( #13376 )
...
* [ray_client]: wait until connection ready
Change-Id: Ie443be60c33ab7d6da406b3dcaa57fbb7ba57dd6
* lint
Change-Id: I30f8e870bbd5f8859a9f11ae244e210f077cedd0
* docs and retry minimum
Change-Id: I43f5378322029267ddd69f518ce8206876e2129d
2021-01-13 00:19:15 -08:00
Eric Liang
912d0cbbf9
Enable Ray client server by default ( #13350 )
...
* update
* fix
* fix test
* update
2021-01-12 21:31:01 -08:00
Ian Rodney
2e70743077
[Serve] Backend state unit tests ( #13319 )
2021-01-12 14:54:04 -06:00
architkulkarni
e560933f9c
[Serve] Add dependency management support for driver not running in a conda env ( #13269 )
2021-01-12 09:57:15 -08:00
Kai Fricke
518427627b
[tune] buffer trainable results ( #13236 )
...
* Working prototype
* Pass buffer length, fix tests
* Don't buffer per default
* Dispatch and process save in one go, added tests
* Fix tests
* Pass adaptive seconds to train_buffered, stop result processing after STOP decision
* Fix tests, add release test
* Update tests
* Added detailed logs for slow operations
* Update python/ray/tune/trial_runner.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* Apply suggestions from code review
* Revert tests and go back to old tuning loop
* nit
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-01-12 18:52:47 +01:00
Eric Liang
470fda190a
Forgot overwrite parameter in Ray client internal kv
2021-01-11 17:50:06 -08:00
Amog Kamsetty
0452a3a435
[Tune] Rename MLFlow to MLflow ( #13301 )
2021-01-11 17:36:55 -08:00
Eric Liang
de5bc24c60
Implement internal kv in ray client ( #13344 )
...
* kv internal
* fix
2021-01-11 14:54:52 -08:00
Eric Liang
fbb9795374
[client] Report number of currently active clients on connect ( #13326 )
...
* wip
* update
* update
* reset worker
* fix conn
* fix
* disable pycodestyle
2021-01-11 14:53:12 -08:00
architkulkarni
c43fa12e73
[Serve] Support Starlette streaming response ( #13328 )
2021-01-11 13:27:44 -08:00
Edward Oakes
62e1ad3973
[serve] Cleanup backend state, move checkpointing and async goal logic inside ( #13298 )
2021-01-11 11:45:43 -06:00
Edward Oakes
93006c2ba5
Use wait_for_condition to reduce flakiness in test_queue.py::test_custom_resources ( #13210 )
2021-01-10 19:32:59 -06:00
Barak Michener
6f0083ed10
add the method annotation and a comment explaining what's happening ( #13306 )
...
Change-Id: I848cc2f0beaed95340d9de7cca19a50c78d9da9a
2021-01-10 15:54:10 -08:00
Akash Patel
94a873fc4d
remove empty extras streaming deps ( #12933 )
2021-01-10 12:09:27 -08:00
Kai Fricke
d4b0a9fadf
[tune] convert search spaces: parse spec before flattening ( #12785 )
...
* Parse spec before flattening
* flatten after parse
* Test for ValueError if grid search is passed to search algorithms
2021-01-09 18:21:49 +01:00
Edward Oakes
d434ba6518
[serve] Clean up EndpointState interface, move checkpointing inside of EndpointState ( #13215 )
2021-01-08 22:36:19 -06:00
Philipp Moritz
c5ae30d1d4
Do not give an error if both RAY_ADDRESS
and address
is specified on initialization ( #13305 )
...
* Finalize handling of RAY_ADDRESS
* lint
2021-01-08 18:31:32 -08:00
Barak Michener
eb6f403b97
[ray_client]: first draft of documentation ( #13216 )
2021-01-08 15:38:36 -08:00
Ian Rodney
f916549602
[Cancellation] Make Test Cancel Easier to Debug ( #13243 )
...
* first commit
* lint-fix
2021-01-08 14:52:43 -08:00
Alex Wu
6ca4fb1054
[Pull manager] Only pull once per retry period ( #13245 )
...
* .
* docs
* cleanup
* .
* .
* .
* .
Co-authored-by: Alex <alex@anyscale.com>
2021-01-08 14:51:11 -08:00
Edward Oakes
66daed99f5
Remove top-level ray.connect() and ray.disconnect() APIs ( #13273 )
2021-01-08 15:26:20 -06:00
dependabot[bot]
300a22d8f7
[tune](deps): Bump gluoncv from 0.9.0 to 0.9.1 in /python/requirements ( #13287 )
2021-01-08 11:42:58 -08:00
dependabot[bot]
3569b78237
[tune](deps): Bump mlflow from 1.13.0 to 1.13.1 in /python/requirements ( #13286 )
2021-01-08 11:42:18 -08:00
Philipp Moritz
a247c71e2e
[ray_client] Add metadata to gRPC requests ( #13167 )
2021-01-07 23:58:15 -08:00