4610 commits

Author SHA1 Message Date
Philipp Moritz
ac912f0ce1
Allow using breakpoint() to drop into Ray debugger (#17025)
* Set PYTHONBREAKPOINT

* update tests

* update

* update docs

* fix docs

* skip ray functions

* ok

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* breakpoint() is only working in Python > 3.6

* add note

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-13 13:52:17 -07:00
Ian Rodney
fac6045c87
[GCP] Allow Head Node to Launch Workers with IAM Role (#17027) 2021-07-13 10:44:34 -07:00
Amog Kamsetty
38b5b6d24c
Revert "[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565)" (#17036)
This reverts commit e4123fff27.
2021-07-13 09:57:15 -07:00
Kai Fricke
27d80c4c88
[RLlib] ONNX export for tensorflow (1.x) and torch (#16805) 2021-07-13 12:38:11 -04:00
Edward Oakes
f7759fa484
[core] Add ray.util.list_actors() API (#16642) 2021-07-13 10:00:28 -05:00
Sven Mika
e4123fff27
[RLlib] Simplify multiagent config (automatically infer class/spaces/config). (#16565) 2021-07-13 06:38:14 -04:00
Ian Rodney
9cb80fcf17
[Client][Proxy] Handle Non-Default Redis Password (#16885) 2021-07-12 23:57:51 -07:00
Eric Liang
e7350ff828
Fix flaky test_plasma_unlimited::test_fallback_allocation_failure (#17016)
* fix

* fix catch
2021-07-12 20:17:23 -07:00
Siyuan (Ryans) Zhuang
a8b57c78d6
[Workflow] Workflow management - Part II (#16907) 2021-07-12 17:31:23 -07:00
Qing Wang
4bde71ca86
[Java][Core] Support get current actor handle. (#14900) 2021-07-12 15:27:54 -07:00
corentinmarek
24e00fcb1b
Add initialization for transport params for non s3 storage (#16054) 2021-07-12 10:47:49 -07:00
Edward Oakes
87e6f99b9c
[serve] Bump timeouts on test_deploy and re-enable (#16969) 2021-07-12 11:38:02 -05:00
Wansoo Kim
c9e8c12f8c
[Refactor] Minor Refactoring and Typing (#16964) 2021-07-12 15:37:07 +01:00
Tao Wang
eed0ffc6ff
[Core]Align storage of session_dir in java/python so it can be accessed u… (#16958)
* Align storage of session_dir in java/python so they can be accessed using internal kv manager

* align cpp
2021-07-12 17:42:13 +08:00
qicosmos
298d2afc35
[Ray Log] remove glog dependency (#16077) 2021-07-12 17:06:52 +08:00
gurunath
e3966f59e3
[tune] explicitly raising tune import Error “[tune]” (#16575)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-11 23:40:10 -07:00
Antoni Baum
0935ec30d0
[tune] Add information about environment variables to tune.run docstring (#16980) 2021-07-11 17:20:17 -07:00
Julius Frost
a88b217d3f
[rllib] Enhancements to Input API for customizing offline datasets (#16957)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-10 15:05:25 -07:00
Scott Graham
3334357c58
[autoscaler] [azure] Fix Azure Autoscaling Failures (#16640)
Co-authored-by: Scott Graham <scgraham@microsoft.com>
2021-07-10 11:55:00 -07:00
Nikita Vemuri
6d36d7ed7e
[Serve] Call FastAPIWrapper class constructor before startup hooks (#16941)
* run constructor before startup hooks

* address comments

Co-authored-by: Nikita Vemuri <nikitavemuri@nikitas-mbp.attlocal.net>
2021-07-09 09:39:32 -07:00
Dmitri Gekhtman
27a9ae5e13
[autoscaler][gcp] Retry GCP BrokenPipeError (#16952) 2021-07-08 13:54:29 -07:00
Maxim Egorushkin
9cb5c9a422
Never convert trial_id to float when loading progress.csv. (#16959)
* Never convert trial_id to float when loading progress.csv.

* Formatting updated.

Co-authored-by: Maxim Egorushkin <maxim.egorushkin@gmail.com>
2021-07-08 11:06:11 -07:00
SongGuyang
560fd15568
[C++ worker] support build and add C++ worker to python wheel (#16496) 2021-07-08 14:42:26 +08:00
Clark Zinzow
cc215353e2
[Datasets] Adds Dataset.iter_batches(). (#16853) 2021-07-07 22:01:20 -07:00
Frank Luan
7c0320175c
Actor fix (#16955) 2021-07-07 20:51:36 -07:00
Clark Zinzow
9358dd4bc2
[Datasets] Port JSON and CSV readers to datasource API. (#16938)
* Port JSON and CSV readers to datasource API.

* Formatting.

* Moved datasources to datasource dir, created shared FileBasedDatasource.

* Confirm that accessing dataset schema raises an error.

* Formatting.

* Return None for unknown metadata instead of raising an error.

* Feedback.
2021-07-07 20:32:04 -07:00
Kai Yang
e925051ce4
[Core] Get node to connect for driver in global state accessor (#16810) 2021-07-08 11:21:12 +08:00
Amog Kamsetty
3c482cd6c8
Skip more test_deploy tests on OSX (#16943)
* skip more

* skip more
2021-07-07 16:53:21 -07:00
Simon Mo
f4671d55d8
Bump log monitor's sleep duration to 0.1s (#16939)
We observed that in long-running serving scenarios the log monitor
consistently uses 10% of CPU when there are no new lines. This new
sleep duration should reduce that usage.
2021-07-07 15:41:34 -07:00
Chen Shen
0421fa188e
[core] use fallocate for fallback allocation to avoid SIGBUS (#16824) 2021-07-07 14:50:11 -07:00
Dmitri Gekhtman
2f42b0c4b9
[kubernetes] K8s keep gpu zero override (#16887) 2021-07-07 13:45:34 -07:00
Chen Shen
dbd3260141
[core] Deprecate QuotaAwareEvictionPolicy (#16911) 2021-07-07 13:44:41 -07:00
Eric Liang
3b9f6ccc5e
Remove autoinit from ray.data (#16925) 2021-07-07 13:44:10 -07:00
Amog Kamsetty
b79ef3ba0f
[Serve] Skip more test_deploy tests on OSX (#16937) 2021-07-07 10:44:01 -07:00
Antoni Baum
8f41a34079
[tune] Placement group manager fixes (#16844)
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-07-07 10:42:19 -07:00
Antoni Baum
b737b2a877
[joblib] Improved object store management for Pool (#16879)
* Improved object store management for Pool

* Update docs, hints

* Add test

* Nit

* Nit
2021-07-07 10:39:18 -07:00
Dmitri Gekhtman
c6497c6520
[client][test] Client multiprocessing tests + client api minor fix (#16904) 2021-07-07 09:47:27 -07:00
Eric Liang
03f99100ea
Enable ray auto init by default (#16861) 2021-07-06 21:56:32 -07:00
Eric Liang
ca083e16d4
[dataset] Fix conversion to pyarrow tables in several transforms (#16916) 2021-07-06 20:40:57 -07:00
Amog Kamsetty
7318a212fb
[Serve] Skip test_redeploy_multiple_replicas on OSX (#16915) 2021-07-06 18:58:36 -07:00
Eric Liang
7e52fde8a3
Fix num returns error message (#16865) 2021-07-06 14:57:26 -07:00
Stefan Schneider
d4babd69c1
[windows] correct symlinks for files (node.py) (#16817) 2021-07-06 10:01:13 -07:00
Dmitri Gekhtman
a27a8172cc
[autoscaler] Handle node type key change/deletion (#16691) 2021-07-06 09:06:58 -07:00
Kai Fricke
4178655ba7
[tune] Pass custom sync_to_cloud templates to durable trainables (#16739) 2021-07-06 09:50:59 +01:00
Eleven Liu
e250abf689
[tune] Sort top results by metric (#16576) 2021-07-06 08:59:31 +01:00
Eric Liang
4af36faea1
[docs] Cleanup workflow api.py pydoc and spell out ObjectRef for clarity (#16857)
* cleanup types

* docs

* clarify
2021-07-06 00:59:06 -07:00
Kai Yang
7c21be5450
[Object spilling] Clean up spilled objects on disk when Raylet starts (#16669) 2021-07-05 12:01:25 +08:00
Vince Jankovics
63ce4b4e97
[tune] Fix step for MLflow log_trial_result (#16840)
* Fix step for MLflow log_trial_result

* fix test

* lint

Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-07-03 10:06:45 -07:00
Yi Cheng
4bb3883a73
[dataset] deduct filesystem automatically (#16762) 2021-07-03 00:50:59 -07:00
Siyuan (Ryans) Zhuang
122bf309fa
[Workflow] Workflow management - Part I (#16838)
* refactoring

* share fate with the driver

* move TODOs to correct locations

* disable objectref test

* test raise exception when use object ref as inputs
2021-07-02 22:12:45 -07:00