Kai Fricke
a8dbc44f9a
[ci] minimal dependency install test ( #18071 )
2021-08-31 15:26:25 +02:00
Chen Shen
5f3ec7634b
Fix off by one test bug ( #18239 )
2021-08-31 00:07:03 -07:00
Clark Zinzow
e154f87cab
Added split_at_indices to DatasetPipeline. ( #18243 )
2021-08-31 00:06:35 -07:00
Eric Liang
db9b5f142d
Disable worker logs temporarily during driver breakpoints ( #18192 )
2021-08-30 20:26:16 -07:00
Stephanie Wang
8e06db7280
Revert "[Core] revert: revert Unified worker starter ( #18008 )" ( #18228 )
...
This reverts commit b9978dd02b
.
2021-08-30 17:28:41 -07:00
Yi Cheng
7a65815108
[workflow] Defer input preparation until run ( #18225 )
2021-08-30 16:37:34 -07:00
Antoni Baum
5be6bda4cf
[tests] Add Ludwig CI test ( #18126 )
2021-08-30 12:27:39 -07:00
Eric Liang
1adce7da4e
Revert "Auto discover dashboard agent port ( #17855 )" ( #18217 )
...
This reverts commit 53ddb551d5
.
2021-08-30 10:46:37 -07:00
Yi Cheng
f579822790
[workflow] Workflow inside virtual actor ( #18066 )
2021-08-30 10:40:22 -07:00
Chen Shen
7631d042bb
[Test] increase timeout for object spilling test caused by EBS cold storage issue ( #18200 )
2021-08-30 00:28:26 -07:00
SangBin Cho
0e968c1e82
[Core] Reduce spilling threshold ( #17910 )
...
* Lower the threshold
* ip
* Handle test failure
* lint
* last fix
* .
* Retry
2021-08-30 00:09:35 -07:00
fyrestone
53ddb551d5
Auto discover dashboard agent port ( #17855 )
2021-08-30 12:06:28 +08:00
Stephanie Wang
7bc1ef0dd9
[core] Prestart workers up to available CPU limit ( #18166 )
...
* Prestart workers according to num available CPUs
* lint
* Prestart min(available CPU, backlog)
* Fix test, adjust policy
* debug
* retry
* lint
2021-08-29 14:11:53 -07:00
Yi Cheng
d5cd95364b
[workflow] Some usability issues fixing ( #18133 )
2021-08-28 16:51:00 -07:00
Amog Kamsetty
3b77840c1b
PyTorch Lightning Updates ( #17876 )
2021-08-27 23:15:51 -07:00
Antoni Baum
e7bbadb920
[tune] Extend Tune Callback API ( #17794 )
...
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-08-27 18:05:12 -07:00
Antoni Baum
714193ce6f
[SGDv2] Tensorboard Callback ( #17824 )
...
* [SGD] save checkpoints to disk
* fix test; add logs
* Extend SGDv2 callback API
* Move json file creation to JsonLoggerCallback
* TBXLoggerCallback
* Simplify, fix linear example
* rename log_dir to logdir for consistency with tune
* Add test
* Fix
* Break up logging classes
* Fix error
* Update type hint for results
* Refactor
Co-authored-by: Matthew Deng <matthew.j.deng@gmail.com>
2021-08-27 17:50:26 -07:00
Eric Liang
95b5ad12ba
Initial version of workflow documentation ( #18138 )
2021-08-27 16:20:48 -07:00
Jiao
c7e38ceb10
[serve] Better constructor failure handling ( #16922 )
2021-08-27 18:05:22 -05:00
mwtian
26679d62c5
[Core][ObjectRef] Change default to not record call stack during ObjectRef creation ( #18078 )
2021-08-27 15:45:34 -07:00
Clark Zinzow
c0598de82a
[Datasets] Port write APIs to use file-based datasources. ( #18135 )
2021-08-27 15:24:54 -07:00
Chen Shen
28e6ae5ce0
[Test] fix object spilling 2 ( #18141 )
2021-08-27 13:52:42 -07:00
Clark Zinzow
aee7ba2510
[Datasets] Add from_numpy() and to_numpy() APIs ( #18146 )
2021-08-27 13:33:11 -07:00
Yi Cheng
ed7124663a
[workflow] Fix nested workflow with catch exception bug ( #18145 )
2021-08-27 10:53:15 -07:00
Chen Shen
feeb20e920
[CI][rfc] Fix flaky test_multi_node:test_cleanup_on_driver_exit
2021-08-27 10:51:01 -07:00
Tao Wang
7620afb8be
[Deploy]Don't start shard redis in local if we specify external redis. ( #17856 )
...
* Don't start shard redis in local if we specify external redis
* lint
* reuse primary as shard
* add test
* lint
* lint
* lint
2021-08-27 16:45:09 +08:00
Antoni Baum
56089ae926
[tune] Add max_concurrent_trials
argument to tune.run
( #17905 )
...
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-08-27 09:12:50 +02:00
xwjiang2010
cc45d3a725
[tune] Update trial resources on resume. ( #17975 )
...
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-08-27 09:12:18 +02:00
Eric Liang
d52ffd926e
Add task / actor name to driver log prefix ( #18105 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-08-26 16:18:04 -07:00
Edward Oakes
6fa05ed708
[runtime_env] Better error message for working_dirs that exceed the max size ( #18092 )
2021-08-26 15:21:12 -05:00
Edward Oakes
3dc3f6102f
[serve] Remove unused ServeRequest codepath ( #18120 )
2021-08-26 15:08:00 -05:00
Chen Shen
a7365b74e6
[CI][easy] run test_nested_id both inlined and from plasma store. ( #18081 )
2021-08-26 10:32:06 -07:00
Edward Oakes
c16e730f0e
[runtime_env] Better error message when ray client server fails to start ( #18088 )
2021-08-26 11:35:11 -05:00
Edward Oakes
83bfbf3b59
[runtime_env] Fix warning message to not say ray.connect() ( #18084 )
2021-08-26 11:34:44 -05:00
Edward Oakes
0c5f7a698d
[core] Add type hints to public APIs in worker.py ( #18049 )
2021-08-26 09:51:44 -05:00
Edward Oakes
07c05e16fa
[serve] Remove unused HTTP method metadata & matching ( #18079 )
2021-08-26 09:49:49 -05:00
Edward Oakes
0dd75c2f87
[serve] Skip namespace test in test_standalone on windows ( #18093 )
2021-08-26 09:48:30 -05:00
Kai Fricke
34cf5db109
[tune] Fix hyperopt points to evaluate for nested lists ( #18113 )
2021-08-26 14:34:22 +02:00
Raphael CHEN
edac59f18f
[tune] correctly validate initial points for random search ( #17282 )
...
Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-08-26 10:24:51 +02:00
xwjiang2010
3fa20962cf
[Tune] Fix local mode. Add explicit concurrency limiter for local mode. ( #18023 )
...
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-08-26 09:31:40 +02:00
Chen Shen
a29b157e2e
[core] better error message for lost objects ( #18068 )
2021-08-26 00:03:29 -07:00
Clark Zinzow
52bd0e1737
[Datasets] Drop empty tables when reading Parquet pieces. ( #18098 )
2021-08-26 00:02:08 -07:00
Eric Liang
e1f69ceb5e
Add documentation for DatasetPipeline.from_iterable ( #18106 )
2021-08-25 22:31:23 -07:00
Yi Cheng
f66c51de2d
[workflow] Support named step in workflow ( #17915 )
2021-08-25 19:51:45 -07:00
Clark Zinzow
ab0a638411
[Datasets] Add pyarrow minimum version check. ( #18082 )
2021-08-25 15:59:12 -07:00
Clark Zinzow
db1bdd28fc
Add pyarrow filesystem test coverage for Parquet reading. ( #18087 )
2021-08-25 15:58:57 -07:00
Edward Oakes
406dc0b458
[core] Disallow empty string namespace + add validation tests ( #18048 )
2021-08-25 15:37:02 -05:00
Eric Liang
451c501bbd
[doc] Add API stability annotations to developer guide ( #18052 )
2021-08-25 13:34:43 -07:00
SangBin Cho
c2d488a504
[Usability] Add backpressure to log monitor ( #17774 )
...
* Add backpressure to log monitor
* Added a test and addressed code review.
* Addressed code review.
2021-08-25 13:07:18 -07:00
Sasha Sobol
fcb044d47c
[autoscaler] make 0 default min/max workers for head node ( #17757 )
...
* make 0 default min/max workers for head node
* fix helm charts, test, defaults for head
* fix test, docs
* make 0 default min/max workers for head node
* fix helm charts, test, defaults for head
* fix test, docs
* comments. logging
* better wording (logs)
Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>
* fix logging message
* fix max workers in raycluster.yaml
* use default values of 0 for min/max workders in a helm chart
* add missing line back
Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>
2021-08-25 14:56:20 -04:00