Kai Fricke
9352cb781c
[release tests] Fix microbenchmark base image, network overhead cluster wait time, add long running tests ( #16355 )
2021-06-16 21:37:17 +01:00
Jiao
c6436ba7d6
[Serve] Add ray serve's logging context manager ( #16468 )
...
* Add ray serve's logging context manager
* Add ray serve's logging context manager
run formatting script scripts/format.sh
* fix missing package-lock json file
* linter
Co-authored-by: Jiao Dong <jiaodong@anyscale.com>
2021-06-16 13:17:07 -07:00
Clark Zinzow
00eb833de2
[Core] Stopgap fix for async actor lost object bug, and adds reproduction as test. ( #16414 )
...
* Support asyncio with max_concurrency == 1.
* Added test that reproduces lost object error.
* Create a fiber thread per caller instead of sharing a fiber thread among all callers.
* Formatting.
* Remove debug print statement.
* Try to accomodate dumb stupid linter that apparently doesn't know that async list comprehensions landed in Python 3.6, let alone await in list literals.
2021-06-16 12:39:45 -07:00
SangBin Cho
5997d19a5a
[Test] Global gc unit test flakniess fix ( #16471 )
2021-06-16 09:26:04 -07:00
SangBin Cho
90599d3562
[Pubsub] Use a pubsub module for Ownership based object directory ( #16407 )
...
* in progress
* In progress 2
* progress
* OBOD pubsub done
* Fix
* Fix a bug.
* Clean up getObjectLocationOwner
* Fix a build issue.
* Lint issue.
* test fix in progress
* continue debugging
* in progress
* Fix issues again.
* Formatting
* formating
* fix issues.
* Revert "fix issues."
This reverts commit 2da577e68abc6278e03d64a60e8b96c3136145bf.
* Fix a critical bug.
* Revert "Revert "fix issues.""
This reverts commit 6546ecbd1eb9798de0bf990b30b85a3ca3e5b4ad.
* Addressed code review.
2021-06-16 09:15:13 -07:00
mwtian
2f7d535253
[Test] Use Ray client in XGBoost train_small release test ( #16319 )
2021-06-16 14:39:32 +01:00
qicosmos
0f87eca3e9
[C++ Worker]Generate a template project for users ( #16337 )
2021-06-16 17:45:45 +08:00
Ian Rodney
90805d302f
[Client] Fix ArgParse ( #16456 )
...
Co-authored-by: Ian Rodney <ilr@anyscale.com>
2021-06-15 23:52:02 -07:00
Antoni Baum
ec7d7c8630
[Tune] Add soft imports test ( #16450 )
2021-06-15 18:50:21 -07:00
Eric Liang
5967cd3cf3
Make placement_group=None work as expected. ( #16437 )
...
* update
* add task test
* fix
2021-06-15 18:30:53 -07:00
Antoni Baum
2fb10e6730
[SGD] Add support for native Torch AMP in SGD ( #16382 )
...
* SGD native AMP initial commit
* SGD native amp second pass
* Update docs
* Update TorchTrainer doc
* Temp fix release test
* Update release/sgd_tests/sgd_gpu/sgd_gpu_app_config.yaml
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2021-06-15 17:48:21 -07:00
Amog Kamsetty
ca22df2367
[Dask] Re-enable scheduler on dask_shuffle example ( #16405 )
2021-06-15 17:47:57 -07:00
Amog Kamsetty
d23494d25a
[CI] Move test_shuffle to Medium tests ( #16447 )
...
* move
* unskip test
2021-06-15 17:45:54 -07:00
architkulkarni
412085dea7
[Runtime Env] filter out post wheel tests from doc tests ( #16439 )
2021-06-15 15:34:45 -07:00
Amog Kamsetty
3bf8f94fa3
Skip test_shuffle on Travis ( #16449 )
2021-06-15 13:54:01 -07:00
Ruoyun Huang
562018b55a
[sgd] Use target label count as training batch size ( #16400 )
2021-06-15 12:09:51 -07:00
junfan.zhang
2abb1e1d38
Fix misleading tips in scripts.py ( #16426 )
2021-06-15 11:42:05 -07:00
Eric Liang
823c9af20d
Skip test_shuffle_hang on Windows
2021-06-15 11:21:22 -07:00
Antoni Baum
adb3b61c03
[tune] Fix ConcurrencyLimiter
batch mode never finishing if searcher limits concurrency itself ( #16416 )
2021-06-15 11:18:12 -07:00
Simon Mo
577fa08649
[Buildkite] Drop the duplicated Travis tests ( #16420 )
2021-06-15 10:43:17 -07:00
Alex Wu
ba9fd06f87
Integrate scalability envelope with releaser ( #16417 )
...
* .
* .
* .
* .
* .
* .
* .
* success
Co-authored-by: Alex Wu <alex@anyscale.com>
2021-06-15 10:42:55 -07:00
Amog Kamsetty
f3ad50fe6a
[SGD] Rename release tests ( #16410 )
...
Test failures unrelated
2021-06-15 17:16:40 +01:00
Eric Liang
1ef207abb6
Call Unblockifneeded ( #16422 )
2021-06-15 08:40:23 -07:00
Chong-Li
500248163f
[GCS] Fix: bookkeeping normal task resources in GCS ( #16371 )
2021-06-15 21:13:25 +08:00
Sven Mika
a5831f9429
[RLlib] Fix bandit example scripts and add all scripts to CI testing suite.
2021-06-15 13:30:31 +02:00
Sven Mika
d0014cd351
[RLlib] Policies get/set_state fixes and enhancements. ( #16354 )
2021-06-15 13:08:43 +02:00
SangBin Cho
f3ab162c5e
Fix nightly release test issues. ( #16419 )
2021-06-15 00:43:08 -07:00
Eric Liang
992437eafe
Yield plasma lock to other threads during long-running gets ( #16408 )
2021-06-14 16:23:05 -07:00
Simon Mo
5f4495108e
Fix macOS compilation ( #16412 )
2021-06-14 13:30:38 -07:00
Simon Mo
6763e95079
Revert "Revert "[Buildkite] cleanup artifacts directory between builds"" ( #16385 )
2021-06-14 11:30:55 -07:00
SangBin Cho
b4e2ca39f9
[Pubsub] Using OBOD command batch for both reference counting and wait for object eviction ( #16334 )
...
* In progress/
* Basic implementation for wait for object eviction done
* Port ref count
* Fixing tests.
* Fix unit testse and remove unnecessary code
* In progress with ref count test
* Command batch done.
* done.
* Add a implementation note
* Fix all issues.
* Addressed the first batch of code review.
* one last thing; fix unit test
* Fix all issues.
* Fix a type issue.
* Fix the type issue
2021-06-14 10:10:35 -07:00
Eric Liang
f93ca2b673
Make it much simpler to turn on event stats ( #16401 )
2021-06-14 09:51:24 -07:00
Amog Kamsetty
f9936c4252
[Dask] Dask Example Tests ( #16346 )
...
* add examples
* update dask docs
* add build file
* formatting
* fix ci command
* fix
* Update python/ray/util/dask/BUILD
* newline
* fix pytest fixtures
* fixes
* formatting
* fix shuffle example
2021-06-12 20:25:45 -07:00
Eric Liang
acb439e8f2
Prioritize get requests over wait request, and disallow overcommit of wait requests in unlimited allocation mode ( #16351 )
2021-06-12 14:06:43 -07:00
Qingyun Wu
dae3ac1def
[Tune] Add new searchers from FLAML ( #16329 )
2021-06-12 02:10:51 -07:00
Xianyang Liu
59f639f9db
[core] Fixes connect from worker node failed ( #16045 )
...
* fixes connect from worker node
* add UT
* fixes
* address comments
2021-06-11 18:51:46 -07:00
SangBin Cho
eb7344069b
[Test] Improving tests ( #16368 )
...
* Improve testing
* Fix tsets.
2021-06-11 18:29:22 -07:00
Chen Shen
24e409f948
[spilled object push optimization 3/3] ObjectManager Push from Spilled Object ( #16364 )
2021-06-11 15:57:51 -07:00
Philipp Moritz
ab092d901f
Increase redis connection timeout ( #16384 )
2021-06-11 15:57:35 -07:00
architkulkarni
be1129e04f
[Serve] Add tests for Serve quickstart with ray client ( #16344 )
2021-06-11 15:43:47 -07:00
Chris K. W
3fa9f2e5d6
[Modin] Add tests for modin ( #16260 )
...
Adds modin tests that run with and without ray client.
2021-06-11 12:23:33 -07:00
Simon Mo
2cdaf132b5
Revert "[Buildkite] cleanup artifacts directory between builds ( #16307 )" ( #16372 )
2021-06-11 08:33:26 -07:00
Kai Fricke
e8f8e9f328
[tune] Adjust searcher sample bounds to match Tune API ( #15899 )
2021-06-11 14:31:08 +01:00
matthewdeng
9c36ff81fa
[release] add golden notebook tests for dask/xgboost and modin/xgboost ( #16231 )
...
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-06-11 10:03:04 +01:00
Qing Wang
9a58e6a2e0
Remove unintended introduced words in comment. ( #16367 )
2021-06-11 13:35:42 +08:00
SongGuyang
379c87dbbb
[C++ worker] bugfix of incorrect flag name "redis_password" and add test to support connecting to external cluster ( #16219 )
2021-06-11 11:49:13 +08:00
Eric Liang
47bbca04be
Add fallback allocator stats to "ray memory" ( #16362 )
2021-06-10 18:33:59 -07:00
Simon Mo
8d56a36d28
[Buildkite] cleanup artifacts directory between builds ( #16307 )
2021-06-10 17:59:52 -07:00
Chen Shen
dd677f367e
[spilled object push optimization 2/3] Refactor ObjectManager's Push for integrating with SpilledObject ( #16352 )
2021-06-10 16:29:19 -07:00
Eric Liang
af11ec079a
update serve verbiage ( #16360 )
2021-06-10 16:23:38 -07:00