Commit graph

6138 commits

Author SHA1 Message Date
Kai Yang
cbc5dac6c4
[Java] fix shutdown lock typo (#11410) 2020-10-20 22:31:27 +08:00
Lingxuan Zuo
aed739fbf4
[Log] Ignore callstacktrace test for windows (#11413) 2020-10-20 15:21:29 +08:00
DK.Pino
15cbfae8ad
fix cross language ut (#11492) 2020-10-20 14:14:40 +08:00
DK.Pino
1b3b009f7a
[PlacementGroup]Add guarded by in placement group scheduler ut (#11306)
* add GUARDED_BY for success_placement_groups_ and failure_placement_groups_ vector

* update lint

* update lint

* update logical

* update lint

* change int to unsigned int

* update lint

* rename vector_mutex_ to placement_group_requests_mutex_

* resolve comment

* add int() for windows
2020-10-19 18:54:35 -07:00
Max Fitton
0a9cc9cce5
Revert "remove .fake build files (#11478)" (#11488)
This reverts commit 3ed3dea004.
2020-10-19 18:48:32 -07:00
Simon Mo
1bce7518f3
[Serve][Doc] Add FAQ Page (#11258) 2020-10-19 17:15:22 -07:00
Lee moon soo
09932ee72b
[docs] correct docker image name in Launch Ray in Docker section (#11475) 2020-10-19 16:01:03 -07:00
Kai Fricke
ed81010b0d
[tune] support hierarchical search spaces for hyperopt (#11431)
* support hierarchical search spaces for hyperopt

* Reduce num samples

* Fix prefix
2020-10-19 16:00:08 -07:00
Max Fitton
3ed3dea004
remove .fake build files (#11478)
Co-authored-by: Max Fitton <max@semprehealth.com>
2020-10-19 15:36:47 -07:00
Kai Fricke
2f74fe5b71
[tune/docs] Add PTL example to tune docs/examples (#11474) 2020-10-19 14:47:58 -07:00
Max Fitton
79fd1cfe6f
Update dashboard build instructions for new dashboard (#11471)
Co-authored-by: Max Fitton <max@semprehealth.com>
2020-10-19 13:12:22 -07:00
Sumanth Ratna
4946c1e6de
[docker] Minor improvements (#11352) 2020-10-19 12:42:53 -07:00
Ian Rodney
acbd12eabf
[Docker] Set Docker as the Default (#11416) 2020-10-19 10:53:30 -07:00
Max Fitton
f500292d41
[Dashboard] Turn on New Dashboard by Default (#11321) 2020-10-19 12:31:11 -05:00
Servon
202b1859ef
[tune] Update ZOOpt to better support the latest Ray (#11462)
Co-authored-by: Servon <zewen.li@polixir.ai>
2020-10-19 09:28:11 -07:00
Edward Oakes
798bd6a359
[dashboard] Add /api/cluster_status endpoint (#11456) 2020-10-19 11:00:47 -05:00
fangfengbin
da89cb19eb
[GCS]Fix node info idempotent bug (#11423)
* [GCS]Fix node info idempotent bug

* Fix review comment

Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-10-19 10:23:33 +08:00
Lee moon soo
8581dd2fb1
[Autoscaler] Staroid node provider followup improvements (#11408) 2020-10-18 00:26:11 -07:00
Lee moon soo
48b75a6922
[docker] latest -> nightly on docker image tag on build-docker.sh script (#11403) 2020-10-18 00:24:28 -07:00
Ian Rodney
f37e967ada
[Docs] READmes for DockerHub (#11350) 2020-10-16 15:46:13 -07:00
Ian Rodney
6d6a536601
[Docker] Fix Docker Stop on Teardown (#11400) 2020-10-16 15:45:33 -07:00
DK.Pino
6233cef22c
[Placement Group] Remove useless placement group argument in ray remote (#11389)
* remove useless placement_group argument in @ray_remote

* remove
2020-10-16 14:17:18 -07:00
Gekho457
50be2970dc
[autoscaler]Type hints for commands.py and sdk.py. (#11354) 2020-10-16 13:45:36 -07:00
Sven Mika
2aec77e305
[RLlib] Fix two test cases that only fail on Travis. (#11435) 2020-10-16 13:53:30 -05:00
Kai Fricke
f890808c14
[docs] Add xgboost_ray to community integrations (#11428) 2020-10-16 10:15:28 -07:00
Richard Liaw
7d1015d23e
[Doc] Fix Core Link (#11438)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-16 10:10:56 -07:00
Lingxuan Zuo
2f3ec4ef75
add streaming data writer unit tests (#11387) 2020-10-16 14:42:44 +08:00
mattearllongshot
049985549b
Restore uptime timeout to 5 seconds (#11300) 2020-10-15 17:21:11 -07:00
herve-alanaai
436202bcfd
[docs] Fix typos in documentation (#11414) 2020-10-15 17:00:48 -07:00
Ian Rodney
afd797b896
[docker] Check for GPUs before setting runtime-nvidia (#11418) 2020-10-15 15:43:09 -07:00
Amog Kamsetty
38eb61442b
[SGD] Callback API for SGD+Tune (#11316)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-15 15:22:14 -07:00
Sven Mika
414041c6dd
[RLlib] Do not create env on driver iff num_workers > 0. (#11307) 2020-10-15 18:21:30 +02:00
Sumanth Ratna
60a4be4a59
[tune] Remove metric and mode kwargs from create_searcher (#11335) 2020-10-14 21:44:36 -07:00
Sumanth Ratna
3fe757391b
[tune] Add Basic Variant Generator to search algorithm shim function (#11334)
* Add Basic Variant Generator

* Add 'random' key to SEARCH_ALG_IMPORT

Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-10-14 21:41:47 -07:00
Vishnu Deva
00e0f14c6f
[tune] restore trials when sync_on_checkpoint is False (#11355)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-14 19:09:34 -07:00
Kai Fricke
f7120d2a18
[tune] Make metrics parameter optional in pytorch lightning integration (#11402) 2020-10-14 17:50:34 -07:00
SongGuyang
34191107a3
[cpp worker] fix crash in empty args task (#11363) 2020-10-14 16:48:34 -07:00
SangBin Cho
666fcde8ca
[Placement group] Input validation (#11152)
* Add a basic input validation.

* Addressed code review.
2020-10-14 13:56:41 -07:00
Ameer Haj Ali
a10e36ca04
Make the logging of gc.collect() freed refs appear in DEBUG not INFO (#11353) 2020-10-14 13:14:35 -07:00
Alex Wu
7466ce82df
[Autoscaler] Placement group autoscaling (#11243) 2020-10-14 13:11:46 -07:00
Eric Liang
aefcf901d3
[docs] Add sklearn integration link 2020-10-14 13:07:23 -07:00
SangBin Cho
b1481c6acf
Revert "[PlacementGroup]Add node manager test framework (#11174)" (#11398)
This reverts commit 241e765d3a.
2020-10-14 11:09:20 -07:00
Lingxuan Zuo
149ec5f6bf
[Log] dump stacktrace from glog lib (#11360)
* dump stacktrace from glog lib

* fix windows compile

* add comments for getcallstack
2020-10-14 10:52:12 -07:00
Kai Yang
abc6126814
[Java] Release actor instance reference when Ray.exitActor() is invoked (#11324) 2020-10-14 13:12:59 +08:00
fangfengbin
c926838411
[GCS]Fix GcsActorManagerTest multithreading bug (#11361)
Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-10-13 21:36:40 -07:00
Simon Mo
5637093f44
Add Serve load testing tool to long running test yaml (#11386) 2020-10-13 20:24:57 -07:00
Simon Mo
866193b01c
Fix cluster yaml for serve benchmarks (#11383)
- Separate out single node and multiple node yamls
- Remove cluster_synced_files, somehow it breaks for me
2020-10-13 19:30:18 -07:00
fangfengbin
241e765d3a
[PlacementGroup]Add node manager test framework (#11174)
* add part code

* add part code

* add part code

* add part code

* add part code

* add part code

* fix ut bug

* fix ut bug

Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-10-13 19:27:11 -07:00
Max Fitton
cd9dcfca0d
[Dashboard] CPU/GPU usage details in actor pane (#11269) 2020-10-13 20:23:23 -05:00
Amog Kamsetty
933cf6675c
[Tune] Changes for Pytorch Lightning 1.0 (#11375) 2020-10-13 15:50:11 -07:00