Commit graph

6176 commits

Author SHA1 Message Date
Ian Rodney
acbd12eabf
[Docker] Set Docker as the Default (#11416) 2020-10-19 10:53:30 -07:00
Max Fitton
f500292d41
[Dashboard] Turn on New Dashboard by Default (#11321) 2020-10-19 12:31:11 -05:00
Servon
202b1859ef
[tune] Update ZOOpt to better support the latest Ray (#11462)
Co-authored-by: Servon <zewen.li@polixir.ai>
2020-10-19 09:28:11 -07:00
Edward Oakes
798bd6a359
[dashboard] Add /api/cluster_status endpoint (#11456) 2020-10-19 11:00:47 -05:00
fangfengbin
da89cb19eb
[GCS]Fix node info idempotent bug (#11423)
* [GCS]Fix node info idempotent bug

* Fix review comment

Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-10-19 10:23:33 +08:00
Lee moon soo
8581dd2fb1
[Autoscaler] Staroid node provider followup improvements (#11408) 2020-10-18 00:26:11 -07:00
Lee moon soo
48b75a6922
[docker] latest -> nightly on docker image tag on build-docker.sh script (#11403) 2020-10-18 00:24:28 -07:00
Ian Rodney
f37e967ada
[Docs] READmes for DockerHub (#11350) 2020-10-16 15:46:13 -07:00
Ian Rodney
6d6a536601
[Docker] Fix Docker Stop on Teardown (#11400) 2020-10-16 15:45:33 -07:00
DK.Pino
6233cef22c
[Placement Group] Remove useless placement group argument in ray remote (#11389)
* remove useless placement_group argument in @ray_remote

* remove
2020-10-16 14:17:18 -07:00
Gekho457
50be2970dc
[autoscaler]Type hints for commands.py and sdk.py. (#11354) 2020-10-16 13:45:36 -07:00
Sven Mika
2aec77e305
[RLlib] Fix two test cases that only fail on Travis. (#11435) 2020-10-16 13:53:30 -05:00
Kai Fricke
f890808c14
[docs] Add xgboost_ray to community integrations (#11428) 2020-10-16 10:15:28 -07:00
Richard Liaw
7d1015d23e
[Doc] Fix Core Link (#11438)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-16 10:10:56 -07:00
Lingxuan Zuo
2f3ec4ef75
add streaming data writer unit tests (#11387) 2020-10-16 14:42:44 +08:00
mattearllongshot
049985549b
Restore uptime timeout to 5 seconds (#11300) 2020-10-15 17:21:11 -07:00
herve-alanaai
436202bcfd
[docs] Fix typos in documentation (#11414) 2020-10-15 17:00:48 -07:00
Ian Rodney
afd797b896
[docker] Check for GPUs before setting runtime-nvidia (#11418) 2020-10-15 15:43:09 -07:00
Amog Kamsetty
38eb61442b
[SGD] Callback API for SGD+Tune (#11316)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-15 15:22:14 -07:00
Sven Mika
414041c6dd
[RLlib] Do not create env on driver iff num_workers > 0. (#11307) 2020-10-15 18:21:30 +02:00
Sumanth Ratna
60a4be4a59
[tune] Remove metric and mode kwargs from create_searcher (#11335) 2020-10-14 21:44:36 -07:00
Sumanth Ratna
3fe757391b
[tune] Add Basic Variant Generator to search algorithm shim function (#11334)
* Add Basic Variant Generator

* Add 'random' key to SEARCH_ALG_IMPORT

Co-authored-by: Kai Fricke <kai@anyscale.com>

Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-10-14 21:41:47 -07:00
Vishnu Deva
00e0f14c6f
[tune] restore trials when sync_on_checkpoint is False (#11355)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-10-14 19:09:34 -07:00
Kai Fricke
f7120d2a18
[tune] Make metrics parameter optional in pytorch lightning integration (#11402) 2020-10-14 17:50:34 -07:00
SongGuyang
34191107a3
[cpp worker] fix crash in empty args task (#11363) 2020-10-14 16:48:34 -07:00
SangBin Cho
666fcde8ca
[Placement group] Input validation (#11152)
* Add a basic input validation.

* Addressed code review.
2020-10-14 13:56:41 -07:00
Ameer Haj Ali
a10e36ca04
Make the logging of gc.collect() freed refs appear in DEBUG not INFO (#11353) 2020-10-14 13:14:35 -07:00
Alex Wu
7466ce82df
[Autoscaler] Placement group autoscaling (#11243) 2020-10-14 13:11:46 -07:00
Eric Liang
aefcf901d3
[docs] Add sklearn integration link 2020-10-14 13:07:23 -07:00
SangBin Cho
b1481c6acf
Revert "[PlacementGroup]Add node manager test framework (#11174)" (#11398)
This reverts commit 241e765d3a.
2020-10-14 11:09:20 -07:00
Lingxuan Zuo
149ec5f6bf
[Log] dump stacktrace from glog lib (#11360)
* dump stacktrace from glog lib

* fix windows compile

* add comments for getcallstack
2020-10-14 10:52:12 -07:00
Kai Yang
abc6126814
[Java] Release actor instance reference when Ray.exitActor() is invoked (#11324) 2020-10-14 13:12:59 +08:00
fangfengbin
c926838411
[GCS]Fix GcsActorManagerTest multithreading bug (#11361)
Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-10-13 21:36:40 -07:00
Simon Mo
5637093f44
Add Serve load testing tool to long running test yaml (#11386) 2020-10-13 20:24:57 -07:00
Simon Mo
866193b01c
Fix cluster yaml for serve benchmarks (#11383)
- Separate out single node and multiple node yamls
- Remove cluster_synced_files, somehow it breaks for me
2020-10-13 19:30:18 -07:00
fangfengbin
241e765d3a
[PlacementGroup]Add node manager test framework (#11174)
* add part code

* add part code

* add part code

* add part code

* add part code

* add part code

* fix ut bug

* fix ut bug

Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-10-13 19:27:11 -07:00
Max Fitton
cd9dcfca0d
[Dashboard] CPU/GPU usage details in actor pane (#11269) 2020-10-13 20:23:23 -05:00
Amog Kamsetty
933cf6675c
[Tune] Changes for Pytorch Lightning 1.0 (#11375) 2020-10-13 15:50:11 -07:00
Sven Mika
a6a94d3206
[RLlib] Fix test_env_with_subprocess.py. (#11356) 2020-10-13 12:42:20 -07:00
J Seppänen
63fa0a53a3
[k8s] Fix kubernetes cloud cluster example configuration (#11364) 2020-10-13 12:28:55 -07:00
Ian Rodney
84617f6ff6
[docker] Script for quickly fixing all Latest images (#11351) 2020-10-13 09:36:40 -07:00
Simon Mo
39e809fa03
Update microbenchmark script to use Python 3.8 wheel (#11357) 2020-10-13 09:27:52 -07:00
fangfengbin
0c02427da2
[GCS]Eviction of destroyed actors cached in GCS (#11338) 2020-10-13 15:34:35 +08:00
Lingxuan Zuo
c84a9b457c
[Streaming] add barrier helper tests (#11107) 2020-10-13 09:55:55 +08:00
Ian Rodney
6426fb3fff
[CI] Fix-Up Docker Build (Use Python) (#11139) 2020-10-12 14:22:51 -07:00
Sven Mika
1ebcdf236f
[RLlib] Add support for custom MultiActionDistributions. (#11311) 2020-10-12 13:50:43 -07:00
Sven Mika
0c0f67c14d
[RLlib] ARS/ES eval workers not working: Issue 9933. (#11308) 2020-10-12 13:49:48 -07:00
Sven Mika
8ea1bc5ff9
[RLlib] Allow for more than 2^31 policy timesteps. (#11301) 2020-10-12 13:49:11 -07:00
Sven Mika
f5e2cda68a
[RLlib] SAC: log_alpha not being learnt when on GPU. (#11298) 2020-10-12 13:48:44 -07:00
Julius Frost
7dcfd258cd
[RLlib] Assert LongTensor in SAC Discrete PyTorch (#11245) 2020-10-12 13:47:21 -07:00