Commit graph

2444 commits

Author SHA1 Message Date
mehrdadn
50f6272fcc
Replace ps call with psutil (#8851)
* Replace ps call with psutil

* Minor formatting

Co-authored-by: Mehrdad <noreply@github.com>
Co-authored-by: Robert Nishihara <robertnishihara@gmail.com>
2020-06-09 14:21:19 -07:00
Sumanth Ratna
57212254e6
Fix dragonfly install instructions (#8866) 2020-06-09 13:05:04 -07:00
Richard Liaw
fc54dc8652
[tune] Make test_api faster (#8844) 2020-06-09 12:45:27 -07:00
Richard Liaw
d7b64ef279
[tune] BayesOpt - finish early when optimizer converges (#8808) 2020-06-09 12:09:39 -07:00
Simon Mo
6c3062906f
[Serve] Batching in Worker Replica (#8709) 2020-06-09 11:29:16 -07:00
mehrdadn
f93bb008bb
Change os.uname()[1] and socket.gethostname() to the portable and faster platform.node() (#8839)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-08 21:29:46 -07:00
Siyuan (Ryans) Zhuang
2f690d1866
Simplify plasma store config (#8823)
* simplify plasma store config
2020-06-08 20:47:22 -07:00
Kai Yang
db5cc5c8da
fix test_global_state_api due to the temporary object (#8800)
* fix test_global_state_api due to the temporary object

* update

* Update python/ray/tests/test_advanced_3.py

Co-authored-by: Robert Nishihara <robertnishihara@gmail.com>
2020-06-09 11:42:40 +08:00
Simon Mo
5e2bd6ecb9
Fix asyncio re-entry error message (#8842) 2020-06-08 17:43:01 -07:00
SangBin Cho
3388864768
[Core] Clean up detached actors (#8759) 2020-06-08 11:22:01 -05:00
fangfengbin
68718b33b4
GCS Server add SIGTERM signal handler (#8795) 2020-06-08 17:26:36 +08:00
mehrdadn
f68183d778
Error-checking for a couple of corruption issues (#8059)
* Extra error handling
* Handle connection closed in Redis monitor
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-07 15:43:00 +02:00
Edward Oakes
5d124489a9
[serve] Require backend when creating endpoint (#8764) 2020-06-06 21:10:42 -05:00
Ian Rodney
b07b4f2e55
Revert "[autoscaler] Create Docker Command Runner" (#8816)
This reverts commit 54189bca5a.
2020-06-06 14:21:44 -07:00
Stephanie Wang
b160b83d3e
[core] Queue subscription/unsubscription commands in the GCS (#8756)
* Only remove callback index if in map

* test

* Queue subscription commands

* lint

* Check status

* update

* update

* update

* Disable GCS restart tests

* lint
2020-06-05 19:49:19 -07:00
Ian Rodney
54189bca5a
[autoscaler] Create Docker Command Runner (#8806)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-06-05 17:29:27 -07:00
Alex Wu
2c485a2598
[autoscaler] Command runner interactivity (#7198) 2020-06-05 17:08:38 -07:00
Sven Mika
25c0974543
[RLlib] Issue 8412 (Adam vars not stored in ModelV2). (#8480) 2020-06-05 21:07:02 +02:00
krfricke
e62c1d2051
[tune] Use scientific notation in tune dashboard (#8782)
Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-06-05 10:41:07 -07:00
Anil Choudhary
1dda659918
[tune]TrialRunner wait on global checkpoint syncdown (#8798) 2020-06-05 10:39:59 -07:00
Edward Oakes
4155d5830f
[serve] Replace actor error retry logic with max_task_retries (#8768) 2020-06-05 10:45:28 -05:00
Amog Kamsetty
9410e5884d
[Tune] Parametrize Cloud Syncing Frequency (#8771) 2020-06-04 18:55:50 -07:00
Ian Rodney
d452932740
[autoscaler] Improve Logtimer log messages (#8753) 2020-06-04 18:07:27 -07:00
Ameer Haj Ali
d966d98729
cleanup to support provider's custom ssh command runner (#8720)
* cleanup to support provider's custom ssh command runner

* clean up

* trailing white spaces fix

* k8s signature fix

Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
2020-06-04 13:27:17 -07:00
Ian Rodney
09f89ff49d
[autoscaler] Improve SSH Command Failure Logging (#8751) 2020-06-04 12:38:20 -07:00
SangBin Cho
e372c06257
Hotfix dashboard broken tests. (#8757) 2020-06-04 09:44:00 -07:00
fangfengbin
84a8f2ccb5
Support reloading storage data when gcs server restarts (#8650) 2020-06-04 14:53:20 +08:00
Eric Liang
a24d117c68
[autoscaler] Refactor code in preparation for multi instance type support (#8632)
* wip refactor

* add util

* wip

* fix

* fix

* remove

* remove extraneous string type for sg
2020-06-03 12:53:55 -07:00
Ian Rodney
474bbc28bf
Warn if Autoscaling-config flag not set. (#8677) 2020-06-03 12:21:07 -07:00
Ian Rodney
7a2c9524d1
[Core] Randomize and 'Reserve' Port Generated for Node Manager (#8628) 2020-06-03 12:19:03 -07:00
Siyuan (Ryans) Zhuang
7fa64f2b24
Clean up unused Python code (#8755) 2020-06-03 12:09:19 -07:00
Max Fitton
b9f0f7ae5b
Dashboard minor refactor and first unit tests (#8705) 2020-06-03 11:04:55 -05:00
krfricke
f4ee3e76d8
[tune] last-n-avg
Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-06-02 20:06:04 -07:00
SangBin Cho
7c43991100
[GCS] Monitor.py bug fix (#8725)
* comment.

* Fix bugs.

* Used pubsub message instead.

* Added a ray.actors test
2020-06-02 16:06:36 -07:00
Edward Oakes
0306e4d589
[serve] Refer to serve "instances," not "clusters" (#8746) 2020-06-02 15:16:29 -07:00
Edward Oakes
2e82e05e4b
[serve] Add list_backends and list_endpoints (#8737) 2020-06-02 15:14:10 -07:00
Simon Mo
c5544eb070
[Async] Remove Monitor + Cleanup Code (#8691) 2020-06-02 14:11:16 -07:00
Edward Oakes
e91f095d98
[Serve] Remove ray_init_kwargs in serve.init (#8747) 2020-06-02 14:05:35 -07:00
krfricke
4d0e9f3c71
[Dashboard] Sort IDLE workers to bottom in dashboard (#8708)
* Sort IDLE workers to bottom in dashboard

* Fixed linting error

Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-06-02 14:00:59 -07:00
Stephanie Wang
aa06c3b15a
Eager eviction even when object pinning is disabled (#8561)
* Eager eviction even when object pinning is disabled, add regression test

* Make test more robust

* lint
2020-06-02 11:48:03 -07:00
Edward Oakes
ae312af435
Remove accidental passes in rllib, tune (#8742) 2020-06-02 12:29:17 -05:00
Edward Oakes
57bf0e43f0
fix docstring (#8736) 2020-06-02 08:55:20 -07:00
Lingxuan Zuo
4cbbc15ca7
[GCS] Global state accessor from node resource table (#8658) 2020-06-02 14:01:00 +08:00
Alec Brickner
207ab44129
Raise major version limit for msgpack (#8466) 2020-06-01 20:00:36 -07:00
Alex Wu
a2ec282033
[Doc] Dataset lint fix (#8719) 2020-06-01 19:43:06 -07:00
Simon Mo
4cef1ee591
[Serve] Cleanup Router Implementation (#8718) 2020-06-01 19:21:28 -07:00
Alex Wu
dcf58a43dc
[SGD] Dataset API (#7839) 2020-06-01 15:48:15 -07:00
SangBin Cho
cd5a207d69
[Dashboard] Frontend Lint Fix. (#8696) 2020-06-01 11:29:01 -07:00
fangfengbin
016337d4eb
Heartbeat table uses gcs pub-sub instead of redis accessor (#8655) 2020-05-30 23:17:25 +08:00
Siyuan (Ryans) Zhuang
ebea5c4111
Update cloudpickle to version 1.4.1 (#8577) 2020-05-29 17:55:48 -07:00