Commit graph

7690 commits

Author SHA1 Message Date
Clark Zinzow
a86277a93c
[dask-on-ray] Fix Dask-on-Ray examples in docs (#14461) 2021-03-17 10:37:32 -07:00
Ian Rodney
10250d737f
[Autoscaler] Add tests around docker run options (#14713) 2021-03-17 10:13:51 -07:00
Edward Oakes
c781197755
[serve] Temporarily disable ray client test (#14733) 2021-03-17 08:48:05 -07:00
Edward Oakes
aab7ccc466
[serve] Deprecate client-based API in favor of process-wide singleton (#14696) 2021-03-17 09:39:54 -05:00
Sven Mika
69202c6a7d
[RLlib] Obsolete usage tracking dict via sample batch. (#13065) 2021-03-17 08:18:15 +01:00
Akash Patel
6e326cc239
upgrade setproctitle dep (#14538) 2021-03-16 21:58:36 -07:00
Ian Rodney
8a936ad64d
[Autoscaler Docs] Use worker_run_options (#14721)
Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>
2021-03-16 18:04:27 -07:00
Siyuan (Ryans) Zhuang
f30ac73640
update cloudpickle to commit 6e0f571 (#14693) 2021-03-16 12:36:43 -07:00
Ian Rodney
bd641a5e71
Revert "[Core] Added event loop metrics for posts. (#14546)" (#14692) 2021-03-16 10:38:45 -07:00
Edward Oakes
5a45e3351f
add Serve service by default (#14711) 2021-03-16 10:34:30 -07:00
Eric Liang
b240f5f0c9
Incremental refactor of runtime_env for consistency (#14632) 2021-03-16 10:11:50 -07:00
Sven Mika
78a134efa2
[RLlib] Add HowTo set env seed to our custom env example script. (#14471) 2021-03-16 08:12:27 +01:00
Tao Wang
897b84b300
[large scale]Add option for disable/enable context connection and disable asynchro… (#14596) 2021-03-16 15:09:13 +08:00
Edward Oakes
ae2c20c1ac
[serve] Include required and available resources in slow startup message (#14695) 2021-03-15 21:32:07 -05:00
Kathryn Zhou
01dda99b8c
Export cluster statistics to Prometheus (#14612) 2021-03-15 19:28:13 -07:00
Ian Rodney
d251bb676d
[Autoscaler] Get_Head_Node should return an up-to-date node (#14579) 2021-03-15 17:48:18 -07:00
Ian Rodney
4160f18e46
[Autoscaler] Remove Dangerous Default Values (#14694) 2021-03-15 17:47:40 -07:00
Xianyang Liu
c3d8ef1b49
[autoscaler] Auto detect memory resource + P0 bug fix in k8s (#14567)
* auto detect memory resources

* format

* fixes

* fixes

* fixes

* fixes

* fixes

* address comments and fix ci

* address comments

* address comments

* corrent comments

* fixes
2021-03-16 02:32:54 +02:00
Ezra-H
eda01f9f8c
[Collective] distributed cpu tests (#14591)
Co-authored-by: Hao Zhang <zhisbug@gmail.com>
2021-03-15 16:30:57 -07:00
Edward Oakes
72615ae590
[metrics] Improve custom metrics docs, add an example on how to use them (#14690) 2021-03-15 17:37:02 -05:00
Edward Oakes
f1a42c4744
[serve] [ray_client] Basic test for using Ray client with Serve (#14163) 2021-03-15 15:30:32 -07:00
Mark
14562a2b90
Fix the docstring for ray.cancel (#14670)
* Fix the docstring for ray.cancel

* Update python/ray/worker.py

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
2021-03-15 15:16:17 -07:00
Edward Oakes
9f1421e3a3
[metrics] Standardize histogram to Prometheus API (#14627) 2021-03-15 14:37:56 -05:00
architkulkarni
8b17ec7c6d
[Serve] Disable auto conda env setting if using Ray Client (#14672) 2021-03-15 14:36:46 -05:00
DK.Pino
ef0c91f605
[Placement Group] [Doc] Fix PG doc display problem. (#14665) 2021-03-15 11:56:05 -07:00
Edward Oakes
d90cd545d1
[serve] Deprecate system-level batching with warning, update the docs (#14648) 2021-03-15 13:47:01 -05:00
Edward Oakes
dda3ab0161
[metrics] Cleanup package ref (#14658) 2021-03-15 13:00:57 -05:00
Cassidy Laidlaw
ef5adc1476
Include py.typed in MANIFEST.in to make sure it is packaged in wheels (#14492) 2021-03-15 10:43:12 -07:00
Edward Oakes
c409b5b63a
[metrics] Standardize Gauge to Prometheus interface (#14626) 2021-03-15 12:15:26 -05:00
Ian Rodney
63d77e0d4c
[Testing] Deflake test_object_spilling (#14678) 2021-03-15 10:12:53 -07:00
architkulkarni
dc2be14462
[Serve] Fix bug where create_backend fails in (base) conda env (#14633) 2021-03-15 10:06:41 -05:00
Sven Mika
4e17f95927
[RLlib] Unflake 2 test cases (SAC cont. cartpole). (#14620) 2021-03-15 14:03:54 +01:00
Tao Wang
c572563e1e
[large scale]Add enable sharding option and disable sharding for gcs client (#14600) 2021-03-15 19:35:00 +08:00
Siyuan (Ryans) Zhuang
b92531918e
Make use of C++14 'make_unique' (#14663) 2021-03-15 03:00:52 -07:00
Brian Yu
a65002514c
[Doc] Update Slurm documentation examples (#14673) 2021-03-15 00:27:13 -07:00
Kai Yang
8a0b117caa
[Java] minor fix (#14641) 2021-03-15 15:24:22 +08:00
Richard Liaw
c2aeccaf14
[tune] revert all mnist tests (#14677)
This reverts commit 3f557348a2.
2021-03-14 23:58:13 -07:00
Eric Liang
3bdcca7ee5
Add basic concurrency test for ray client (#14630) 2021-03-13 11:24:57 -08:00
Dmitri Gekhtman
6babd1928c
[Kubernetes][dashboard][minor] Fix uptime (#14655) 2021-03-12 18:30:13 -06:00
Edward Oakes
66be4801c6
Add deprecation warning to Counter.record() (#14622) 2021-03-12 17:04:28 -06:00
Eric Liang
b47036d014
Bump Ray client protocol version; fix dataclasses dependency for py 3.6 (#14654) 2021-03-12 14:58:34 -08:00
Tao Wang
3402b1752f
[GCS]Report job error to gcs instead of direct publishing (#14617)
* [GCS]Report job error to gcs instead of direct publishing

* fix compile
2021-03-12 14:54:08 -08:00
Richard Liaw
3f557348a2
[tune] re-enable MNIST tests! (#14561) 2021-03-12 13:35:43 -08:00
Eric Liang
2ba49c2701
Distinguish between grpc client and server events in asio metrics (#14637) 2021-03-12 11:13:59 -08:00
Dmitri Gekhtman
a90cffe26c
[dashboard][k8s] Better CPU reporting when running on K8s (#14593) 2021-03-12 12:02:15 -06:00
Sven Mika
ee4b6e7e3b
[RLlib] Unity3D example broken due to change in ML-Agents API. Attention-net prev-n-a/r. Attention-wrapper works with images. (#14569) 2021-03-12 18:27:25 +01:00
Raphael CHEN
c93961e070
[tune] Enable list of tuning hyperparameters in BOHB (#14487)
* [tune] Enable list of tuning hyperparameters in BOHB

* More concise code

Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>

* Add comment to `unflatten_list_dict`

* Fix lint

* Fix lint

* Add test for `unflatten_list_dict`

Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>
2021-03-12 09:22:44 -08:00
Kai Yang
f60bd3afee
[Java] some small improvements (#14565) 2021-03-12 12:26:55 +08:00
Edward Oakes
9cf328d616
[serve] Application-level batching initial commit (#14610) 2021-03-11 21:16:08 -06:00
Clark Zinzow
7b3102dd32
Add resource report lag warning. (#14611) 2021-03-11 17:29:45 -08:00