Commit graph

7560 commits

Author SHA1 Message Date
architkulkarni
dc2be14462
[Serve] Fix bug where create_backend fails in (base) conda env (#14633) 2021-03-15 10:06:41 -05:00
Sven Mika
4e17f95927
[RLlib] Unflake 2 test cases (SAC cont. cartpole). (#14620) 2021-03-15 14:03:54 +01:00
Tao Wang
c572563e1e
[large scale]Add enable sharding option and disable sharding for gcs client (#14600) 2021-03-15 19:35:00 +08:00
Siyuan (Ryans) Zhuang
b92531918e
Make use of C++14 'make_unique' (#14663) 2021-03-15 03:00:52 -07:00
Brian Yu
a65002514c
[Doc] Update Slurm documentation examples (#14673) 2021-03-15 00:27:13 -07:00
Kai Yang
8a0b117caa
[Java] minor fix (#14641) 2021-03-15 15:24:22 +08:00
Richard Liaw
c2aeccaf14
[tune] revert all mnist tests (#14677)
This reverts commit 3f557348a2.
2021-03-14 23:58:13 -07:00
Eric Liang
3bdcca7ee5
Add basic concurrency test for ray client (#14630) 2021-03-13 11:24:57 -08:00
Dmitri Gekhtman
6babd1928c
[Kubernetes][dashboard][minor] Fix uptime (#14655) 2021-03-12 18:30:13 -06:00
Edward Oakes
66be4801c6
Add deprecation warning to Counter.record() (#14622) 2021-03-12 17:04:28 -06:00
Eric Liang
b47036d014
Bump Ray client protocol version; fix dataclasses dependency for py 3.6 (#14654) 2021-03-12 14:58:34 -08:00
Tao Wang
3402b1752f
[GCS]Report job error to gcs instead of direct publishing (#14617)
* [GCS]Report job error to gcs instead of direct publishing

* fix compile
2021-03-12 14:54:08 -08:00
Richard Liaw
3f557348a2
[tune] re-enable MNIST tests! (#14561) 2021-03-12 13:35:43 -08:00
Eric Liang
2ba49c2701
Distinguish between grpc client and server events in asio metrics (#14637) 2021-03-12 11:13:59 -08:00
Dmitri Gekhtman
a90cffe26c
[dashboard][k8s] Better CPU reporting when running on K8s (#14593) 2021-03-12 12:02:15 -06:00
Sven Mika
ee4b6e7e3b
[RLlib] Unity3D example broken due to change in ML-Agents API. Attention-net prev-n-a/r. Attention-wrapper works with images. (#14569) 2021-03-12 18:27:25 +01:00
Raphael CHEN
c93961e070
[tune] Enable list of tuning hyperparameters in BOHB (#14487)
* [tune] Enable list of tuning hyperparameters in BOHB

* More concise code

Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>

* Add comment to `unflatten_list_dict`

* Fix lint

* Fix lint

* Add test for `unflatten_list_dict`

Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>
2021-03-12 09:22:44 -08:00
Kai Yang
f60bd3afee
[Java] some small improvements (#14565) 2021-03-12 12:26:55 +08:00
Edward Oakes
9cf328d616
[serve] Application-level batching initial commit (#14610) 2021-03-11 21:16:08 -06:00
Clark Zinzow
7b3102dd32
Add resource report lag warning. (#14611) 2021-03-11 17:29:45 -08:00
Eric Liang
ee2bf0f989
Improved object store memory behavior with respect /dev/shm size (#14629) 2021-03-11 17:29:06 -08:00
Edward Oakes
5e2a3df7cd
Allow returning an actor handle from a remote call (#13476) 2021-03-11 16:52:09 -06:00
Edward Oakes
8e778d6f42
[serve] Remove more Counter.record()s (#14628) 2021-03-11 12:54:38 -06:00
Eric Liang
4c1df378bb
Point load_package tests to ray-project GH instead of personal (#14605) 2021-03-11 10:46:36 -08:00
Michael Luo
020c9439dd
[RLlib] CQL Documentation + Tests (#14531) 2021-03-11 18:51:39 +01:00
architkulkarni
9b6d2ca345
[Core] Add runtime_env option to actor and task options, with conda_env (#14430) 2021-03-11 10:09:38 -06:00
Clark Zinzow
5a788474aa
[Core] First pass at privatizing non-public Python APIs. (#14607)
* async_compat

* utils

* cluster_utils

* compat

* function_manager

* import_thread

* memory_monitor

* monitor, log_monitor, ray_process_reaper

* metrics_agent

* parameter

* prometheus_exporter

* ray_logging

* signature
2021-03-10 22:47:28 -08:00
Eric Liang
081c960b59
Fix missing __init__ for wheels (#14615) 2021-03-10 18:13:58 -08:00
Jirka Fajfr
153dcd3734
Upgrading Redis to 6.0.10 in order to be functional on Apple silicon (M1) (#14299)
Co-authored-by: Jirka Fajfr <jirka.fajfr@enverus.com>
2021-03-10 15:17:01 -08:00
Yi Cheng
ad8e35b919
[ray] Update cpp to std14 (#14441) 2021-03-10 14:05:52 -08:00
Eric Liang
4e8b53b3d0
Add an experimental load_package API (#14552) 2021-03-10 13:13:49 -08:00
Clark Zinzow
566dcea56a
[Core] Added event loop metrics for posts. (#14546)
* Added event loop metrics for posts.

* io_context_proxy --> instrumented_io_context

* Fix feature flag, chrono-->absl, trim the stats, inline functions, reformat stats string.

* Make stats struct mutex plain lock instead of reader-writer lock.

* Mutex reader locking, std::array double braces initialization.

* Fix Bazel BUILD formatting.
2021-03-10 11:52:45 -08:00
Edward Oakes
8111ff5c3f
[serve] Use placement groups to bypass autoscaler throttling (#13844) 2021-03-10 13:33:44 -06:00
Edward Oakes
55a28cee52
[serve] Count -> Counter (#14571) 2021-03-10 11:59:44 -06:00
Stephanie Wang
0f3530da3b
[core] Only consider actual workers when killing idle workers (#14578) 2021-03-10 09:30:19 -08:00
Eric Liang
dcb22af50d
Use vendored cloudpickle (#14576) 2021-03-09 22:08:45 -08:00
burglarralgrub
dfcb9c356e
Remove the --java-worker-options parameter (#14563) 2021-03-10 10:49:31 +08:00
Stephanie Wang
b187693121
[docs] Fix links for installing wheels for a specific commit (#14572)
* Fix doc

* version
2021-03-09 16:55:03 -08:00
Alex Wu
e1fbb8489e
[core] Supress infeasible warning (#14068) 2021-03-09 16:37:56 -08:00
Richard Liaw
ea7d4c6607
[placement groups] fix gpu ids for bundles (#14574) 2021-03-09 15:11:59 -08:00
Hao Zhang
2505bc8aa9
[Collective] Ray CPU collectives now available (#14277)
Co-authored-by: YLJALDC <dal177@ucsd.edu>
Co-authored-by: Ezra-H <huangrh9@gmail.com>
Co-authored-by: Ezra-H <44772185+Ezra-H@users.noreply.github.com>
2021-03-09 15:02:16 -08:00
Yi Cheng
ed8935406b
[core] Minimal support for runtime env (#14270) 2021-03-09 11:53:58 -08:00
Alex Wu
ba6cebe30f
Raylet request resource report endpoint (#14291)
* .

* done?

* raylet side done?

* .

* .

* .

* client

* .

* fix tests

* make ci happy

* lint

* cleanup

* clang sucks

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-03-09 09:50:50 -08:00
Maxime RICHE
9a7fbd3cdf
[RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. (#14208) 2021-03-09 17:26:20 +01:00
Ian Rodney
6d5511cf80
Revert "reset memory for tasks and actors to 5% when cached memory ad…" (#14556)
This reverts commit 6f151ad510.
2021-03-09 08:19:55 -08:00
SongGuyang
134152937a
fix doc (#14555) 2021-03-09 18:57:03 +08:00
Qing Wang
29d5b110de
Update doc about installing Ray Java (#14383)
* Fix

* Update doc/source/installation.rst

Co-authored-by: Kai Yang <kfstorm@outlook.com>

* Update doc/source/installation.rst

Co-authored-by: Kai Yang <kfstorm@outlook.com>

* Update doc/source/walkthrough.rst

Co-authored-by: Kai Yang <kfstorm@outlook.com>

* Address comments.

* lint

Co-authored-by: Qing Wang <jovany.wq@antgroup.com>
Co-authored-by: Kai Yang <kfstorm@outlook.com>
2021-03-09 18:03:13 +08:00
Kai Fricke
43e098402a
[tune] make tune.with_parameters() work with the class API (#14532)
* [tune] make `tune.with_parameters()` work with the class API

* Update python/ray/tune/utils/trainable.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-09 09:36:17 +01:00
qicosmos
f2348a5456
[C++ worker] Add ray register part1 (#14436) 2021-03-09 13:57:17 +08:00
Yiran Wang
a06dc39d9f
[Autoscaler] Check if SSH is available every 5 sec, not 10 (#14484) 2021-03-08 20:58:21 -08:00