Raphael CHEN
c93961e070
[tune] Enable list of tuning hyperparameters in BOHB ( #14487 )
...
* [tune] Enable list of tuning hyperparameters in BOHB
* More concise code
Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>
* Add comment to `unflatten_list_dict`
* Fix lint
* Fix lint
* Add test for `unflatten_list_dict`
Co-authored-by: Sumanth Ratna <sumanthratna@gmail.com>
2021-03-12 09:22:44 -08:00
Kai Yang
f60bd3afee
[Java] some small improvements ( #14565 )
2021-03-12 12:26:55 +08:00
Edward Oakes
9cf328d616
[serve] Application-level batching initial commit ( #14610 )
2021-03-11 21:16:08 -06:00
Clark Zinzow
7b3102dd32
Add resource report lag warning. ( #14611 )
2021-03-11 17:29:45 -08:00
Eric Liang
ee2bf0f989
Improved object store memory behavior with respect /dev/shm size ( #14629 )
2021-03-11 17:29:06 -08:00
Edward Oakes
5e2a3df7cd
Allow returning an actor handle from a remote call ( #13476 )
2021-03-11 16:52:09 -06:00
Edward Oakes
8e778d6f42
[serve] Remove more Counter.record()
s ( #14628 )
2021-03-11 12:54:38 -06:00
Eric Liang
4c1df378bb
Point load_package tests to ray-project GH instead of personal ( #14605 )
2021-03-11 10:46:36 -08:00
Michael Luo
020c9439dd
[RLlib] CQL Documentation + Tests ( #14531 )
2021-03-11 18:51:39 +01:00
architkulkarni
9b6d2ca345
[Core] Add runtime_env option to actor and task options, with conda_env ( #14430 )
2021-03-11 10:09:38 -06:00
Clark Zinzow
5a788474aa
[Core] First pass at privatizing non-public Python APIs. ( #14607 )
...
* async_compat
* utils
* cluster_utils
* compat
* function_manager
* import_thread
* memory_monitor
* monitor, log_monitor, ray_process_reaper
* metrics_agent
* parameter
* prometheus_exporter
* ray_logging
* signature
2021-03-10 22:47:28 -08:00
Eric Liang
081c960b59
Fix missing __init__ for wheels ( #14615 )
2021-03-10 18:13:58 -08:00
Jirka Fajfr
153dcd3734
Upgrading Redis to 6.0.10 in order to be functional on Apple silicon (M1) ( #14299 )
...
Co-authored-by: Jirka Fajfr <jirka.fajfr@enverus.com>
2021-03-10 15:17:01 -08:00
Yi Cheng
ad8e35b919
[ray] Update cpp to std14 ( #14441 )
2021-03-10 14:05:52 -08:00
Eric Liang
4e8b53b3d0
Add an experimental load_package API ( #14552 )
2021-03-10 13:13:49 -08:00
Clark Zinzow
566dcea56a
[Core] Added event loop metrics for posts. ( #14546 )
...
* Added event loop metrics for posts.
* io_context_proxy --> instrumented_io_context
* Fix feature flag, chrono-->absl, trim the stats, inline functions, reformat stats string.
* Make stats struct mutex plain lock instead of reader-writer lock.
* Mutex reader locking, std::array double braces initialization.
* Fix Bazel BUILD formatting.
2021-03-10 11:52:45 -08:00
Edward Oakes
8111ff5c3f
[serve] Use placement groups to bypass autoscaler throttling ( #13844 )
2021-03-10 13:33:44 -06:00
Edward Oakes
55a28cee52
[serve] Count -> Counter ( #14571 )
2021-03-10 11:59:44 -06:00
Stephanie Wang
0f3530da3b
[core] Only consider actual workers when killing idle workers ( #14578 )
2021-03-10 09:30:19 -08:00
Eric Liang
dcb22af50d
Use vendored cloudpickle ( #14576 )
2021-03-09 22:08:45 -08:00
burglarralgrub
dfcb9c356e
Remove the --java-worker-options
parameter ( #14563 )
2021-03-10 10:49:31 +08:00
Stephanie Wang
b187693121
[docs] Fix links for installing wheels for a specific commit ( #14572 )
...
* Fix doc
* version
2021-03-09 16:55:03 -08:00
Alex Wu
e1fbb8489e
[core] Supress infeasible warning ( #14068 )
2021-03-09 16:37:56 -08:00
Richard Liaw
ea7d4c6607
[placement groups] fix gpu ids for bundles ( #14574 )
2021-03-09 15:11:59 -08:00
Hao Zhang
2505bc8aa9
[Collective] Ray CPU collectives now available ( #14277 )
...
Co-authored-by: YLJALDC <dal177@ucsd.edu>
Co-authored-by: Ezra-H <huangrh9@gmail.com>
Co-authored-by: Ezra-H <44772185+Ezra-H@users.noreply.github.com>
2021-03-09 15:02:16 -08:00
Yi Cheng
ed8935406b
[core] Minimal support for runtime env ( #14270 )
2021-03-09 11:53:58 -08:00
Alex Wu
ba6cebe30f
Raylet request resource report endpoint ( #14291 )
...
* .
* done?
* raylet side done?
* .
* .
* .
* client
* .
* fix tests
* make ci happy
* lint
* cleanup
* clang sucks
Co-authored-by: Alex Wu <alex@anyscale.com>
2021-03-09 09:50:50 -08:00
Maxime RICHE
9a7fbd3cdf
[RLlib] Add coin game env. Matrix social dilemma env. With tests and examples. ( #14208 )
2021-03-09 17:26:20 +01:00
Ian Rodney
6d5511cf80
Revert "reset memory for tasks and actors to 5% when cached memory ad…" ( #14556 )
...
This reverts commit 6f151ad510
.
2021-03-09 08:19:55 -08:00
SongGuyang
134152937a
fix doc ( #14555 )
2021-03-09 18:57:03 +08:00
Qing Wang
29d5b110de
Update doc about installing Ray Java ( #14383 )
...
* Fix
* Update doc/source/installation.rst
Co-authored-by: Kai Yang <kfstorm@outlook.com>
* Update doc/source/installation.rst
Co-authored-by: Kai Yang <kfstorm@outlook.com>
* Update doc/source/walkthrough.rst
Co-authored-by: Kai Yang <kfstorm@outlook.com>
* Address comments.
* lint
Co-authored-by: Qing Wang <jovany.wq@antgroup.com>
Co-authored-by: Kai Yang <kfstorm@outlook.com>
2021-03-09 18:03:13 +08:00
Kai Fricke
43e098402a
[tune] make tune.with_parameters()
work with the class API ( #14532 )
...
* [tune] make `tune.with_parameters()` work with the class API
* Update python/ray/tune/utils/trainable.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-09 09:36:17 +01:00
qicosmos
f2348a5456
[C++ worker] Add ray register part1 ( #14436 )
2021-03-09 13:57:17 +08:00
Yiran Wang
a06dc39d9f
[Autoscaler] Check if SSH is available every 5 sec, not 10 ( #14484 )
2021-03-08 20:58:21 -08:00
Dmitri Gekhtman
4a7d9e71bb
[dashboard][kubernetes] Show container's memory info on K8s, not the physical host's. ( #14499 )
...
* random doc typo
* more reasonable memory output
* no if
* get rid of comment
2021-03-08 18:59:41 -08:00
Edward Oakes
59221b2f31
[metrics] Standardize metrics.Count API to prometheus counter ( #14498 )
2021-03-08 20:47:46 -06:00
architkulkarni
505d2b6abe
[Serve] [Doc] Add small dashboard section under Serve Monitoring ( #14328 )
2021-03-08 20:41:42 -06:00
fyrestone
3616424f10
Disable dashboard tune module if pandas version is incorrect ( #14381 )
2021-03-08 20:40:59 -06:00
fyrestone
2da58bb021
[Dashboard] Fix reporter agent ( #14378 )
2021-03-08 13:12:34 -06:00
Ian Rodney
b6c4f21fda
fix docker build ( #14536 )
2021-03-08 09:33:26 -08:00
Edward Oakes
04c009712d
Revert "Revert "Support accessing underlying attributes in RayTaskErr… ( #14449 )
2021-03-08 11:04:10 -06:00
Sven Mika
732197e23a
[RLlib] Multi-GPU for tf-DQN/PG/A2C. ( #13393 )
2021-03-08 15:41:27 +01:00
Kai Fricke
b0bf44b154
[tune/docs] Add high level trial runner flow to documentation ( #14468 )
...
* [tune/docs] Add high level trial runner flow to documentation
* Apply suggestions from code review
2021-03-08 10:35:54 +01:00
Kai Yang
7977474899
[Core] Filter out dead nodes when getting address info from redis ( #14440 )
2021-03-08 15:48:26 +08:00
Edward Oakes
8e139046b9
[metrics] Remove unused unit field from cython classes ( #14497 )
2021-03-07 20:06:02 -06:00
Richard Liaw
dec3aa3453
Split tests for timeout ( #14516 )
2021-03-07 16:46:52 -08:00
Eric Liang
3fab5e2ada
Switch memory units to bytes ( #14433 )
2021-03-06 19:32:35 -08:00
Richard Liaw
5fc761c562
Fix test_advanced_3 timeout ( #14509 )
...
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-03-06 10:59:06 -08:00
EscapeReality846089495
33b271aa97
[tune] Fixed save_to_dir w/ os.replace ( #14510 )
...
The method save_to_dir of the class Searcher in ray.tune.suggest.suggestion.py uses the os.rename method to replace tmp_search_ckpt to current ckpt. os.rename method will raise the [WinError 183] or file exists error of other operating system. os.replace is the currect way.
2021-03-06 01:14:56 -08:00
Alex Wu
2395e25fc0
[hotfix][core] Load balancing spillback feature flag ( #14457 )
2021-03-05 16:45:33 -08:00