Sven Mika
d47ed62083
[Testing RLlib] Increase timeout for nightly multi-GPU learning tests. ( #17987 )
2021-08-23 10:37:50 +02:00
Kai Fricke
fca8af88d2
[release] Fix e2e environment variable passing from pipeline ( #18000 )
2021-08-23 09:26:37 +02:00
Chen Shen
e369ecab43
Fix EBS cold start in Mac ( #18001 )
2021-08-22 20:03:59 -07:00
Dmitri Gekhtman
13d5d0f9ef
[autoscaler][hotfix] Update node list after terminating unhealthy nodes ( #17992 )
...
* Update nodes; update test.
* consistency
* lint
2021-08-22 18:22:10 -04:00
Clark Zinzow
5ca28b1cc8
[Core] Update Bazel (to 3.4.1), gRPC, boringssl, and absl as a precursor to gRPC streaming PR. ( #17903 )
...
* Update Bazel (to 3.4.1), gRPC, boringssl, absl.
* Always reinstall Bazel if needing to upgrade to a new Bazel version.
* Add patch for properly detecting Windows Python headers when building gRPC.
* Add minimum Bazel version check.
* Update docs with new Bazel version.
2021-08-21 11:33:11 -07:00
Sven Mika
494ddd98c1
[RLlib] Replace "seq_lens" w/ SampleBatch.SEQ_LENS. ( #17928 )
2021-08-21 17:05:48 +02:00
Edward Oakes
b969aa3c80
[dashboard] Don't start dashboard agent when missing dependencies ( #17966 )
2021-08-21 01:04:21 -07:00
77loopin
c6b24fcb5d
[RayClient] Add the guide for k8s Ingress ( #17736 )
...
Co-authored-by: Dmitri Gekhtman <62982571+DmitriGekhtman@users.noreply.github.com>
Co-authored-by: seungjaebaek <seungjaebaek@linecorp.com>
2021-08-20 18:31:03 -07:00
Eric Liang
58e35a21b4
Add split_at_indices() ( #17990 )
2021-08-20 15:35:22 -07:00
Lixin Wei
05502da271
Add dispatch proxy to event tracker ( #17983 )
2021-08-20 15:32:10 -07:00
Chen Shen
dac1ba632e
[usability][rfc] ray status show demand summary by default ( #17892 )
2021-08-20 15:29:37 -07:00
Chris K. W
e3fb9650b2
[Client] Skip client object ref, actor handle, and actor ref dealloc/del if client package has already been cleaned up ( #17969 )
2021-08-20 15:18:43 -07:00
Chen Shen
31482563c2
[Test] fix-mac-test by avoiding cold start ( #17988 )
2021-08-20 15:04:29 -07:00
Edward Oakes
3ea5c0dc6b
[serve] Remove deprecated routing code (ServeStarletteRouter) ( #17986 )
2021-08-20 16:56:45 -05:00
Simon Mo
8236b7412e
[Serve] Mark serve.start beta API (instead of stable) ( #17956 )
2021-08-20 16:36:48 -05:00
SangBin Cho
cd42d30d7b
[Core] Removing GCS object directory from raylet ( #17962 )
2021-08-20 12:57:16 -07:00
Chen Shen
3dbb2e0020
change the way test run ( #17930 )
2021-08-20 11:26:16 -07:00
Chen Shen
89f988e9cc
add dataset shuffle data loader ( #17917 )
2021-08-20 11:26:01 -07:00
Edward Oakes
30541025e5
[serve] Remove deprecated APIs from code & docs ( #17754 )
2021-08-20 11:59:45 -05:00
Stephanie Wang
b8fe776638
[core] Fix inlined nested ids ( #17834 )
...
* test
* Use ObjectRef instead of ObjectID in nested refs
* java
* doc
* java
* build
* build
* x
* lint
* simplify
* fix
2021-08-20 08:58:29 -07:00
Amog Kamsetty
9416fce91b
[SGD] v2 Tune integration + iterator API ( #17839 )
...
* [SGD] implement SGD Trainer.to_tune_trainable
* address some comments
* add RESULT_DUPLICATE
* extract trainable creation logic out of Trainer
* add 1 CPU for driver
* use class attribute to fix serialization issues
* add examples
* add test for tune error
* tune
* test tune_linear
* run_iterator
* add to build file
* Update python/ray/util/sgd/v2/trainer.py
Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>
* Update python/ray/util/sgd/v2/trainer.py
Co-authored-by: matthewdeng <matthew.j.deng@gmail.com>
* address comments
* fix tests & address comments
* resolve merge
* lint
* fix
* add team tag to tests
* fix tests
* lint
Co-authored-by: Matthew Deng <matthew.j.deng@gmail.com>
2021-08-20 08:31:21 -07:00
simonsays1980
60aee4a330
[RLlib] Add example script for bare metal Policy with custom view_requirements
. ( #17896 )
2021-08-20 12:17:13 +02:00
Jingyu-Peng
40330ca439
Fix loading dynamic functions/classes when using code_search_path ( #17605 )
2021-08-20 17:24:11 +08:00
Antoni Baum
0a1228ef6e
Add configurable autosuspend for connect tests ( #17958 )
2021-08-20 10:57:41 +02:00
Sven Mika
8248ba531b
[RLlib] Redo #17410 : Example script: Remote worker envs with inference done on main node. ( #17960 )
2021-08-20 08:02:18 +02:00
Eric Liang
236b772465
Revert "[GCS] GCS Based Actor Scheduler ( #16580 )" ( #17941 )
...
This reverts commit a9b4545502
.
2021-08-19 21:46:52 -07:00
Eric Liang
661ac4e37b
Remove last traces of ref-counting flag ( #17932 )
2021-08-19 21:08:13 -07:00
architkulkarni
36c26578a7
[runtime env] [test] Add nightly test to verify Ray wheel URLs are valid ( #17938 )
2021-08-19 15:48:37 -07:00
Chen Shen
a16a25852a
[Core] fix event race condition ( #17947 )
2021-08-19 14:20:34 -07:00
matthewdeng
d081ee9d87
[SGD v2] Save checkpoints to disk ( #17807 )
...
* [SGD] save checkpoints to disk
* fix test; add logs
* rename log_dir to logdir for consistency with tune
* address comments: add run level directories, add CheckpointConfig
* check for empty strings
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
* address comments - refactor CheckpointStrategy, remove run_dir and checkpoint_dir configurability
* fix Trainer docs
* Update python/ray/util/sgd/v2/checkpoint.py
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
* remove construct_path_with_default
Co-authored-by: Antoni Baum <antoni.baum@protonmail.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-08-19 14:18:51 -07:00
Sven Mika
a2d96c513a
[RLlib] Expand machine for nightly multi-gpu learning tests. ( #17955 )
2021-08-19 22:27:30 +02:00
Eric Liang
238941f857
Ray workflow comparison examples + add to tests ( #17880 )
2021-08-19 12:19:08 -07:00
architkulkarni
5ed3f0ce35
[Serve] [Dashboard] Add end times and DELETED state for endpoints ( #17898 )
2021-08-19 11:10:42 -05:00
Kai Fricke
21d90a0e9a
Increase disk for serve tests ( #17606 )
2021-08-19 17:51:19 +02:00
Kai Fricke
651aae76b9
[release] Ask for configuration in buildkite ( #17948 )
2021-08-19 17:51:05 +02:00
Alex Wu
318ba6fae0
Revert "[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. ( #17410 )" ( #17951 )
...
This reverts commit 8fc16b9a18
.
2021-08-19 07:55:10 -07:00
Kai Fricke
622f724f61
Update release process ( #17888 )
2021-08-19 13:34:51 +02:00
souravraha
f5fcb3c576
Fixes bug #17424 . ( #17437 )
2021-08-19 12:23:36 +02:00
Sven Mika
8fc16b9a18
[RLlib] Add example script for how to have n remote (parallel) envs with inference happening on "main" (possibly GPU) node. ( #17410 )
2021-08-19 12:14:50 +02:00
Kai Fricke
0eee355d2e
Terminate session instead of stop ( #17946 )
2021-08-19 10:26:59 +02:00
Alex Wu
497446063c
[hotfix] Fix test owners lint ( #17945 )
...
Co-authored-by: Alex <alex@anyscale.com>
2021-08-18 23:41:58 -07:00
Chong-Li
5e22257cec
[GCS] Fix: GCS Based Actor Scheduler ( #17944 )
2021-08-18 23:40:35 -07:00
Clark Zinzow
d958457d07
[Core] Second pass at privatizing APIs. ( #17885 )
...
* gcs_utils
* resource_spec
* profiling
* ray_perf and ray_cluster_perf
* test_utils
2021-08-18 20:56:33 -07:00
architkulkarni
4c6a695dab
[Doc] Runtime env docstring fix monospace formatting ( #17929 )
2021-08-18 20:53:41 -07:00
Simon Mo
b573864928
[CI] Add test owners ( #17893 )
2021-08-18 18:38:31 -07:00
Eric Liang
a9073d16f4
Revert "[Core] Unified worker initiators ( #17401 )" ( #17935 )
...
This reverts commit c3764ffd7d
.
2021-08-18 18:06:24 -07:00
Chen Shen
89d83228f6
[Core][Plasma-store] add stats-collector that eagerly collect stats
2021-08-18 13:47:50 -07:00
Chong-Li
a9b4545502
[GCS] GCS Based Actor Scheduler ( #16580 )
2021-08-18 13:44:59 -07:00
Clark Zinzow
e2c7706f76
Add support for an app config override to the release test script, allowing better integration with compile-on-product. ( #17913 )
2021-08-18 13:35:27 -07:00
Yi Cheng
ddc2e59af5
[workflow] Simplify the workflow storage layer ( #17883 )
2021-08-18 13:26:50 -07:00