fangfengbin
0e3bbbeba3
[Test] Try deflaking gcs server test by adding log ( #15332 )
...
Co-authored-by: 灵洵 <fengbin.ffb@antgroup.com>
2021-04-15 21:16:09 -07:00
Richard Liaw
dc80d9f42a
[flaky] fix mnist ptl data cache ( #15344 )
...
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-04-15 16:24:17 -07:00
SangBin Cho
a54d69f535
[Test] Split long runtime env tests. ( #15340 )
...
* [Test] Split long runtime env tests.
* Addressed code review.
2021-04-15 14:28:28 -07:00
SangBin Cho
1d87e4447d
[Test] increase the test size of test io that consistenly times out ( #15341 )
2021-04-15 14:02:41 -07:00
Siyuan (Ryans) Zhuang
4de1f35b3e
run_function_on_all_workers
only once in the driver (#15203 )
2021-04-15 13:58:36 -07:00
Richard Liaw
eaa3ce3f40
Fix release test -- client remote put ( #15325 )
...
* fix-test
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
* fix
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
* Update python/ray/util/client/server/dataservicer.py
* Update python/ray/util/client/server/dataservicer.py
* Update python/ray/_private/ray_client_microbenchmark.py
2021-04-15 13:30:38 -07:00
Kai Fricke
1c783e2eeb
[tune] Allow 0 CPU head bundles in for placement group factories ( #15338 )
2021-04-15 20:21:35 +01:00
SangBin Cho
4dd4756c09
[Test] skip flaky pg tests. ( #15337 )
2021-04-15 11:55:19 -07:00
SangBin Cho
df9329160e
[Tests] Dask on ray release test ( #15256 )
...
* done.
* Linting.
* Update readme
* Update.
* Fix issues.
2021-04-15 10:30:17 -07:00
Sven Mika
8b3554e37e
[RLlib] Remove all (already soft-deprecated) SampleBatch.data
from code. ( #15335 )
2021-04-15 19:19:51 +02:00
Sven Mika
c90de315e5
[RLlib] APEX returns incorrect default resources (PleacementGroupFactory) colocated missing replay actors. ( #15295 )
2021-04-15 16:50:42 +01:00
Sven Mika
e961d2f4b2
[RLlib] Improve example scripts for attention nets, CartPole LSTM, and custom RNN-models. ( #15329 )
2021-04-15 16:11:34 +02:00
Sven Mika
45d6560759
[RLlib] Fix flakey custom_fast_model_torch/tf tests. ( #15330 )
2021-04-15 16:10:29 +02:00
Ameer Haj Ali
981fa5829a
[client] Enable ClientObjectRef Comparisons ( #15320 )
2021-04-15 16:46:44 +03:00
SangBin Cho
c2e240e866
[Doc] Update object spilling doc ( #15301 )
2021-04-14 23:38:04 -07:00
Simon Mo
57b6053cda
[Buildkite] Turn off Travis linux builds (except wheels) ( #15316 )
...
* [Buildkite] Turn off Travis linux builds (except wheels)
* naming
2021-04-14 20:37:37 -07:00
Siyuan (Ryans) Zhuang
b81d805f40
[Doc] fix ray client doc ( #15308 )
2021-04-14 20:35:15 -07:00
Yi Cheng
a9402c21e6
Revert "Revert "[runtime_env] Add support of exclusion ( #15241 )" ( #15303 )" with fixing ( #15310 )
...
* Revert "Revert "[runtime_env] Add support of exclusion (#15241 )" (#15303 )"
This reverts commit 775deca5ad
.
* fix
2021-04-14 20:34:53 -07:00
Stephanie Wang
6b2da7eda8
[core] Log warning on bad max task args value ( #15314 )
2021-04-14 20:34:08 -07:00
SangBin Cho
27ab0c7633
[Test] Skip the failing rllib example test. ( #15321 )
2021-04-14 20:19:44 -07:00
Simon Mo
5f0be94989
[Buildkite] Use the build link for Travis Tracker ( #15317 )
2021-04-14 18:58:23 -07:00
SangBin Cho
d0e83c43ca
[Release Test] Modify parameter to reduce stress ( #15048 )
...
* Fix.
* Fix.
2021-04-14 18:27:20 -07:00
SangBin Cho
e0bbfaf87e
[Log] Fix log monitor issue. ( #15302 )
2021-04-14 18:11:24 -07:00
Yi Cheng
0caf96be94
Take care of failed killing request ( #15313 )
2021-04-14 18:07:10 -07:00
Charles
82e730078f
[autoscaler] Converting assert False into useful exceptions. ( #15306 )
2021-04-14 16:16:37 -07:00
Simon Mo
c4b1985a5b
[Serialization] Pydantic -> serialization_addons.py
and Ray Client support. ( #15181 )
2021-04-14 15:21:13 -07:00
Simon Mo
5289690d1c
[Buildkite] Fix Bazel Logs Upload ( #15285 )
2021-04-14 12:47:31 -07:00
SangBin Cho
775deca5ad
Revert "[runtime_env] Add support of exclusion ( #15241 )" ( #15303 )
...
This reverts commit 359b5ce06b
.
2021-04-14 11:58:53 -07:00
Richard Liaw
59bf3a7b22
ray[cluster] -> ray[default] ( #15251 )
2021-04-14 09:37:04 -07:00
Antoni Baum
b93bd9bef4
[tune] Set correct Optuna TrialState on trial complete ( #15283 )
2021-04-14 15:59:23 +01:00
Sven Mika
bbfa8ffec9
[RLlib] Minor release 1.3 warnings cleanups. ( #15272 )
2021-04-14 14:03:15 +02:00
Sven Mika
ef0f163d16
[RLlib] Discussion 1709: IMPALA (tf and torch) reports sum of entropy (over batch) in stats. Should report mean instead. ( #15290 )
2021-04-14 11:44:25 +02:00
Kai Fricke
aaa14d63a7
[tune] deflake test_convergence, add seed
parameter to OptunaSearch ( #15248 )
...
* De-flake optuna convergence test
* Even higher threshold
* Add `seed` parameter to OptunaSearch
2021-04-14 01:06:49 -07:00
wanxing
0ad0839265
Optimize lambda copy to improve direct call performance. ( #15036 )
2021-04-14 11:02:49 +08:00
Edward Oakes
4ed7a14e23
[serve] Support normal args and kwargs for deployments ( #15172 )
2021-04-13 16:20:50 -05:00
Richard Liaw
f4b2dd94b2
[tune] Cache MNIST and restore MNIST tests ( #15260 )
2021-04-13 14:20:26 -07:00
Simon Mo
7c734c207a
[Buildkite] Upload Bazel Logs to Bucket ( #15259 )
2021-04-13 14:16:42 -07:00
Yi Cheng
359b5ce06b
[runtime_env] Add support of exclusion ( #15241 )
2021-04-13 15:55:12 -05:00
Ian Rodney
d145ad94e4
[Client] Add metadata to Terminate Calls to make ray.kill() and ray.cancel() work ( #15221 )
2021-04-13 23:24:54 +03:00
Ian Rodney
ec3d5f2ef1
[client] Handle ray.put
failures ( #15229 )
2021-04-13 11:23:16 -07:00
Edward Oakes
0f9d1bb223
Serve failure release test fix ( #15276 )
...
This test is currently not tested in CI
2021-04-13 17:49:29 +01:00
Sven Mika
5254d2fb36
[RLlib] Support parallelizing evaluation and training (optional). ( #15040 )
2021-04-13 09:53:35 +02:00
Clark Zinzow
05d99c9432
[dask-on-ray] Don't leak a global enabling of client mode in Dask callback test. ( #15257 )
...
* Don't leak a global enabling of client mode in Dask callback test.
* Enable and disable client_mode_enabled, not _client_hook_enabled.
2021-04-12 22:00:30 -07:00
SangBin Cho
9197552802
Temporarily disable flaky tests. ( #15250 )
2021-04-12 14:09:07 -07:00
Simon Mo
8bf4b37877
[Hotfix] Pin dm-tree
package version ( #15249 )
...
`dm-tree` released a new version https://pypi.org/project/dm-tree/#history
and it depends on `bazel` to build from source, and it conflicts with
our current bazel setup (this conflict is non-trivial to fix).
2021-04-12 13:50:49 -07:00
Clark Zinzow
95659987a4
[Core] Event loop instrumentation - manual instrumentation hooks, instrumentation for deadline timer and local stream socket. ( #15144 )
...
* Added manual hooks in event loop instrumentation.
* Added instrumentation of the deadline timer in the periodical runner.
* Added instrumentation of the local stream socket in the ClientConnection.
* Addressed feedback except for opaque handle.
* Switch to opaque stats handle API.
* Add opaque stats handle destructor check to ensure that RecordExecution is called.
* Revert "Add opaque stats handle destructor check to ensure that RecordExecution is called."
This reverts commit 62cf8fca670d78c1160f0a9526b6cbe6e3a25725.
* Apply suggestions from code review
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
* Other feedback, fixes for code suggestions.
* Prevent handler stats from leaking queueing stats when handler execution is never recorded.
* Enable event loop instrumentation.
* Revert "Enable event loop instrumentation."
This reverts commit df90c504e45e1963dc2ef6c3197dc5c965bc19e7.
* Reorg GCS client and IO context member fields to prevent use-after-free.
Co-authored-by: SangBin Cho <rkooo567@gmail.com>
2021-04-12 13:39:00 -07:00
Ameer Haj Ali
580b2bb9cc
[metrics.Histogram] improve error message ( #15240 )
2021-04-12 21:28:21 +03:00
Tao Wang
4c9eee609c
Revert "Revert "[GCS]Increase heartbeat interval to reduce pressure o… ( #15207 )
...
* Revert "Revert "[GCS]Increase heartbeat interval to reduce pressure on gcs server (#14203 )" (#15194 )"
This reverts commit a9ac4ad890
.
* optimize wait condition to avoid flakey test
* remove unnecessary sleep
2021-04-12 10:45:42 -07:00
Richard Liaw
56c95075d1
Revert "[tune] enable mnist test v3 ( #15198 )" ( #15242 )
...
This reverts commit d913f32126
.
2021-04-12 09:27:55 -07:00
Hao Chen
10ff2f3b4a
Fix duplicate destruction of CoreWorkerProcess instance ( #15245 )
2021-04-12 21:01:21 +08:00