Kai Fricke
7d1e6d3129
[ci/release] Add sanity check for ray wheels hash to release tests ( #18489 )
2021-09-10 17:50:31 +01:00
Kai Fricke
be438fb600
[release] Also download Ray CPP wheels ( #18383 )
2021-09-10 17:49:37 +01:00
Chris K. W
6f94d0f3c9
[client] Use application specific error code to propagate ray errors ( #18278 )
...
* Raise decoded exception if generated by grpc lib
* Switch to missing client_id error to FAILED_PRECONDITION
* switch to ABORTED
* fix comment
* fix decode_exception comment
2021-09-10 09:49:03 -07:00
Sven Mika
3f89f35e52
[RLlib] Better error messages and hints; + failure-mode tests; ( #18466 )
2021-09-10 16:52:47 +02:00
Ameer Haj Ali
ead02b21b9
[client] Fix exception error message ( #18485 )
2021-09-10 14:34:31 +03:00
Guyang Song
03a2c69a8a
Don't add ray-cpp wheel to extras by default ( #18251 )
2021-09-10 09:56:51 +01:00
xwjiang2010
ae689ecc6b
[tune] Add optional Experiment to Searcher/SearchAlgo. ( #17724 )
2021-09-10 09:30:18 +01:00
Edward Oakes
2fcfea10b3
[runtime_env] Move URI deletion logic to the agent, remove util worker code ( #18471 )
2021-09-10 00:13:32 -07:00
Yi Cheng
f2d8f23fb6
[workflow] Define default __getstate__ and __setstate__ ( #18459 )
2021-09-09 23:04:00 -07:00
Yi Cheng
965c55fe1b
[workflow] set max retry to 3 ( #18477 )
2021-09-09 23:03:24 -07:00
qicosmos
dd096c8e73
[C++ Worker]Fix abi issue ( #18273 )
2021-09-10 11:53:05 +08:00
SangBin Cho
7b2ed4c1f8
[Placement group] Placement group scheduling hangs due to creation/removal race condition ( #18419 )
2021-09-09 20:39:01 -07:00
SangBin Cho
688dbeb4cb
Revert "[cpp] Upgrade cpp from 14 -> 17 ( #18455 )" ( #18480 )
...
This reverts commit ccc16a46bb.
2021-09-09 16:47:19 -07:00
matthewdeng
e66f154b14
[release] increase torch_tune_serve timeout to 20 min ( #18481 )
2021-09-09 16:31:14 -07:00
Chen Shen
5f57079041
use clang for C++ debug testing ( #18343 )
2021-09-09 15:48:36 -07:00
Amog Kamsetty
d3d8120db3
[SGD] Fix shutdown hang on macOS Python 3.7 ( #18473 )
2021-09-09 15:32:52 -07:00
Eric Liang
4d2065352b
Increase dataset read parallelism by default ( #18420 )
2021-09-09 15:07:49 -07:00
Yi Cheng
ccc16a46bb
[cpp] Upgrade cpp from 14 -> 17 ( #18455 )
2021-09-09 12:09:21 -07:00
Kai Fricke
395976c8a1
[tune] Never block for results ( #18391 )
...
* [tune] Never block for results
* Fix tests
* Block in tests
* Add comment to test
2021-09-09 12:08:00 -07:00
architkulkarni
0126837868
[ray client] [runtime env] Print error logs in driver upon connection failure ( #18451 )
2021-09-09 13:50:55 -05:00
Simon Mo
d477fd7205
[Serve] Make @ingress accept any ASGI app ( #15464 )
2021-09-09 13:49:37 -05:00
Simon Mo
51bb7c6da8
[Serve] Allow method redefinition for FastAPI ( #18453 )
2021-09-09 13:48:58 -05:00
Edward Oakes
791abd4f04
[serve] Remove root_url from start docstring ( #18472 )
2021-09-09 13:46:56 -05:00
Zhi Lin
2fcd1bcb4b
[Dataset] implement from_spark, to_spark and some optimizations ( #17340 )
2021-09-09 11:43:47 -07:00
SangBin Cho
fdd52106bf
[Placement group] Do not report ready task demand ( #18463 )
2021-09-09 11:42:12 -07:00
Nikita Vemuri
0f562874b9
[serve] Use environment variable for root_url from runtime env ( #18269 )
2021-09-09 12:12:35 -05:00
Dominic Ming
97f71e15d4
[Dashboard] new dashboard event page for API Server event module ( #18330 )
2021-09-09 19:43:48 +08:00
mwtian
26fd10c9e8
[CI] Add clang-tidy to lint ( #18124 )
...
* clang-tidy
* fix
* fix script
* test clang compiler
* fix clang-tidy rules
* Fix windows and other issues.
* Fix
* Improve information when running check-git-clang-tidy-output.sh on different OS
2021-09-09 00:41:53 -07:00
Sven Mika
8a066474d4
[RLlib] No Preprocessors; preparatory PR #1 ( #18367 )
2021-09-09 08:10:42 +02:00
Sven Mika
1520c3d147
[RLlib] Deepcopy env_ctx for vectorized sub-envs AND add eval-worker-option to Trainer.add_policy() ( #18428 )
2021-09-09 07:10:06 +02:00
qicosmos
ba0084e9c7
[C++ Worker]Add gcs global state accessor ( #17976 )
2021-09-09 12:08:08 +08:00
Lixin Wei
df803cee98
Revert "Revert "[Core] Fix ServerCall Leaking ( #17863 )" ( #18410 )" ( #18424 )
2021-09-08 19:55:06 -07:00
architkulkarni
5affb074aa
[Test] deflake test_runtime_env.py::test_no_spurious_worker_startup ( #17809 )
2021-09-08 16:35:08 -07:00
Clark Zinzow
c0ea2755a0
Fix iter_batches dropping batches when prefetching. ( #18441 )
2021-09-08 15:37:38 -07:00
Clark Zinzow
6fc91fd47e
Create directory on write if it doesn't exist. ( #18435 )
2021-09-08 15:31:06 -07:00
Simon Mo
6d24214085
[Release] Make sure to uninstall ray for rllib_tests ( #18448 )
2021-09-08 23:29:40 +01:00
Edward Oakes
f0555f88d6
[runtime_env] Move worker process startup logic to context ( #18341 )
2021-09-08 17:08:27 -05:00
Antoni Baum
dd6abed6ce
[tune] Fix an edge case where DurableTrainable would not delete checkpoints in remote storage ( #18318 )
...
Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-09-08 15:00:09 -07:00
Sven Mika
cd22a7d1bb
[RLlib] Add locking to PolicyMap in case it is accessed by a RolloutWorker and the same worker's AsyncSampler or the main LearnerThread. ( #18444 )
2021-09-08 23:32:23 +02:00
gjoliver
50cdf551ce
[RLlib] Fix test name typo. ( #18423 )
...
Co-authored-by: Jun Gong <jungong@mbpro.local>
2021-09-08 23:30:37 +02:00
gjoliver
808b683f81
[RLlib] Add a unittest for learning rate schedule used with APEX agent. ( #18389 )
2021-09-08 23:29:40 +02:00
Ian Rodney
c91e0eb065
[Dashboard] Increase Actor Snapshot Size ( #18433 )
2021-09-08 12:06:33 -07:00
Lixin Wei
052ed115e7
[Core] Make It Easier to Grep Debug State Dump ( #18382 )
...
* add keyword to debug dump
* fix
2021-09-08 12:03:54 -07:00
Yi Cheng
6011d4197f
[nightly] Add many_nodes_actor_test to nightly test ( #18406 )
2021-09-08 11:15:48 -07:00
Yi Cheng
7126d01c91
[core] upgrade gtest ( #18288 )
...
* up
* up
* format
* up
* flaky fix
* format
* up
* up
* format
* add debug
* up
* up
* up
* up
* up
* format
* fix
* format
* up
* up
* format
2021-09-08 11:15:34 -07:00
Sven Mika
45f60e51a9
[RLlib] DDPPO fixes and benchmarks. ( #18390 )
2021-09-08 19:39:01 +02:00
Sasha Sobol
f76f14fedf
[client] pass _credentials down from init ( #18425 )
2021-09-08 10:30:26 -07:00
Clark Zinzow
b30c41759d
[Datasets] Adds tensor column support (tensors-in-tables) via Pandas/Arrow extension types/arrays. ( #18301 )
2021-09-08 10:09:01 -07:00
mwtian
e427e4a467
Fix flakiness in test_proxy_manager_internal_kv ( #18416 )
2021-09-08 15:46:45 +03:00
Kai Fricke
dac3a8bc8e
[setup] Upstream conda patches ( #17575 )
...
Co-authored-by: Vasilij Litvinov <vasilij.n.litvinov@intel.com>
2021-09-08 10:37:17 +01:00