ZhuSenlin
c39658f368
fix removal of task dependencies ( #13333 )
...
Co-authored-by: senlin.zsl <senlin.zsl@antfin.com>
2021-01-11 09:55:48 -08:00
Edward Oakes
62e1ad3973
[serve] Cleanup backend state, move checkpointing and async goal logic inside ( #13298 )
2021-01-11 11:45:43 -06:00
Sven Mika
5d50d37f45
[RLlib] Issue 13330: No TF installed causes crash in ModelCatalog.get_action_shape()
( #13332 )
2021-01-11 13:19:46 +01:00
Edward Oakes
93006c2ba5
Use wait_for_condition to reduce flakiness in test_queue.py::test_custom_resources ( #13210 )
2021-01-10 19:32:59 -06:00
Barak Michener
6f0083ed10
add the method annotation and a comment explaining what's happening ( #13306 )
...
Change-Id: I848cc2f0beaed95340d9de7cca19a50c78d9da9a
2021-01-10 15:54:10 -08:00
Akash Patel
94a873fc4d
remove empty extras streaming deps ( #12933 )
2021-01-10 12:09:27 -08:00
Kai Fricke
d4b0a9fadf
[tune] convert search spaces: parse spec before flattening ( #12785 )
...
* Parse spec before flattening
* flatten after parse
* Test for ValueError if grid search is passed to search algorithms
2021-01-09 18:21:49 +01:00
Sven Mika
9dd9f72111
[RLlib] Add more detailed Documentation on Model building API ( #13261 )
2021-01-09 12:38:29 +01:00
Michael Luo
67229bf350
[RLlib] SlateQ Documentation ( #13266 )
2021-01-09 11:21:51 +01:00
Edward Oakes
d434ba6518
[serve] Clean up EndpointState interface, move checkpointing inside of EndpointState ( #13215 )
2021-01-08 22:36:19 -06:00
Philipp Moritz
c5ae30d1d4
Do not give an error if both RAY_ADDRESS
and address
is specified on initialization ( #13305 )
...
* Finalize handling of RAY_ADDRESS
* lint
2021-01-08 18:31:32 -08:00
Barak Michener
eb6f403b97
[ray_client]: first draft of documentation ( #13216 )
2021-01-08 15:38:36 -08:00
Ian Rodney
f916549602
[Cancellation] Make Test Cancel Easier to Debug ( #13243 )
...
* first commit
* lint-fix
2021-01-08 14:52:43 -08:00
Alex Wu
6ca4fb1054
[Pull manager] Only pull once per retry period ( #13245 )
...
* .
* docs
* cleanup
* .
* .
* .
* .
Co-authored-by: Alex <alex@anyscale.com>
2021-01-08 14:51:11 -08:00
Edward Oakes
66daed99f5
Remove top-level ray.connect() and ray.disconnect() APIs ( #13273 )
2021-01-08 15:26:20 -06:00
dependabot[bot]
300a22d8f7
[tune](deps): Bump gluoncv from 0.9.0 to 0.9.1 in /python/requirements ( #13287 )
2021-01-08 11:42:58 -08:00
dependabot[bot]
3569b78237
[tune](deps): Bump mlflow from 1.13.0 to 1.13.1 in /python/requirements ( #13286 )
2021-01-08 11:42:18 -08:00
Sven Mika
6f342a2221
[RLlib] Preparatory PR for: Documentation on Model Building. ( #13260 )
2021-01-08 10:56:09 +01:00
Philipp Moritz
a247c71e2e
[ray_client] Add metadata to gRPC requests ( #13167 )
2021-01-07 23:58:15 -08:00
Hao Chen
77cd0d5a21
Fix a crash problem caused by GetActorHandle in ActorManager ( #13164 )
2021-01-08 12:11:08 +08:00
fyrestone
a6d135a072
[Dashboard] Add GET /log_proxy API ( #13165 )
2021-01-08 11:45:07 +08:00
Tao Wang
ab2229dcb7
[GCS] Remove old lightweight resource usage report code path ( #13192 )
2021-01-08 10:30:00 +08:00
Ian Rodney
4aef3d6836
[docker] Pull if image is not present ( #13136 )
2021-01-07 17:17:00 -08:00
Amog Kamsetty
0f5d36ce5e
[Dependabot] Add Dependabot ( #13278 )
...
Co-authored-by: Ian <ian.rodney@gmail.com>
2021-01-07 16:33:02 -08:00
Amog Kamsetty
43f70faa25
[Tune] Pin Tune Dependencies ( #13027 )
...
Co-authored-by: Ian <ian.rodney@gmail.com>
2021-01-07 14:03:06 -08:00
Amog Kamsetty
f68922d043
[Tune] Improve error message for Session Detection ( #13255 )
...
* Improve error message
* log once
2021-01-07 22:40:44 +01:00
Sven Mika
a5b39ef8e2
[RLlib] Fix missing "info_batch" arg (None) in compute_actions
calls. ( #13237 )
2021-01-07 21:25:02 +01:00
Simon Mo
c32ad2fef5
[Release] Use ray-ml image for logn running test ( #13267 )
2021-01-07 10:31:46 -08:00
Max Fitton
5094734205
Update autoscaler-cluster yaml files for release tests ( #13114 )
2021-01-07 11:44:57 -06:00
Simon Mo
01dcb993c7
[Serve] Rescale Serve's Long Running Test to Cluster Mode ( #13247 )
...
Now that `HeadOnly` becomes the new default HTTP location, we can
re-enable the long running tests to use local multi-clusters.
(also fixed the controller's API to match up to date, we should
have caught these, I will open issues for this.)
2021-01-07 08:57:24 -08:00
Sven Mika
bcaff63909
[RLlib] SquashedGaussians should throw error when entropy or kl are called. ( #13126 )
2021-01-07 15:07:35 +01:00
Tao Wang
82c54c67ee
Publish job/worker info with Hex format instead of Binary ( #13235 )
2021-01-07 20:31:58 +08:00
fangfengbin
3669c02821
[GCS]Add gcs actor schedule strategy ( #13156 )
2021-01-07 15:44:33 +08:00
Philipp Moritz
9872fc1801
Start ray client server with 'ray start' ( #13217 )
2021-01-06 21:04:14 -08:00
fangfengbin
9ae5bba7cf
[GCS]Fix gcs table storage GetAll
and GetByJobId
api bug ( #13195 )
2021-01-07 10:37:00 +08:00
Siyuan (Ryans) Zhuang
dde49b8d48
[Serialization] Fix cloudpickle ( #13242 )
2021-01-06 17:21:17 -08:00
Siyuan (Ryans) Zhuang
02ae6c5a9a
[Core] Fix incorrect comment ( #13228 )
2021-01-06 11:37:29 -08:00
Max Fitton
0d61ea9b06
[Release] Add 1.1.0 release test logs ( #13054 )
...
* Add microbenchmark to release logs
* check in many_tasks stress test result
* Add results of placement group stress test for 1.1.0
* Add result for test_dead_actors test and correct the name of test_many_tasks.txt
* Add rllib regression test result
* Add pytorch test results for rllib
* remove extraneous log entries
2021-01-06 11:03:16 -08:00
Lingxuan Zuo
01d4638b49
[Log] fix spdlog init race ( #12973 )
...
* fix spdlog init race
* use global logger
* refine logger name and constructor
2021-01-06 11:02:54 -08:00
dHannasch
695833082d
[Redis] Note that each Redis Connect retry takes two minutes ( #12183 )
...
* Slightly alter error message so it's the same in both cases.
* Each retry takes about two minutes.
2021-01-06 11:00:58 -08:00
Kai Fricke
97211a6170
[Tune] Fix tune serve integration example ( #13233 )
2021-01-06 17:02:04 +01:00
SangBin Cho
32dc5676b4
[Metrics] Record per node and raylet cpu / mem usage ( #12982 )
...
* Record per node and raylet cpu / mem usage
* Add comments.
* Addressed code review.
2021-01-05 21:57:21 -08:00
fangfengbin
779b3876f6
[GCS]Fix TestActorSubscribeAll bug ( #13193 )
2021-01-06 13:52:39 +08:00
fangfengbin
dd14e5a3b3
[BugFix][GCS]Fix gcs_actor_manager_test multithreading bug ( #13158 )
2021-01-06 10:47:06 +08:00
Ian Rodney
92963800f6
[tests] Fix Autoscaler Test failure on Windows ( #13211 )
...
* skip create_or_update tests
* Update python/ray/tests/test_autoscaler.py
Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>
Co-authored-by: Ameer Haj Ali <ameerh@berkeley.edu>
2021-01-05 16:48:32 -08:00
Simon Mo
39813ff6b0
[Serve] HTTPOptions for deployment modes ( #13142 )
2021-01-05 16:41:52 -08:00
Amog Kamsetty
bd19ed31e7
[Tune] Fix PBT Transformers Example ( #13174 )
2021-01-05 16:31:11 -08:00
Hao Zhang
7e52351ae5
[Collective] Some necessary abstraction of collective calls before introducing stream management ( #13162 )
2021-01-05 16:20:12 -08:00
Basu Jindal
4e569ee20b
Update multi_agent_independent_learning.py ( #13196 )
...
pettingzoo.utils.error.DeprecatedEnv: waterworld_v0 is now depreciated, use waterworld_v2 instead
2021-01-05 13:44:54 -08:00
Edward Oakes
dc101fd087
[serve] Move controller state into separate files ( #13204 )
2021-01-05 14:37:16 -06:00