Commit graph

7918 commits

Author SHA1 Message Date
Simon Mo
7ba0196863
[Serve] Nested actors should escape placement group (#15524) 2021-04-27 12:45:23 -05:00
Alex Wu
40a6ced996
[core] Handle blocked worker crashes edge case (#15083) 2021-04-27 10:14:12 -07:00
Dominic Ming
3c225bbeaf
[CI] fix npm ci unstable issue (#15537) 2021-04-27 10:11:57 -07:00
fyrestone
43de7f48a7
Fix reported dashboard ip when using 0.0.0.0 (#15506) 2021-04-27 23:48:22 +08:00
Ian Rodney
4db696d365
[Client] Asyncio Client, Sync gRPC Server (#15488) 2021-04-27 08:41:10 -07:00
Amog Kamsetty
643cf4c755
Revert "[Docker] Check torch and tf installation" (#15536)
This reverts commit fb17ef7a40.
2021-04-27 08:40:13 -07:00
Sven Mika
78b776942f
[RLlib] Discussion 1928: Initial lr wrong if schedule used that includes ts=0 (both tf and torch). (#15538) 2021-04-27 17:19:52 +02:00
SebastianBo1995
f5be8d8f74
[Rllib] Offline Learning Bug, different shapes (#15132) 2021-04-27 17:18:17 +02:00
Sven Mika
bb8a286cbc
[RLlib] Support native tf.keras.Model (milestone toward obsoleting ModelV2 class). (#14684) 2021-04-27 10:44:54 +02:00
Ian Rodney
fb17ef7a40
[Docker] Check torch and tf installation 2021-04-27 00:24:23 -07:00
Amog Kamsetty
793145b1ab
[SGD] Split test_torch suite (#15522)
* split test_torch

* remove unused imports

* lint
2021-04-27 00:22:23 -07:00
Kai Fricke
2c11a1aff1
[RLlib] Evaluation parallel to training check, key-error hotfix (#15345) 2021-04-27 08:38:10 +02:00
mvindiola1
9330403200
[RLlib] Mask out padded values for A3C loss with recurrent policy (#15525) 2021-04-27 08:36:04 +02:00
Simon Mo
54f50994bc
[Hotfix] PyArrow version issue (#15530) 2021-04-26 22:28:35 -07:00
Ian Rodney
360b053254
[client] Add support for ray.timeline() (#15448) 2021-04-26 18:32:22 -07:00
Amog Kamsetty
544dff80fa
[Docker] Fix torch GPU install on Ray Docker images (#15473)
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
2021-04-26 16:22:25 -07:00
architkulkarni
a78af85b1c
Revert "[Hotfix] Skip test_runtime_env_complicated (#15515)" (#15520)
This reverts commit a38761b7f2.
2021-04-26 16:02:08 -07:00
Simon Mo
6504c059fe
Wheel Build macOS Python 3.6: 3.6.1 -> 3.6.2 (#15519)
To circumvent the pip issue
https://github.com/pypa/pip/issues/9831
2021-04-26 16:01:39 -07:00
architkulkarni
65497b3795
[Hotfix] pin pip version (#15514) 2021-04-26 14:30:16 -07:00
Yi Cheng
fc701067c3
[runtime env] Support .gitignore exclusion in working dir (#15392) 2021-04-26 16:06:03 -05:00
Simon Mo
a38761b7f2
[Hotfix] Skip test_runtime_env_complicated (#15515) 2021-04-26 11:37:44 -07:00
fcardoso75
c89687d925
Build instructions for Windows (#15422)
* Added build instructions for Windows

* Fixed minor indentantion issue

* Addressing code review comments

* Fixed minor indentation issues

* Fixed minor punctuation issue

* Added bazel to requirements list

* Fixed git download link.

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>

* Added explanation for need to enable Developer Mode

* Moving Fast, Debug, and Optimized Builds one level up

* Added pytest to Windows requirements. Removed old dangling Windows text from Linux build instructions

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
2021-04-26 09:49:36 -07:00
Kai Fricke
f56285577b
Bump XGBoost-Ray version for CI/docs (#15507)
Failing tests seem unrelated.
2021-04-26 14:53:21 +01:00
Sven Mika
354c960fff
[RLlib] Fix test_dependency_torch and fix custom logger support for RLlib. (#15120) 2021-04-24 08:13:41 +02:00
Simon Mo
57c0bd9912
[CI] Run Flaky tests in macOS (#15487) 2021-04-23 15:43:06 -07:00
Simon Mo
7b79e0ed4e
[CI] Mark test_actor_pool medium (#15490) 2021-04-23 15:35:22 -07:00
architkulkarni
b08b2c5103
[Core] Add "shim process" setup_worker.py that calls "conda activate" for runtime_env (#15361) 2021-04-23 15:29:52 -05:00
Edward Oakes
ab797d1d62
[serve] Add test for redirections w/ fastapi (#15461) 2021-04-23 14:28:42 -05:00
Eric Liang
93a1ecba4b
Unhandled error messages aren't printed until next interaction with shell (#15432) 2021-04-23 11:00:34 -07:00
Simon Mo
951943c28b
[Core] Add concurrent.futures.Future wrapper for ObjectRef (#15425) 2021-04-23 11:53:46 -05:00
Dmitri Gekhtman
6b0673f207
[doc][Kubernetes][minor] Restructure section labels for operator launch (#14962) 2021-04-23 09:50:58 -07:00
Charles Tapley Hoyt
251558b753
[tune] Fix type annotation in choice (#15038)
The `Categorical.__init__()` takes any sequence, so the type annotation on `choice()` can be relaxed.
2021-04-23 09:45:32 -07:00
Dmitri Gekhtman
fd43e9e6f8
[kubernetes][doc][minor] Add namespace to job creation command (#15442) 2021-04-23 09:44:51 -07:00
Ian Rodney
cc4a610e6a
[doc] Update requirements-rtd.txt (#15485) 2021-04-23 09:35:47 -07:00
Kai Fricke
c08373b0bf
[tune] Add save/restore for ASHA scheduler (#15438) 2021-04-23 09:35:34 -07:00
Sumanth Ratna
ab542f2c45
[tune] Add HEBO to search algorithm shim function (#15468) 2021-04-23 00:17:48 -07:00
fangfengbin
d9780761a3
[GCS]Revert ping_gcs_rpc_server_max_retries to 600 (#14443) 2021-04-23 10:02:38 +08:00
Dmitri Gekhtman
0d0c2418b8
[client][placement groups] Client placement group hooks, attempt #3 (#15382) 2021-04-22 17:18:55 -07:00
Eric Liang
af01a47d59
Add support for tune,serve,rllib tests to flaky builder (#15447) 2021-04-22 15:03:29 -07:00
Micah Yong
53774209cf
[core] Extend ActorPool API to support adding / removing actors (#15228)
* Add has_free, push, and pop to actor_pool.py with corresponding tests

* Remove period

* Change name from pop to pop_idle
2021-04-22 12:45:45 -07:00
Edward Oakes
17865c0569
Remove ray.workers from __init__.py (#15460) 2021-04-22 14:20:04 -05:00
Edward Oakes
668a784553
[serve] Re-add variable route support for old API (#15455) 2021-04-22 14:07:50 -05:00
Simon Mo
79c24146bd
[Hotfix] Upload the flaky test log (#15458) 2021-04-22 10:32:27 -07:00
Sven Mika
b9761d7081
[RLlib] Discussion 1759: SampleBatch._get_slice_indices stuck for R2D2 when using incorrect Trainer. (#15451)
Thanks @Manuscrit for raising this issue!
2021-04-22 19:21:03 +02:00
Sven Mika
7e1a191f17
[RLlib] Remove all remaining tf- and MuJoCo warnings from RLlib. (#15454) 2021-04-22 19:20:19 +02:00
Simon Mo
baa1b0f360
[Serve] FastAPI allow duplicated routes in class based views (#15445) 2021-04-22 11:27:39 -05:00
Sven Mika
bdda73e2dd
[RLlib] Torch multi-GPU bug fixes (discussion 1755). (#15421)
Thanks a lot @Bam4d for raising this and your help on fixing the worker GPU issue for torch!
2021-04-22 11:29:42 +02:00
Sven Mika
7318439c3d
[RLlib] DQN native_ratio (for training intensity) incorrect (discussion 1763). (#15436)
Thanks @Manuscrit !
2021-04-22 11:06:29 +02:00
Jialing He
5403021430
Fix incorrect call function WorkerID::FromBinary (#15449) 2021-04-22 15:44:49 +08:00
Ian Rodney
810a02b3f2
[Azure][Autoscaler] Allow current user to use Docker (#15380) 2021-04-22 00:30:30 -07:00