Commit graph

6811 commits

Author SHA1 Message Date
Eric Liang
b73d4831d4
Add grace period before warning of resource deadlock 2020-12-12 12:02:13 -08:00
Barak Michener
6eb0e6f734
[format] Improve formatting with a real .flake8 file (#12800)
Change-Id: I42acd948dd915bad6b132f8caa9038898b55d6e4
2020-12-12 11:34:30 -08:00
Richard Liaw
2f2bd884a3
[tune] upgrade gpytorch, bump default pytorch to 1.7.0 (#12776)
* upgrade gpytorch

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* pin

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* version-torch

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* fix-build

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-12 10:35:33 -08:00
Richard Liaw
7e09f1d934
remove-xgboost-build (#12822)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-12 10:34:56 -08:00
fangfengbin
c22990a537
[GCS]GCS node manager rename GetNode to GetAliveNode (#12781) 2020-12-12 20:34:43 +08:00
Kai Fricke
5f04ade6ef
[tune] add more stoppers and stopper documentation (#12750)
* Add new stoppers & docs

* Add tests for maximum iteration stopper and trial plateau stopper

* Update python/ray/tune/stopper.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/tune/api_docs/stoppers.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Update doc/source/tune/api_docs/stoppers.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Apply suggestions from code review

* Apply suggestions from code review

* Update python/ray/tune/stopper.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-12 01:47:19 -08:00
Kai Fricke
905652cdd6
[tune] migrate xgboost callback api (#12745)
* Migrate to new-style xgboost callbacks

* Fix flaky progress reporter test

* Fix import error

* Take last value (not first)
2020-12-12 01:42:20 -08:00
Kai Fricke
42c70be073
[tune] Hyperopt: Directly accept category variables instead of indices (#12715)
* [tune] Hyperopt: Directly accept category variables instead of indices

* Fix interrupt test

* Update python/ray/tune/suggest/hyperopt.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* Apply suggestions from code review

* Update python/ray/tune/suggest/hyperopt.py

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>

* lint

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-12 01:40:53 -08:00
Hao Zhang
0b1fbc5e83
[PR 1/6] Collective in Ray (#12637)
Co-authored-by: YLJALDC <dal177@ucsd.edu>
2020-12-12 01:26:36 -08:00
Alex Wu
aa64cd4534
[New scheduler] Fix test_global_state (#12586) 2020-12-11 21:47:01 -08:00
Edward Oakes
03d869d51c
Hold GIL while submitting (actor) tasks (#12803) 2020-12-11 21:47:16 -06:00
Edward Oakes
aec5c9879e
Add tests for atexit handler behavior (#12808) 2020-12-11 21:47:05 -06:00
Edward Oakes
6262ee1f76
Clarify docs for atexit behavior when using ray.kill (#12807) 2020-12-11 21:45:39 -06:00
Eric Liang
1ce745cf44
Add automatic local GC and plasma debug logs every 10 minutes by default (#12804) 2020-12-11 17:09:58 -08:00
Sven Mika
abb1eefdc2
[RLlib] Issue 12483: Discrete observation space error: "ValueError: ('Observation ({}) outside given space ..." when doing Trainer.compute_action. (#12787) 2020-12-11 22:43:30 +01:00
Alex Wu
676ec363f6
[Object Manager] Pull Manager refactor (#12335) 2020-12-11 11:56:23 -08:00
Simon Mo
3d8c1cbae6
[Serve] Fix Serve Release Tests (#12777) 2020-12-11 11:53:47 -08:00
Eric Liang
4ad4463be6
Add comments to clarify purpose of new scheduler queues (#12730)
* update

* clarify

* update
2020-12-11 11:53:09 -08:00
fangfengbin
9ded69fdaa
[Hotfix] Fix python client lint error (#12783) 2020-12-11 10:15:53 -08:00
Simon Mo
68d7fa2137
Fix exit_actor in asyncio mode (#12693) 2020-12-11 09:35:17 -08:00
Edward Oakes
699ded5328
[serve] Initial commit for CLI (#12770) 2020-12-11 10:31:29 -06:00
Sven Mika
74c98ac38e
[RLlib] Issue 12244: Unable to restore multi-agent PPOTFPolicy's Model (from exported). (#12786) 2020-12-11 16:13:38 +01:00
Tao Wang
295b6e5ce4
Split heartbeat message (#12535)
* first

* xxx

* Split heartbeat message

* only report resource usage when changed

* Fix GetAllResourceUsage

* Fix report resource usage

* Increase default heartbeat interval

* regularize heartbeat interval in test case
2020-12-11 21:19:57 +08:00
Lixin Wei
867d2a8aa3
[Streaming] Add more documents. (#12746)
* add doc

draft

draft

draft

draft

draft

fix

fix

fix

fix

fix

fix

fix

Update README.md

fix

fix

fix

* md to rst

* fix

* fix

* fix

* jpg modified

* add getting envolved

* jpg modified

* Update README.rst

* fix

* fix
2020-12-11 20:36:17 +08:00
Sven Mika
a082ea18b8
[RLlib] Issue 12212: "TFEagerPolicy has no attribute action_sampler_fn. 2020-12-11 12:57:33 +01:00
Stephanie Wang
86b0741026
[new scheduler] Allocate resources for spilled back task to a local view of the remote node (#12711)
* Force report heartbeats if remote resources may be dirty

* lint

* typo

* typo

* unit test

* debug

* Revert "lint"

This reverts commit 6dc7e982ffee98185665eb7c3c8fde0d91938919.

* Revert "Force report heartbeats if remote resources may be dirty"

This reverts commit cbfa9405197df62f874107d55b46715ceae2abd2.

* Local view of resources

* debug travis

* debug

* debug

* debug

* weaken test

* cleanups

* lint

* Revert "debug travis"

This reverts commit 11ff5f4f84e64e9fbd4eecda5b3c7fd07a7130a4.

* revert

* const view, remove unused
2020-12-10 22:43:29 -05:00
Barak Michener
b7f246c451
[ray_client] Include multiple facets of the Ray API (#12736) 2020-12-10 19:09:34 -08:00
Sumanth Ratna
8d1ad25545
[docs] Add troubleshooting section to installation page (#12659)
* Add troubleshooting section to installation docs

* Set fix instructions lang to bash

* Update doc/source/installation.rst

Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-10 18:56:56 -08:00
Ian Rodney
9b3ef2f340
[docs] Fix Docker links (#12702)
* switch autoscaler -> ray-ml

* add more tables
2020-12-10 18:08:48 -08:00
Edward Oakes
62d6b0a558
Fix max_task_retries for named actors (#12762) 2020-12-10 18:24:55 -06:00
Edward Oakes
0e90cbcd19
Remove unused ci/performance_tests (#12767) 2020-12-10 18:23:16 -06:00
Edward Oakes
c7b6ec88ef
[serve] Make serve __del__ log DEBUG level (#12766) 2020-12-10 18:14:55 -06:00
Edward Oakes
3c44c0d3e4
[serve] Long polling for routes in http server (#12724) 2020-12-10 18:02:02 -06:00
Lee moon soo
006856b9a1
fix gpu base image name in build-docker.sh script (#12642) 2020-12-10 14:31:59 -08:00
Sumanth Ratna
932837eb4c
[streaming] Remove unused imports in streaming CI tests (#12722) 2020-12-10 16:27:06 -06:00
Ruoyun Huang
2e084959a1
Fix a wrong import in test_performance.py (#12734) 2020-12-10 16:26:21 -06:00
Eric Squires
231ecffa3d
add tags.lock and tags.temp to .gitignore (#12752)
These can be temporarily created by vim.
2020-12-10 14:24:32 -08:00
Eric Squires
9f70293700
Remove debug extras from setup.py (#12751) 2020-12-10 16:23:11 -06:00
architkulkarni
3fd3cb96ed
[Utils] Add Queue async and batch methods (#12578) 2020-12-10 10:49:18 -06:00
Ian Rodney
38ba238606
[serve] Create FutureResults from ControllerAPI (#12577) 2020-12-10 10:44:08 -06:00
Sven Mika
deb33bce84
[RLlib] Add DQN SoftQ learning test case. (#12712) 2020-12-10 14:55:19 +01:00
Kai Yang
e3b5deb741
[Multi-tenancy] Delete flag enable_multi_tenancy and remove old code path (#10573) 2020-12-10 19:01:40 +08:00
Robert Nishihara
d681991773
Add Discourse to readme and make it more prominent in docs. (#12740) 2020-12-10 01:13:40 -08:00
Ian Rodney
cf30630d2e
[docker] Use legacy resolver (#12741) 2020-12-10 01:12:46 -08:00
Ameer Haj Ali
2f8e308444
[autoscaler] LoadMetrics missed logger.debug (#12714) 2020-12-09 17:19:36 -08:00
Ian Rodney
a9da4f3201
[docker] Make Ray-ml more compatible (#12574) 2020-12-09 16:03:39 -08:00
Stephanie Wang
a776209aec
Revert "Fix dashboard agent check ppid is raylet pid (#12256)" (#12729)
This reverts commit 3ce9286977.
2020-12-09 17:20:38 -05:00
dHannasch
d455cae036
Add period to error message. (#12716) 2020-12-09 15:58:21 -06:00
Richard Liaw
974570b4fb
oops (#12728)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-12-09 13:38:10 -08:00
Keqiu Hu
ee012532fb
[core] Use node manager client pool for GCS service #10398 (#12368)
* raylet client pool

* Fix merging conflict

* Fix documentation typo

* fix linting

* address comments

* fix typo

* remove unintended logging

* address comments

* fix bazel file lint error
2020-12-09 12:44:40 -08:00