mirror of https://github.com/vale981/ray synced 2025-03-15 07:36:38 -04:00
Commit graph

6925 commits

Author SHA1 Message Date
Hao Chen
d49dadf891
Change Python's ObjectID to ObjectRef () 2020-07-10 17:49:04 +08:00
Tao Wang
6311e5a947
[HOTFIX] Fix compile direct_actor_transport_test on mac () 2020-07-10 17:19:34 +08:00
fangfengbin
35861f17a3
Fix gcs_table_storage testcase bug ()
Co-authored-by: 灵洵 <fengbin.ffb@antfin.com>
2020-07-10 16:16:28 +08:00
Hao Chen
bed1be611e
Fix flaky test_dynres.py () 2020-07-10 10:34:23 +08:00
mehrdadn
dd2cc6eb48
Update hiredis and remove Windows patches ()
Co-authored-by: Mehrdad <noreply@github.com>
2020-07-09 18:45:44 -07:00
Patrick Ames
dc51b08c36
[autoscaler] Allow users to disable the cluster config cache ()
* [autoscaler] Remove autoscaler config cache.

* [autoscaler] Add flag allowing users to explicitly disable the config cache.
2020-07-09 15:47:58 -07:00
Stefan Schneider
6db55ca8db
[docs][rllib] Recommended workflow for training, saving, and testing () 2020-07-09 15:47:10 -07:00
Eric Liang
09b9b81ea4
[autoscaler] Move command runners into separate file and clean up interface. ()
* cleanup

* wip

* fix imports

* fix lint
2020-07-09 15:40:56 -07:00
Zhuohan Li
8a76f4cbb5
[Core] put small objects in memory store ()
* remove the put in memory store

* put small objects directly in memory store

* cast data type

* fix another place that uses Put to spill to plasma store

* fix multiple tests related to memory limits

* partially fix test_metrics

* remove not functioning codes

* fix core_worker_test

* refactor put to plasma codes

* add a flag for the new feature

* add flag to more places

* do a warmup round for the plasma store

* lint

* lint again

* fix warmup store

* Update _raylet.pyx

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-07-09 15:39:40 -07:00
Alex Wu
34b85659d4
[Core] New scheduler fixes ()
* .

* test_args passes

* .

* test_basic.py::test_many_fractional_resources causes ray to hang

* test_basic.py::test_many_fractional_resources causes ray to hang

* .

* .

* useful

* test_many_fractional_resources fails instead of hanging now :)

* Passes test_fractional_resources

* .

* .

* Some cleanup

* git is hard

* cleanup

* Fixed scheduling tests

* .

* .
2020-07-09 15:37:51 -07:00
Alisa
f0a72ad985
[Core] Add placement group scheduler and some api in resource scheduler ()
* Add placement group scheduler and some api of resource scheduler.
Merge fix cv hang in multithread variables race ().

* change the bundle id and delete unit count in bundle

change vector<bundle_spec> to vector<shared_ptr<bundle_spec>>

Add placement group scheduler and some api of resource scheduler.
Merge fix cv hang in multithread variables race ().

change the bundle id and delete unit count in bundle

remove CheckIfSchedulable()

add comments and fix the bug in resource

* fix placement group schedule

* add placement group scheduler and change some api in resource scheduler

* fix by the comments

* fix conflict

* fix lint

* fix lint

* fix bug in merge

* fix lint

Co-authored-by: Lingxuan Zuo <skyzlxuan@gmail.com>
2020-07-09 15:37:18 -07:00
Stephanie Wang
0389735d7a
[core] Pass owner address from the workers to the raylet ()
* Add intended worker ID to GetObjectStatus, tests

* Remove TaskID owner_id

* lint

* Add owner address to task args

* Make TaskArg a virtual class, remove multi args

* Set owner address for task args

* merge

* Fix tests

* Add ObjectRefs to task dependency manager, pass from task spec args

* tmp

* tmp

* Fix

* Add ownership info for task arguments

* Convert WaitForDirectActorCallArgs

* lint

* build

* update

* build

* java

* Move code

* build

* Revert "Fix Google log directory again ()"

This reverts commit 275da2e400.

* Fix free

* fix tests

* Fix tests

* build

* build

* fix

* Change assertion to warning to fix java
2020-07-09 14:35:54 -07:00
mehrdadn
4687b807c4
Combine different severities into the same log files ()
* Combine different severities into the same log files

Co-authored-by: Mehrdad <noreply@github.com>
2020-07-09 14:14:28 -07:00
Richard Liaw
b5103bacd1
[tune] Fix github readme ()
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2020-07-09 12:37:24 -07:00
mehrdadn
b6c11f3dd7
Bazel move more shell to native rules ()
Co-authored-by: Mehrdad <noreply@github.com>
2020-07-09 09:02:35 -07:00
Siyuan (Ryans) Zhuang
7e1326c0f6
[Core] Enhance common client connection ()
* enhance client connection

* add write buffer async

* read message

* add test
2020-07-09 08:59:37 -07:00
fangfengbin
b97b474ae9
Fix GcsActorManagerTest multi-thread bug () 2020-07-09 20:03:46 +08:00
fangfengbin
62fe403a57
Fix the bug that PollOwnerForActorOutOfScope's reply may be missing () 2020-07-09 17:33:52 +08:00
Sven Mika
01125b8fcf
[RLlib] DQN rainbow eager-mode (keras style NoisyLayer) (preparation for native tf2.x support). () 2020-07-09 10:44:10 +02:00
Ruifang Chen
c37d30a079
Remove unnecessary asan config ()
Co-authored-by: ruifang.crf <ruifang.crf@alibaba-inc.com>
2020-07-09 14:50:54 +08:00
Lingxuan Zuo
0562236533
[Stats] Make aggregation window and report interval configurable. () 2020-07-09 12:44:38 +08:00
Kai Yang
fa1eadd61f
fix java worker log dir () 2020-07-09 10:11:03 +08:00
SangBin Cho
9f8ff2e3b1
[Core] GCS Actor Management Race Condition ()
* GCS Actor management on by default.

* Fix travis config.

* Change condition.

* Finish the initial race condition fix.

* Lint.

* Refine the codebase.

* Finish the initial version

* Improve logic.

* Remove unnecessary log messages.

* Address code review.

* Add tests

* Revert the second race condition that doesn't happen anymore, handle some edge cases. add tests.

* Address the second race condition found.

* Addressed code review.

* Addressed code review.

* Run a new unit test only when gcs actor management is on.
2020-07-08 10:56:52 -07:00
SangBin Cho
c5aa1eb042
[GCS Actor Management] Fix the dashboard not showing the correct actor state. ()
* Fix a bug.

* Address code review.

* Remove an unnecessary comment.
2020-07-08 10:55:47 -07:00
mehrdadn
05d4cced4e
[CI] GHA path with spaces ()
Co-authored-by: Mehrdad <noreply@github.com>
2020-07-08 10:44:37 -07:00
Ian Rodney
9172f8c3a6
[core] Store Internal Config in GCS () 2020-07-08 11:22:08 -05:00
Sven Mika
4da0e542d5
[RLlib] DDPG and SAC eager support (preparation for tf2.x) () 2020-07-08 16:12:20 +02:00
ZhuSenlin
42f8f16c04
[GCS] Fix detached actor with empty name () 2020-07-08 12:21:24 +08:00
Tao Wang
f395e48031
[GCS] Add integration test for actor info cleaning () 2020-07-08 10:25:08 +08:00
Max Fitton
ca54da66b6
Worker <> GPU Mapping Information in Dashboard () 2020-07-07 18:58:22 -05:00
yncxcw
4ba4110dec
[Core] Make worker_register_timeout_seconds configurable () 2020-07-07 18:27:15 -05:00
SangBin Cho
8f19f1eafb
[Core] Actor handle refactoring ()
* Marking needed changes.

* Resolve basic dependencies.

* In progress.

* linting.

* In progress 2.

* Linting.

* Refactor done. Cleanup needed.

* Linting.

* Recover kill actor in core worker because it is used inside raylet

* Cleanup.

* Use unique pointer instead. Unit tests are broken now.

* Fix the upstream change.

* Addressed code review 1.

* Lint.

* Addressed code review 2.

* Fix weird github history.

* Lint.

* Linting using clang 7.0.

* Use a better check message.

* Revert cpp stuff.

* Fix weird linting errors.

* Manually fix all lint issues.

* Update a newline.

* Refactor some interface.

* Addressed all code review.

* Addressed code review
2020-07-07 11:11:41 -07:00
Ian Rodney
f69d8b951e
[core] Disable flaky cancellation test () 2020-07-07 09:52:35 -07:00
Ian Rodney
a1e14380ce
[core] Switch Async Callback to C++ [WIP] ()
Co-authored-by: simon-mo <simon.mo@hey.com>
2020-07-07 09:47:25 -07:00
fangfengbin
8391f66086
Fix gcs actor manager destroy actor crash bug () 2020-07-07 21:12:30 +08:00
Ian Rodney
079c1eaa5c
[autoscaler] Require keyword arguments () 2020-07-06 22:12:59 -07:00
Benjamin Black
1425cdf834
Pettingzoo environment support ()
* added pettingzoo wrapper env and example

* added docs, examples for pettingzoo env support

* fixed pettingzoo env flake8, added test

* fixed pettingzoo env import

* fixed pettingzoo env import

* fixed pettingzoo import issue

* fixed pettingzoo test

* fixed linting problem

* fixed bad quotes

* future proofed pettingzoo dependency

* fixed ray init in pettingzoo env

* lint

* manual lint

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-07-06 21:32:26 -07:00
Stephanie Wang
b42d6a1ddc
[core] Refactor task arguments and attach owner address ()
* Add intended worker ID to GetObjectStatus, tests

* Remove TaskID owner_id

* lint

* Add owner address to task args

* Make TaskArg a virtual class, remove multi args

* Set owner address for task args

* merge

* Fix tests

* Fix

* build

* update

* build

* java

* Move code

* build

* Revert "Fix Google log directory again ()"

This reverts commit 275da2e400.

* Fix free

* x

* build

* Fix java

* Revert "Revert "Fix Google log directory again ()""

This reverts commit 4a326fcb148ca09a35bc7de11d89df10edbb56e7.

* lint
2020-07-06 21:25:14 -07:00
Ian Rodney
6fecd3cfce
[autoscaler] Run initialization_commands without a persistent connection ()
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
Co-authored-by: Edward Oakes <ed.nmi.oakes@gmail.com>
2020-07-06 16:34:59 -07:00
Richard Liaw
139d21e068
[tune] Docs for tune-sklearn ()
Co-authored-by: krfricke <krfricke@users.noreply.github.com>
2020-07-06 15:35:10 -07:00
SangBin Cho
557da7044f
Fix flaky test that says ray.init is called twice. () 2020-07-06 15:19:00 -07:00
Ian Rodney
c79a49488c
[Doc] Fixing Wheel Build Command () 2020-07-06 11:17:59 -07:00
Piotr Januszewski
155cc81e40
Clarify training intensity configuration docstring () () 2020-07-05 20:07:27 -07:00
ChenZhilei
6f3d993681
GCS server use worker table to handle RegisterWorker instead of redis accessor () 2020-07-06 10:37:25 +08:00
Ian Rodney
dcf989292e
[docker] Update base image () 2020-07-05 17:31:59 -07:00
Sven Mika
f43d934817
[RLlib] Type annotations for policy. () 2020-07-05 13:09:51 +02:00
Richard Liaw
b71c912da7
[tune] Fix up examples () 2020-07-05 01:16:20 -07:00
Tao Wang
f7ac495a68
[Core] Use map instead of list to represent resources in heartbeat message () 2020-07-05 10:59:25 +08:00
mehrdadn
28d18ec836
[windows] Improve GPU detection ()
Co-authored-by: Mehrdad <noreply@github.com>
2020-07-04 13:20:23 -07:00
David Fidalgo
c0ba337fe0
[tune] Add np.bool8 and np.int to allowed HPARAMS types () 2020-07-03 18:34:45 -07:00