Commit graph

1082 commits

Author SHA1 Message Date
Eric Liang
e556b729c2 [direct call] Fix max_calls interaction with background tasks. (#6536) 2019-12-19 13:48:32 -08:00
Edward Oakes
41fa2e9604 Remove object id translation (#6531) 2019-12-19 12:47:49 -08:00
mehrdadn
634b437543 Work around some compile errors for boost::asio::generic::stream_protocol (#6545) 2019-12-19 12:05:41 -08:00
Hao Chen
7e2addb424 Revert "[GCS] refactor the GCS Client Task Interface (#5515)" (#6543)
This reverts commit f78583147c.
2019-12-19 17:11:36 +08:00
micafan
f78583147c [GCS] refactor the GCS Client Task Interface (#5515) 2019-12-19 15:02:21 +08:00
mehrdadn
012c0ca7dc Fix some missing header files (#6527) 2019-12-18 16:54:51 -08:00
Yunzhi Zhang
c507859a83 [Dashboard] Node resource display fix (#6521) 2019-12-18 12:07:37 -08:00
Simon Mo
26ec500ef9 Implement async get for direct actor call (#6339) 2019-12-18 11:50:21 -08:00
Kai Yang
978d72be0a Disable port sharing in grpc server (#6479) 2019-12-18 14:48:54 +08:00
Yunzhi Zhang
166560e428 [Dashboard] displays resources row (#6516) 2019-12-17 01:05:57 -08:00
Eric Liang
1a1324d2a2 Bump version from 0.8.0.dev6 -> 0.9.0.dev (#6508) 2019-12-16 23:57:42 -08:00
Edward Oakes
38b43fb3ca Optimize O(n^2) behavior in dependency resolver (#6509)
* Optimize O(n^2) behavior in dependency resolver

* fix check

* checks
2019-12-16 18:41:02 -08:00
mehrdadn
74b2e871b7 Tentative workaround for some forks and signals on Windows (#6362)
* Platform shims for Windows

* Tentative workaround for some forks and signals on Windows

* Rewrite WorkerPool::StartProcess by moving spawnvp wrapper to a separate function

* Separate spawnvp the wrappers for POSIX and Windows

* Fix rv use
2019-12-16 16:57:49 -08:00
Kai Yang
c3ef8581d2 [Java] fix UT segmentation fault on exit (#6455)
* fix segmentation fault in Java test

* update comments

* address comments
2019-12-15 17:52:34 +08:00
Kai Yang
9cc0ecc6ff Fix duplicated logging if log dir is not set (#6342) 2019-12-15 13:29:36 +08:00
ZhuSenlin
6c0531683f Add gcs server as well as the unit test (#6401) 2019-12-15 13:23:42 +08:00
Edward Oakes
e2b7459bfc Fix worker exit cleanup (#6450)
* working but ugly

* comments

* proper but hanging in grpc server destructor

* grpc server shutdown deadline

* fix disconnect

* lint

* shutdown_only in test

* replace shutdown
2019-12-13 16:52:50 -08:00
Eric Liang
335dade1e6 Check worker id for all core worker RPCs (#6472)
* check worker id

* fix test

* owner

* fix tests

* comments
2019-12-13 10:15:56 -08:00
Eric Liang
eb6f3f86e5 Seed using multiple samples (#6471) 2019-12-12 21:41:19 -08:00
Stephanie Wang
c57dcc82d1 Port actor creation to use direct calls (#6375) 2019-12-12 19:50:51 -08:00
Eric Liang
5a5c94939f [direct call] Retry failed tasks with delay (#6453)
* retry failed tasks with delay

* set to 0 for direct tests
2019-12-12 17:12:38 -08:00
Kai Yang
3adbe29450 fix core worker test hanging due to heartbeat is not working (#6416) 2019-12-12 18:16:28 +08:00
micafan
8c1520d18e [GCS] refactor the GCS Client Job Interface (#5503) 2019-12-12 16:57:32 +08:00
wanxing
64d8626d6d Optimize ray::LocalMemoryBuffer performance (#6384) 2019-12-11 21:49:52 -08:00
Eric Liang
58ac8639b9 Fix bad checks and race condition from actor_deaths and node_failures tests (#6411) 2019-12-11 14:47:24 -08:00
Edward Oakes
82f7dbc7a7 Increase TaskID size by 2 bytes, taken from JobID (#6425)
* Increase TaskID size by 2 bytes, taken from JobID

* comments

* check max job id

* fix doc

* fix local mode
2019-12-11 10:45:14 -08:00
Kai Yang
a131082767 fix startup worker process count for multi-threading (#6382) 2019-12-11 20:19:49 +08:00
Edward Oakes
044527adb8 Remove ref counting dependencies on ray.get() (#6412)
* Remove ref counting dependencies on Get()

* comment

* don't send IDs when disabled

* pass through internal config

* fix

* allow reinit

* remove flag
2019-12-10 18:11:34 -08:00
mehrdadn
2f19b01898 Factor out stream protocols for Windows (#6408) 2019-12-10 15:23:30 -08:00
Chaokun Yang
6272907a57 [Streaming] Streaming data transfer and python integration (#6185) 2019-12-10 20:33:24 +08:00
Rong Rong
c1d4ab8bb4 Move top level RayletClient to ray::raylet::RayletClient (#6404) 2019-12-09 21:08:59 -08:00
Eric Liang
a6bc2b1842 Misc direct call fixes from unit tests (#6394) 2019-12-08 19:34:02 -08:00
Kai Yang
eb912b68b1 [Java] Fix instanceof RayPyActor (#6377) 2019-12-07 16:28:29 +08:00
Kai Yang
7e9fddf3ed [Java] Add java exception check in JNI (#6378) 2019-12-07 16:25:17 +08:00
Kai Yang
2003d2d952 explicit delete local reference in task_execution_callback for garbage collection (#6379) 2019-12-06 18:53:24 +08:00
Zhijun Fu
b88b8202cc fix java build failure (#6062) 2019-12-06 14:38:43 +08:00
Ion
1c638a11a7 Refactor helper methods for new scheduler integration (#6354) 2019-12-05 18:49:25 -08:00
Edward Oakes
f63b64310a Bump version to 0.8.0.dev7 (#6303) 2019-12-05 18:33:54 -08:00
Eric Liang
6223d2ed0b [direct call] Assign resource ids for direct call tasks (#6364) 2019-12-05 10:16:04 -08:00
micafan
668ce47360 [GCS]Add abstract interface of actor to GCS Client (#6269) 2019-12-05 13:38:29 +08:00
Zhijun Fu
7611e484ec properly handle a forwarded task that gets forwarded back (#6271) 2019-12-05 13:37:52 +08:00
Zhijun Fu
fa98694dd0 Fix raylet crash during cluster shutdown (#6272) 2019-12-05 11:08:58 +08:00
Edward Oakes
f65d65f5de Add WorkerID check to AssignTask (#6355) 2019-12-04 12:38:29 -08:00
Eric Liang
1a3b83abf8 [direct call] Fix hang when caller id changes for actor task submission (#6338) 2019-12-04 12:01:35 -08:00
Stephanie Wang
a82fb5585d [direct task] Remove timeout for resolving futures that were deserialized (#6337)
* Reply GetObjectStatus once the task completes

* Remove timeout-based future resolution

* fix

* Update core_worker.h
2019-12-03 12:04:59 -08:00
Stephanie Wang
d5720779b3 Set the actor ID as the assigned task ID for direct actor workers (#6335)
* Fix

* rename
2019-12-03 10:54:26 -08:00
Edward Oakes
d2c66ba795 Don't add assigned tasks to SWAP queue (#6325) 2019-12-02 16:39:02 -08:00
Ion
2a3adf2d70 New scheduler integration (#6321) 2019-12-02 14:42:16 -08:00
Stephanie Wang
69dd5c9319 [direct task] Fix bug that starts duplicate connections from the worker to the local raylet (#6307)
* Fix bug and add unit test

* rename
2019-12-02 10:25:05 -08:00
Stephanie Wang
da41180dc0 [direct task] Retry tasks on failure and turn on RAY_FORCE_DIRECT for test_multinode_failures.py (#6306)
* multinode failures direct

* Add number of retries allowed for tasks

* Retry tasks

* Add failing test for object reconstruction

* Handle return status and debug

* update

* Retry task unit test

* update

* update

* todo

* Fix max_retries decorator, fix test

* Fix test that flaked

* lint

* comments
2019-12-02 10:20:57 -08:00