Clark Zinzow
da4cc26449
[CI] Disable Java log rotation test. ( #21394 )
2022-01-05 14:51:27 -08:00
Qing Wang
240e6efe21
[Java] Try to fix flaky NamespaceTest ( #21370 )
2022-01-05 09:01:34 +08:00
Qing Wang
340fbf53c0
[Java] Support actor handle reference counting. ( #21249 )
2022-01-01 10:26:22 +08:00
Qing Wang
663e14b232
[Java] Fix namespace test case. ( #21280 )
...
Since we've supported lifetime in Java, we should set the DETACHED for the detached actors in test.
2021-12-28 22:31:51 +08:00
Qing Wang
2df27a5f87
[Java] Support ActorLifetime ( #21074 )
...
We add a enum class ActorLifetime to indicate the lifetime of an actor. In this PR, we also add the necessary API to create an actor with specifying lifetime.
Currently, it has 2 values: detached and default.
2021-12-23 19:48:56 +08:00
WanXing Wang
72bd2d7e09
[Core] Support back pressure for actor tasks. ( #20894 )
...
Resubmit the PR https://github.com/ray-project/ray/pull/19936
I've figure out that the test case `//rllib:tests/test_gpus::test_gpus_in_local_mode` failed due to deadlock in local mode.
In local mode, if the user code submits another task during the executing of current task, the `CoreWorker::actor_task_mutex_` may cause deadlock.
The solution is quite simple, release the lock before executing task in local mode.
In the commit 7c2f61c76c
:
1. Release the lock in local mode to fix the bug. @scv119
2. `test_local_mode_deadlock` added to cover the case. @rkooo567
3. Left a trivial change in `rllib/tests/test_gpus.py` to make the `RAY_CI_RLLIB_DIRECTLY_AFFECTED ` to take effect.
2021-12-13 23:56:07 -08:00
Kai Fricke
d4413299c0
Revert "[Core] Support back pressure for actor tasks ( #19936 )" ( #20880 )
...
This reverts commit a4495941c2
.
2021-12-03 17:48:47 -08:00
WanXing Wang
a4495941c2
[Core] Support back pressure for actor tasks ( #19936 )
...
Support back pressure in core worker.
Job config added for python worker and java worker.
2021-12-02 14:41:30 -08:00
Qing Wang
cd2b83a259
[Core][ConcurrencyGroup] Fix blocking task in default group block tasks in other group. ( #20525 )
...
Why are these changes needed?
If max concurrency is 1 in default group, a blocking task executing in default group will block the following tasks in different group. See reproduction script in #20475
The issue is due to tasks executing in the default concurrent group run in the main task execution thread, and tasks in other concurrent groups will be blocked if the main task execution thread is blocked.
This PR only changes concurrent actor behavior that default group will not block other groups.
Related issue number
Fix #20475
2021-11-25 14:24:17 +08:00
Lixin Wei
a912b68375
[Java] Reenable Named Actor Test. ( #20627 )
...
We skipped testGetNonExistingNamedActor for some reason. Now this test is ready to enable. This PR reenables this test.
2021-11-22 16:25:16 +08:00
Larry
454db6902c
[Java] Add timeout parameter for Ray.get() API ( #20282 )
...
Why are these changes needed?
Add timeout(ms) param for Java ray.get. The API changes have been updated to doc ([Ray Core Walkthrough]->[Fetching Results]).
eg:
ObjectRef<Integer> objRef = Ray.put(1);
objRef.get(1000)
Ray.get(Ray.task(MyRayApp::slowFunction).remote(), 3000)
Related issue number
#20247
2021-11-17 11:02:17 +08:00
Qing Wang
6504ad6bb2
[xlang] Add named actor xlang tests. ( #20368 )
...
We add named actor xlang tests, including both getting java named actor in python and get python named actor in Java.
Related issue number
#19794
2021-11-16 21:42:05 +08:00
Qing Wang
1172195571
[Java] Remove global named actor and global pg ( #20135 )
...
This PR removes global named actor and global PGs.
I believe these APIs are not used widely in OSS.
CPP part is not included in this PR.
@kfstorm @clay4444 @raulchen Please take a look if this change is reasonable.
IMPORTANT NOTE: This is a Java API change and will lead backward incompatibility in Java global named actor and global PG usage.
CPP part is not included in this PR.
INCLUDES:
Remove setGlobalName() and getGlobalActor() APIs.
Remove getGlobalPlacementGroup() and setGlobalPG
Add getActor(name, namespace) API
Add getPlacementGroup(name, namespace) API
Update doc pages.
2021-11-15 16:28:53 +08:00
Qing Wang
7500f7d88a
Remove deprecated Java PG APIs. ( #20219 )
...
These APIs were deprecated at least 7+ months and 4+ versions, it's the time and very necessary to remove them.
2021-11-12 09:29:48 +08:00
Qing Wang
6d8a7291ab
Add getNamespace API for Java worker ( #20057 )
...
[Java API] Add getNamespace API for Java worker.
2021-11-08 15:51:14 +08:00
Qing Wang
4373aa1e3b
Support generating a UUID string as the anonymous namespace for Java worker. ( #19986 )
...
Why are these changes needed?
For Java worker, we generate a UUID string as the namespace if a job is not specified a namespace by user.
Related issue number
#16474
2021-11-04 11:40:17 +08:00
Jiajun Yao
5de4a38948
[CI] Run Java CI on Mac ( #19757 )
...
Why are these changes needed?
Enable Java tests on Mac CI to avoid more breakages.
Related issue number
Closes #19700
2021-11-03 23:40:05 +08:00
Qing Wang
da6894848d
Support Java namespace APIs ( #19468 )
...
## Why are these changes needed?
## Related issue number
#16474
2021-11-02 11:05:40 +08:00
Qing Wang
7647ea3512
[Java] Add helper method to build driver process. ( #19740 )
...
We make the buildDriver() process as a helpful util to avoid duplicate code.
2021-10-27 10:17:37 +08:00
Jiajun Yao
805ce453dd
[Java] Remove auto-generated pom.xml files. ( #19475 )
2021-10-19 17:35:37 +08:00
Gagandeep Singh
d226cbf21a
Added StartupToken to idenitfy a process at startup ( #19014 )
...
* Added StartupToken to idenitfy a process at startup
* Applied linting formats
* Addressed reviews
* Fixing worker_pool_test
* Fixed worker_pool_test
* Applied linting formatting
* Added documentation for StartupToken
* Fixed linting
* Reordered initialisation of WorkerPool members
* Fixed Python docs
* Fixing bugs in cluster_mode_test
* Fixing Java tests
* Create and set shim process after verifying startup_token
* shim_process.GetId() -> worker_shim_pid
* Improvements in startup token and modifying java files
* update io_ray_runtime_RayNativeRuntime.h
* Fixed java tests by adding startup-token to conf
* Applied linting
* Increased arg count for startup_token
* Attempt to fix streaming tests
* Type correction
* applied linting
* Corrected index of startup token arg
* Modified, mock_worker.cc to accept startup tokens
* Applied linting
* Applied linting changes from CI
* Removed override from worker.h
* Applied linting from scripts/format.sh
* Addressed reviews and applied scripts/format.sh
* Applied linting script from ci/travis
* Removed unrequired methods from public scope
* Applied linting
2021-10-15 15:13:13 -07:00
hazeone
c2f0035fd2
[Java]Support getGpuIds API ( #19031 )
...
Add java getGpuIds() API which is the same as get_gpu_ids in python. We can get deviceId if we've allocated a GPU to a worker.
2021-10-13 23:40:26 +08:00
Qing Wang
90d2456ec7
[Java] Support userloggers. ( #18846 )
...
Co-authored-by: Kai Yang <kfstorm@outlook.com>
2021-09-26 16:53:06 +08:00
Qing Wang
371f03fa48
Remove dynamic resource from client side. ( #18514 )
2021-09-11 10:39:59 -07:00
Qing Wang
d87441cda7
[Java] ConcurrencyGroup in Java local mode. ( #18241 )
...
* WIP
* Fix
* Fix test
* Refine
* Fix lint,
* WIP2
* WIP2
* Refine
* Put a default concurrency group.
* Fix submitting task with concurrency group name.
* Remove unnecessary changes.
* Update java/runtime/src/main/java/io/ray/runtime/task/LocalModeTaskSubmitter.java
Co-authored-by: Kai Yang <kfstorm@outlook.com>
Co-authored-by: Kai Yang <kfstorm@outlook.com>
2021-09-07 20:43:31 +08:00
Stephanie Wang
8e06db7280
Revert "[Core] revert: revert Unified worker starter ( #18008 )" ( #18228 )
...
This reverts commit b9978dd02b
.
2021-08-30 17:28:41 -07:00
Zhi Lin
d3786ac131
Bump Java version to 2.0.0-SNAPSHOT ( #15394 )
...
* bump java version to 2.0.0-SNAPSHOT
* update
2021-08-30 12:25:30 +08:00
Tao Wang
0b5f5890f7
[Named Actor] Throw RayException when getting named actor timed out ( #17998 )
...
* [Named Actor]throw RayException when getting named actor timed out
* lint
* correct the message
* lint
* nice catch
2021-08-25 13:50:53 +08:00
chenk008
b9978dd02b
[Core] revert: revert Unified worker starter ( #18008 )
2021-08-23 13:34:32 -07:00
Eric Liang
a9073d16f4
Revert "[Core] Unified worker initiators ( #17401 )" ( #17935 )
...
This reverts commit c3764ffd7d
.
2021-08-18 18:06:24 -07:00
chenk008
c3764ffd7d
[Core] Unified worker initiators ( #17401 )
...
* use setup_worker as starter
* use setup_worker as starter
* add java test
* fix
* fix
* lint
* sleep in ci
* sleep in ci
* fix ut
* fix
* fix
* fix
* fix
* fix
* fix
* change test size
* test
* fix
* fix
* fix ut
* restore sgd test
* change test size
* fix merge confict
* restore cpp worker flag
* fix
* fix
* add worker-languange in setup_runtime_env.py
* lint
* fix java command
Co-authored-by: root <chenk008>
2021-08-17 19:37:26 +08:00
Lingxuan Zuo
f2a3085ce2
[Metric]Java metric api enhancement ( #17811 )
...
* Java metric api enhancement:
make tagkey transparent for upper level users
* add java metric tags test
* mark Deprecated
2021-08-16 22:38:27 +08:00
Qing Wang
9d5c68ff55
[Java] Better log message when failed to invoke task. ( #17737 )
2021-08-13 17:31:58 +08:00
Kai Yang
ab53c5fc93
[Java] Update rolling logging configuration ( #17741 )
2021-08-12 10:15:27 +08:00
Qing Wang
6d6a1ea43e
Support reading system configs from native in Java. ( #17703 )
...
* Support reading system configs from native in Java.
* Fix lint
* Lint cpp
* Fix Java cases.
* Address comments.
* Address comments.
2021-08-12 10:06:01 +08:00
Qing Wang
4cc34588db
[Core] Support ConcurrentGroup part1 ( #16795 )
...
* Core change and Java change.
* Fix void call.
* Address comments and fix cases.
* Fix asyncio
2021-08-07 22:41:33 +08:00
Kai Yang
9b3c0ad35b
Fix “argument type mismatch” when an exception occurs in chained tasks ( #17636 )
2021-08-07 17:47:43 +08:00
Zhi Lin
82123123c4
[object store] Java API for Assign the object owner in Ray.put()
( #17237 )
...
Co-authored-by: Qing Wang <kingchin1218@126.com>
Co-authored-by: Kai Yang <kfstorm@outlook.com>
2021-08-06 15:26:59 +08:00
Qing Wang
4bde71ca86
[Java][Core] Support get current actor handle. ( #14900 )
2021-07-12 15:27:54 -07:00
Qing Wang
89b07572da
[Java] Upgrade log4j ( #16657 )
2021-06-24 21:01:27 -07:00
Qing Wang
3272997b0d
[Java] Upgrade some deps to fix CVEs ( #16650 )
2021-06-24 10:56:20 -07:00
Qing Wang
d6d27e9d34
[Java] Enable concurrent calls in local mode. ( #14896 )
...
* Enable concurrent calls in local mode.
* Fix submitting actor tasks before actor creation task executed.
Co-authored-by: Qing Wang <jovany.wq@antgroup.com>
2021-06-10 23:21:11 +08:00
mwtian
dce13d3a81
Explicitly set protobuf dependency version to allow building ray with bazel 4.0.0 ( #15756 )
...
Java protobuf dependency version is made to be consistent as well.
2021-05-13 10:34:09 -07:00
Kai Yang
b0ea947fa3
[Java] Support getCurrentActorId in local mode ( #14890 )
2021-03-31 21:39:39 +08:00
Kai Yang
6278df8604
[Java] refine generation of jvm options ( #14931 )
2021-03-31 21:04:52 +08:00
DK.Pino
374d166f6d
[JAVA] [Doc] Improve java doc for PG ( #14671 )
2021-03-29 11:21:20 +08:00
DK.Pino
ef59c145e2
[Java][Placement Group] Move related API of Placement Group from Ray to PlacementGroups. ( #14729 )
2021-03-23 12:34:12 +08:00
Lixin Wei
72d87093b9
[Core] Make Actor DEAD and Save Exceptions in GCS When Error Happens in Constructor ( #14211 )
2021-03-17 12:50:28 -07:00
Kai Yang
f60bd3afee
[Java] some small improvements ( #14565 )
2021-03-12 12:26:55 +08:00
DK.Pino
26907b7708
Support placement group for normal task in Java API ( #14342 )
...
* support pg for normal task
* fix lint
* fix comment
* fix comment
* update comment
* fix java typo
2021-03-05 10:21:37 +08:00