Stephanie Wang
71d4637b75
[core worker] Refactor CoreWorker member classes ( #5062 )
...
* Move store client mutex inside CoreWorkerPlasmaStoreProvider
* Move PlasmaClient inside CoreWorkerStoreProvider
* Remove CoreWorkerObjectInterface's ref to CoreWorker
* Remove WorkerLanguage
* Remove CoreWorkerTaskInterface's ref to CoreWorker
* Remove CoreWorkerTaskExecutionInterface's ref to CoreWorker
* lint
* move comment
* Fix build
* Fix build
2019-07-02 15:30:30 -07:00
Kai Yang
1cf7728f35
[Core worker] Serialize ActorHandle in core worker. Make ActorHandle thread safe. ( #5034 )
...
* Serialize ActorHandle in core worker. Make ActorHandle thread safe.
* Address comments
* Address comments
* Address comments
* Address comments
* lint
* Address comments
* Address comments
* Address comments
* Address comments
* Minor update
* Address comments
* lint
2019-07-02 16:48:43 +08:00
Eric Liang
904dcf081d
Switch cluster longevity tests to DLAMI, fix ray up verbosity ( #5084 )
...
* fix
* add branch commit
* comments
* Update ci/long_running_tests/.gitignore
Co-Authored-By: Robert Nishihara <robertnishihara@gmail.com>
2019-07-02 00:19:05 -07:00
Qing Wang
247f95b3ff
Refine RegisterClientRequest message to make it clearer. ( #5057 )
...
* transfor driver task id Explicitly
* Refins
* Fix and add comment.
* add more
* Fix
* Fix
* Add comments
* Fix
2019-07-02 14:26:19 +08:00
Philipp Moritz
a6a02fccd0
Do not compile redis twice ( #5074 )
2019-07-01 15:42:54 -07:00
Philipp Moritz
4e82313891
Update to latest arrow ( #5011 )
2019-06-30 20:36:36 -07:00
Simon Mo
0c4dd3c401
Use bazel disk cache with travis ( #5068 )
2019-06-30 17:57:48 -07:00
Simon Mo
6c4c1d444d
Update VersionKey in stats ( #5070 )
2019-06-30 18:23:12 +08:00
Simon Mo
d7ccfbe46b
Bump version to 0.8.0.dev2 ( #5069 )
2019-06-29 23:30:26 -07:00
Simon Mo
b5d473847c
bump version to 0.7.2 ( #5066 )
2019-06-29 19:06:51 -07:00
Robert Nishihara
bcc379556b
Make some fixes to long running stress tests. ( #5056 )
2019-06-28 15:42:54 -07:00
Kai Yang
4ccb7b05cc
[Core worker] Add metadata support in object interface ( #5031 )
2019-06-28 11:35:03 -07:00
Hao Chen
cefbb0c94c
Fix driver id in TaskInfo ( #5055 )
2019-06-28 12:56:48 +08:00
Kai Yang
a39982e676
[Core worker] Task execution passes TaskInfo struct to executor ( #5032 )
2019-06-28 10:59:45 +08:00
Joey Jiang
d6bbbdef35
Use gRPC to handle communication and data transmission between object manager ( #4996 )
2019-06-28 10:56:34 +08:00
Qing Wang
62e4b591e3
[ID Refactor] Rename DriverID to JobID ( #5004 )
...
* WIP
WIP
WIP
Rename Driver -> Job
Fix complition
Fix
Rename in Java
In py
WIP
Fix
WIP
Fix
Fix test
Fix
Fix C++ linting
Fix
* Update java/runtime/src/main/java/org/ray/runtime/config/RayConfig.java
Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>
* Update src/ray/core_worker/core_worker.cc
Co-Authored-By: Stephanie Wang <swang@cs.berkeley.edu>
* Address comments
* Fix
* Fix CI
* Fix cpp linting
* Fix py lint
* FIx
* Address comments and fix
* Address comments
* Address
* Fix import_threading
2019-06-28 00:44:51 +08:00
Qing Wang
d9768c1cd2
[hotfix] Fix master's linting ( #5049 )
...
The linting in CI on master always fail.
2019-06-27 20:21:32 +08:00
Hao Chen
a1156754e9
Fix test_task_forward ( #5040 )
2019-06-27 14:37:00 +08:00
Hao Chen
469ae41013
Fix memory leak in rpc ServerCall and ClientCall ( #5046 )
2019-06-27 13:19:47 +08:00
Daniel Edgecumbe
49c6e81de2
autoscaler/monitor: Kill workers on exception ( #4997 )
2019-06-26 17:59:12 -07:00
Stephanie Wang
1a8d0af814
Remove debug check for uncommitted lineage ( #5038 )
2019-06-26 11:21:00 -07:00
Robert Nishihara
a17c08faa4
Lengthen buffer in resource test. ( #4961 )
2019-06-26 09:54:04 -07:00
Richard Liaw
b1827d5fbe
[tune] Update MNIST Example ( #4991 )
2019-06-25 22:50:15 -07:00
Philipp Moritz
bbe3e5b4ed
[rllib] Give error if sample_async is used with pytorch for A3C ( #5000 )
...
* give error if sample_async is used with pytorch
* update
* Update a3c.py
2019-06-25 22:06:35 -07:00
Zhijun Fu
bb8e75b532
[grpc] refactor rpc server to support multiple io services ( #5023 )
2019-06-25 19:08:09 -07:00
Eric Liang
aa5fc52e32
[rllib] Add QMIX mixer parameters to optimizer param list ( #5014 )
...
* add mixer params
* Update qmix_policy.py
2019-06-25 19:02:40 -07:00
Hao Chen
0131353d42
[gRPC] Migrate gcs data structures to protobuf ( #5024 )
2019-06-25 14:31:19 -07:00
Richard Liaw
bd8aceb896
[ci] Change Jenkins to py3 ( #5022 )
...
* conda3
* integration
* add nevergrad, remotedata
* pytest 0.3.1
* otherdockers
* setup
* tune
2019-06-24 21:50:37 -07:00
Ashwinee Panda
11ccf66346
[docs] docs for running Tensorboard without sudo ( #5015 )
...
* Instructions for running Tensorboard without sudo
When we run Tensorboard to visualize the results of Ray outputs on multi-user clusters where we don't have sudo access, such as RISE clusters, a few commands need to first be run to make sure tensorboard can edit the tmp directory. This is a pretty common usecase so I figured we may as well put it in the documentation for Tune.
* Update tune-usage.rst
2019-06-24 11:26:53 -07:00
Qing Wang
e33d0eac68
Add dynamic worker options for worker command. ( #4970 )
...
* Add fields for fbs
* WIP
* Fix complition errors
* Add java part
* FIx
* Fix
* Fix
* Fix lint
* Refine API
* address comments and add test
* Fix
* Address comment.
* Address comments.
* Fix linting
* Refine
* Fix lint
* WIP: address comment.
* Fix java
* Fix py
* Refin
* Fix
* Fix
* Fix linting
* Fix lint
* Address comments
* WIP
* Fix
* Fix
* minor refine
* Fix lint
* Fix raylet test.
* Fix lint
* Update src/ray/raylet/worker_pool.h
Co-Authored-By: Hao Chen <chenh1024@gmail.com>
* Update java/runtime/src/main/java/org/ray/runtime/AbstractRayRuntime.java
Co-Authored-By: Hao Chen <chenh1024@gmail.com>
* Address comments.
* Address comments.
* Fix test.
* Update src/ray/raylet/worker_pool.h
Co-Authored-By: Hao Chen <chenh1024@gmail.com>
* Address comments.
* Address comments.
* Fix
* Fix lint
* Fix lint
* Fix
* Address comments.
* Fix linting
2019-06-23 18:08:33 +08:00
Philipp Moritz
2e342ef71f
Fix tensorflow-1.14 installation in jenkins ( #5007 )
2019-06-21 11:04:40 -07:00
Joey Jiang
a7f84b536f
Fix no cpus test ( #5009 )
2019-06-21 17:08:25 +08:00
Philipp Moritz
3b23d94cb8
Fix valgrind build by installing new version of valgrind ( #5008 )
2019-06-20 22:22:37 -07:00
Richard Liaw
31b6da12f9
[tune] Tutorial UX Changes ( #4990 )
...
* add integration, iris, ASHA, recursive changes, set reuse_actors=True, and enable Analysis as a return object
* docstring
* fix up example
* fix
* cleanup tests
* experiment analysis
2019-06-21 12:59:49 +08:00
Eric Liang
1d17125333
temp fix for build ( #5006 )
2019-06-20 18:07:44 -07:00
Andrew Berger
e59e8074dd
fix handling of non-integral timeout values in signal.receive ( #5002 )
2019-06-20 15:33:40 -07:00
Qing Wang
7bda5edc16
Fix Java CI failure ( #4995 )
2019-06-19 11:36:21 +08:00
Hao Chen
2bf92e02e2
[gRPC] Use gRPC for inter-node-manager communication ( #4968 )
2019-06-17 19:00:50 +08:00
Qing Wang
b08765a08b
Fix a crash when unknown worker registering to raylet ( #4992 )
2019-06-17 13:34:23 +08:00
Simon Mo
05e2748070
Inherit Function Docstrings and other metedata ( #4985 )
2019-06-15 11:01:27 -07:00
Tianhong Dai
1b86e551fb
Fix bugs in the a3c code template. ( #4984 )
2019-06-14 17:22:36 -07:00
Zhijun Fu
37abdb283f
[Core worker] add store & task provider ( #4966 )
2019-06-14 18:35:32 +08:00
Hao Chen
3c92b2ee4d
Upgrade CI clang-format to 6.0 ( #4976 )
2019-06-14 14:52:32 +08:00
Eric Liang
fa1d4c9807
[rllib] Fix DDPG example ( #4973 )
2019-06-13 15:07:46 -07:00
Qing Wang
ef1af49efd
[Java] Fix bug of BaseID
in multi-threading case. ( #4974 )
2019-06-13 20:52:41 +08:00
Robert Nishihara
d2f5b71c3b
Remove typing from setup.py install_requirements. ( #4971 )
2019-06-12 15:02:12 -07:00
Stephanie Wang
89ca5eeb29
Flush all tasks from local lineage cache after a node failure ( #4964 )
2019-06-12 11:13:39 -07:00
Peter Schafhalter
e0e52f1871
[sgd] Add non-distributed PyTorch runner ( #4933 )
...
* Add non-distributed PyTorch runner
* use dist.is_available() instead of checking OS
* Nicer exception
* Fix bug in choosing port
* Refactor some code
* Address comments
* Address comments
2019-06-11 22:38:34 -07:00
Zhijun Fu
472c36ed1e
[core worker] add task submission & execution interface ( #4922 )
2019-06-12 10:10:12 +08:00
Philipp Moritz
ebb3b3b928
[core] Use int64_t instead of int to keep track of fractional resources ( #4959 )
2019-06-10 23:49:04 -07:00