Commit graph

3877 commits

Author SHA1 Message Date
Ce Gao
574abe844a [ray-operator] Remove useless RBAC rules (#6853)
Signed-off-by: Ce Gao <gaoce@caicloud.io>
2020-01-21 00:31:07 -06:00
Lingxuan Zuo
7e484687d3 Use GET-SET macro to reduce duplicated code. (#6863) 2020-01-21 10:57:57 +08:00
mehrdadn
139bf8908e Replace UNIX sockets with TCP sockets in Ray on Windows (#6823)
* Replace UNIX sockets with TCP sockets in Ray
2020-01-20 17:28:11 -08:00
Stephanie Wang
815cd0e39a Task and actor fate sharing with the owner process (#6818)
* Add test

* Kill workers leased by failed workers

* merge

* shorten test

* Add node failure test case

* Fix FromBinary for nil IDs, add assertions

* Test

* Fate sharing on node removal, fix owner address bug

* lint

* Update src/ray/raylet/node_manager.cc

Co-Authored-By: Zhijun Fu <37800433+zhijunfu@users.noreply.github.com>

* fix

* Remove unneeded test

* fix IDs

Co-authored-by: Zhijun Fu <37800433+zhijunfu@users.noreply.github.com>
2020-01-20 16:44:04 -08:00
Eric Liang
14016535a5 [rllib] Add TF and Torch icons to show which are available for each algo (#6869) 2020-01-20 15:22:21 -08:00
Ce Gao
125e26dde5 [ray-operator] Watch the pod resource and remove useless code (#6852)
Signed-off-by: Ce Gao <gaoce@caicloud.io>
2020-01-20 12:13:30 -06:00
Ce Gao
23f32c5ec8 [ray-operator]: Add ignore file (#6851)
Signed-off-by: Ce Gao <gaoce@caicloud.io>
2020-01-20 12:13:01 -06:00
Philipp Moritz
96e2c1ae74 [Projects] Add small tutorial for projects (#6641) 2020-01-20 09:33:41 -08:00
mehrdadn
10609c3a19 Use standard EditorConfig file for editor settings (#6861)
https://editorconfig.org/

Co-authored-by: GitHub Web Flow <noreply@github.com>
2020-01-20 08:03:06 -08:00
Robert Nishihara
c2cbb85a43 Fix flaky test test_feature_flag (#6850) 2020-01-19 20:59:03 -08:00
Richard Liaw
341ddd0a09 [tune] Default to TensorboardX and include in requirements. (#6836) 2020-01-19 01:49:33 -08:00
Eric Liang
a229bdf272 [rllib] Deprecate custom preprocessors (#6833)
* deprecation warnings

* add log warn

* fix test
2020-01-18 23:30:09 -08:00
Richard Liaw
8a9bd18606 [tune] Remove keras dependency (#6827) 2020-01-18 23:24:42 -08:00
Richard Liaw
c9a1810392 [doc] Add meetup link (temporary) (#6835) 2020-01-18 17:53:47 -08:00
Sven Mika
7659cae3ba [RLlib] Add PG torch regression test (#6828)
* Add PG torch regression test to tuned_examples/regression_tests dir.

* Rename cartpole-pg.yaml into cartpole-pg-tf.yaml

* cartpole-pg-tf.yaml: Change cartpole-pg name of tuned_example to cartpole-pg-tf.
2020-01-18 15:57:12 -08:00
Justin Terry
97bf79917c [RLlib] Update MADDPG example repo to maintained fork (#6831) 2020-01-18 13:08:27 -08:00
Yuhao Yang
9b1d2953de [tune] set correct path when deleting checkpoint folder (#6758) 2020-01-17 23:11:03 -08:00
Sven Mika
303547f119 [RLlib] Policy-classes cleanup and torch/tf unification. (#6770) 2020-01-17 22:26:28 -08:00
Mitchell Stern
763818b476 [Dashboard] Add static assets for speedscope v1.5.3 (#6822) 2020-01-17 20:53:53 -08:00
Sven Mika
e6227082bd [RLlib] Add torch flag to train.py (#6807) 2020-01-17 18:48:44 -08:00
Yunzhi Zhang
3acf3c7675 [Dashboard] Add actor task counter (#6820) 2020-01-17 15:43:56 -08:00
Simon Mo
8f246c17b5 Initialize async plasma for async actors (#6813)
* Initialize async plasma for async actors

* Address comment
2020-01-17 14:58:06 -08:00
Ameer Haj Ali
9f9c3f5026 adding context parameter for pool with a warning for not being supported (#6776) 2020-01-17 16:57:18 -06:00
Richard Liaw
a3a268435f [docs] Edit survey links (#6777) 2020-01-17 11:52:04 -08:00
Zhijun Fu
92380dd4e6 Fix crash in HandleObjectMissing when direct actor creation task is not found in local_queues_ (#6817) 2020-01-17 13:29:13 -06:00
Edward Oakes
30776450a3 num_cpus=1 by default in Pool (#6812) 2020-01-17 13:28:25 -06:00
Qstar
0f3205af0b [Projects] Delete pods associated with the project when running ray session stop (#6787) 2020-01-17 10:42:30 -08:00
Mitchell Stern
9f96091aef [Dashboard] Add logical view displaying actor tree (#6810)
* [Dashboard] Add logical view displaying actor tree

* Fix key error in test_raylet_info_endpoint
2020-01-17 10:25:27 -08:00
Yuhao Yang
5f36e6eacb [tune] get checkpoints paths for a trial after tuning (#6643) 2020-01-17 10:15:04 -08:00
chaokunyang
fa3c513276 [Streaming] Streaming filter transform (#6816)
* add filter transform

* lint

* add new line
2020-01-17 22:05:47 +08:00
micafan
e143f85ca0 [GCS] Use new interface class GcsClient in ray (#6805) 2020-01-17 14:51:18 +08:00
Mitchell Stern
8e8b66a4b8 Add route for /favicon.ico to fix missing favicon (#6815) 2020-01-16 21:03:21 -06:00
Richard Liaw
232be5a058 [sgd] fault tolerance for pytorch + revamp documentation (#6465) 2020-01-16 18:38:27 -08:00
fangfengbin
e5ad4e6f8d Add worker info handler to gcs service (#6798)
* add worker info handler

* rebase master

* add log

* remove unused variable

* fix code style
2020-01-16 22:35:00 +08:00
Mitchell Stern
05674c219f Accept any port in test_get_webui in test_webui.py (#6804) 2020-01-15 23:16:35 -06:00
mehrdadn
fb8e3615d5 Use Boost.Process instead of pid_t (#6510)
* Use Boost.Process instead of pid_t

This will let us handle child processes (mostly) uniformly across platforms.
TODO: There is no SIGTERM on Windows; achieving something equivalent is fairly involved.
2020-01-15 20:05:02 -08:00
fangfengbin
f9fa93eaf1 Add error info handler to gcs service (#6754)
* add error info accessor

* rebase master

* add function comments

* capture type instead of request
2020-01-16 11:59:00 +08:00
Edward Oakes
b750bd7fc9 Use 2xlarge instances in long running tests (#6802) 2020-01-15 19:47:59 -06:00
Ziyad Edher
c480d1d1e4 Treat static methods as class methods instead of instance methods in actors (#6756)
* Treat static methods as class methods rather than instance methods

* Add tests for static methods in actors

* Revert formatting changes

* Readd future imports

* Restructure static method check

* Documentation enhancements

* Fix linting issues
2020-01-15 19:38:41 -06:00
micafan
2e972e725a [GCS] Add WorkerInfoAccessor to GCS Client (#6788) 2020-01-16 09:28:32 +08:00
mehrdadn
4780b52ea8 Polish clang-format and let it run in a local repo (#6793) 2020-01-15 11:17:49 -08:00
Qing Wang
ad90693ca8 [Java] Enhancement single process mode (#6795)
* enhancement

* Add ut

* Update java/runtime/src/main/java/org/ray/runtime/task/LocalModeTaskSubmitter.java

Co-Authored-By: Kai Yang <kfstorm@outlook.com>

* Update java/test/src/main/java/org/ray/api/test/RunModeTest.java

Co-Authored-By: Kai Yang <kfstorm@outlook.com>

* Address comments

* Use ExecutorSerivce to replace raw thread

Co-authored-by: Kai Yang <kfstorm@outlook.com>
2020-01-15 21:38:53 +08:00
Kai Yang
cd5fc81bdd Show the exit code if RunManager failed to start a process (#6797) 2020-01-15 19:31:56 +08:00
micafan
a0dc02042b [GCS] Add ErrorInfoAccessor to GCS Client (#6749) 2020-01-15 13:38:58 +08:00
Tianyi Chen
9a4da1951e [Streaming] Streaming scheduler - part1-1: job graph (#6712) 2020-01-15 13:12:03 +08:00
Edward Oakes
4227fd1b60 fix flaky test_wait (#6791) 2020-01-14 14:43:16 -06:00
Kai Yang
ddd4c42fe5 [Java] Add killActor API in Java (#6728)
* Add killActor API in Java

* fix javadoc

* update test case

* Address comments
2020-01-14 17:12:00 +08:00
Sven Mika
2bcf72e306 DQN distributional model: Replace all legacy tf.contrib imports with tf.keras.layers.xyz or tf.initializers.xyz. (#6772)
- This fixes a test case in test_evaluators.py.
2020-01-13 21:48:16 -08:00
Edward Oakes
3ea3b56eb1 Hotfix missing fields in multiprocessing.Pool (#6784) 2020-01-13 16:39:33 -06:00
Sven Mika
4ee566129f Ignore io.UnsupportedOperation error when "Enabling nice stack traces on SIGSEGV etc." in worker.py::connect(). (#6771)
- Fixes RLlib tf-eager test cases for all agents when run locally on Ubuntu and Mac.
2020-01-13 14:31:13 -08:00