Max Fitton
acf4508688
Display GPU Utilization in the Dashboard ( #8564 )
2020-06-17 20:30:21 -07:00
Stephanie Wang
3d7f61a31e
Use no_restart=False for ray.kill in Serve failure test ( #8952 )
2020-06-17 20:29:43 -07:00
mehrdadn
29770cbb94
Fix Windows build ( #8905 )
...
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-12 00:24:55 -07:00
Sven Mika
a69715e9aa
[RLlib] Issue 8889: action clipping bug ppo not learning mujoco ( #8898 )
2020-06-11 14:43:35 -07:00
Eric Liang
002e1e7c8d
[rllib] Set framework to tf by default and remove import checks; "Auto" option ( #8748 )
...
* tf by default
* Update rllib/agents/trainer.py
Co-authored-by: Sven Mika <sven@anyscale.io>
* remove it
* fix
* remove
* fix
* lint
Co-authored-by: Sven Mika <sven@anyscale.io>
2020-06-11 14:43:21 -07:00
Ian Rodney
f6034fd12e
[core] Check that port is unused before assigning to worker ( #8773 )
2020-06-11 00:43:01 -07:00
SangBin Cho
c3a3b00a57
Node failure test fix ( #8882 )
2020-06-11 00:42:41 -07:00
Sven Mika
9f151c1d6f
[Testing] Fix LINT/sphinx errors. ( #8874 )
2020-06-11 00:42:29 -07:00
SangBin Cho
3ddf8a41ae
[Core] Fix a detached actor bug when GCS actor management is off. ( #8843 )
2020-06-11 00:41:42 -07:00
mehrdadn
226b191864
Replace ps call with psutil ( #8851 )
...
* Replace ps call with psutil
* Minor formatting
Co-authored-by: Mehrdad <noreply@github.com>
Co-authored-by: Robert Nishihara <robertnishihara@gmail.com>
2020-06-11 00:41:35 -07:00
SangBin Cho
429c0507cf
[Serve] Serve long running test fix ( #8864 )
2020-06-11 00:41:03 -07:00
Edward Oakes
07112dd131
[serve] Fix long running failure test ( #8863 )
2020-06-11 00:40:56 -07:00
mehrdadn
e6215d224c
Change os.uname()[1] and socket.gethostname() to the portable and faster platform.node() ( #8839 )
...
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-11 00:40:04 -07:00
Simon Mo
51ba6d5112
Fix asyncio re-entry error message ( #8842 )
2020-06-11 00:38:46 -07:00
SangBin Cho
668442ea99
[MERGE TO MASTER] Add microbenchmark result.
2020-06-09 09:27:10 -07:00
Simon Mo
5a6dbcf134
Add release test running full asan python test ( #8836 )
2020-06-08 17:04:33 -07:00
SangBin Cho
16a1873880
Linting fix.
2020-06-08 09:34:14 -07:00
SangBin Cho
596a2c0bac
Bump up the version to 0.8.6
2020-06-08 09:33:30 -07:00
SangBin Cho
3388864768
[Core] Clean up detached actors ( #8759 )
2020-06-08 11:22:01 -05:00
fangfengbin
68718b33b4
GCS Server add SIGTERM signal handler ( #8795 )
2020-06-08 17:26:36 +08:00
chaokunyang
d04953ab3c
[Streaming] Union api ( #8612 )
2020-06-08 14:28:11 +08:00
mehrdadn
3ee2e9f7e5
Make #include consistent ( #8666 )
...
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-07 15:43:24 +02:00
mehrdadn
f68183d778
Error-checking for a couple of corruption issues ( #8059 )
...
* Extra error handling
* Handle connection closed in Redis monitor
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-07 15:43:00 +02:00
Siyuan (Ryans) Zhuang
a0247ffe55
Build plasma store as a library ( #8817 )
...
* build plasma store as a library
* remove unused headers
* windows support
2020-06-06 22:11:37 -07:00
Edward Oakes
5d124489a9
[serve] Require backend when creating endpoint ( #8764 )
2020-06-06 21:10:42 -05:00
Ian Rodney
b07b4f2e55
Revert "[autoscaler] Create Docker Command Runner" ( #8816 )
...
This reverts commit 54189bca5a.
2020-06-06 14:21:44 -07:00
Sven Mika
ad695a818b
Bug fix in the contextual bandit's linear_regression.py model. ( #8815 )
2020-06-06 22:47:42 +02:00
Eric Liang
be26a7b1b0
[rllib] Support for complex / variable-length observation spaces ( #8393 )
2020-06-06 12:22:19 +02:00
Stephanie Wang
b160b83d3e
[core] Queue subscription/unsubscription commands in the GCS ( #8756 )
...
* Only remove callback index if in map
* test
* Queue subscription commands
* lint
* Check status
* update
* update
* update
* Disable GCS restart tests
* lint
2020-06-05 19:49:19 -07:00
Ian Rodney
54189bca5a
[autoscaler] Create Docker Command Runner ( #8806 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-06-05 17:29:27 -07:00
Alex Wu
488c06e92d
Updated gitignore for tags and emacs ( #8809 )
2020-06-05 17:09:18 -07:00
Alex Wu
2c485a2598
[autoscaler] Command runner interactivity ( #7198 )
2020-06-05 17:08:38 -07:00
Edward Oakes
7bfce5c027
[serve] Clarify OMP_NUM_THREADS behavior ( #8740 )
2020-06-05 15:39:37 -05:00
Edward Oakes
c0df913b19
[serve] [docs] Cleanup splitting traffic, add A/B testing and incremental rollout ( #8741 )
2020-06-05 15:39:09 -05:00
Sven Mika
25c0974543
[RLlib] Issue 8412 (Adam vars not stored in ModelV2). ( #8480 )
2020-06-05 21:07:02 +02:00
krfricke
e62c1d2051
[tune] Use scientific notation in tune dashboard ( #8782 )
...
Co-authored-by: Kai Fricke <kai@anyscale.com>
2020-06-05 10:41:07 -07:00
Anil Choudhary
1dda659918
[tune]TrialRunner wait on global checkpoint syncdown ( #8798 )
2020-06-05 10:39:59 -07:00
Edward Oakes
4155d5830f
[serve] Replace actor error retry logic with max_task_retries ( #8768 )
2020-06-05 10:45:28 -05:00
Sven Mika
c74dc58f8b
[RLlib] Fix use_lstm flag for ModelV2 (w/o ModelV1 wrapping) and add it for PyTorch. ( #8734 )
2020-06-05 15:40:30 +02:00
mehrdadn
d78757623d
bazel build --compilation_mode=debug ( #6457 )
2020-06-05 14:36:10 +02:00
chaokunyang
4a6c4003fd
[Java] fix serializer test location ( #8730 )
2020-06-05 15:02:08 +08:00
Sven Mika
97d524c075
[RLlib] Issue 8769 broken OOM tests_dir cases (R & S). ( #8770 )
2020-06-05 08:34:21 +02:00
Amog Kamsetty
9410e5884d
[Tune] Parametrize Cloud Syncing Frequency ( #8771 )
2020-06-04 18:55:50 -07:00
Ian Rodney
d452932740
[autoscaler] Improve Logtimer log messages ( #8753 )
2020-06-04 18:07:27 -07:00
Edward Oakes
c1a97c8c04
[Doc] clarify delete in serve docs ( #8765 )
2020-06-04 15:22:30 -07:00
Sven Mika
368088be85
[RLlib] Sample batch docs and cleanup. ( #8778 )
2020-06-04 22:47:32 +02:00
Victor Le
aee01133cd
Fix dict/tuple hybrid action space for tensorflow eager execution ( #8781 )
2020-06-04 13:28:46 -07:00
Ameer Haj Ali
d966d98729
cleanup to support provider's custom ssh command runner ( #8720 )
...
* cleanup to support provider's custom ssh command runner
* clean up
* trailing white spaces fix
* k8s signature fix
Co-authored-by: Ameer Haj Ali <ameerhajali@Ameers-MacBook-Pro.local>
2020-06-04 13:27:17 -07:00
Ian Rodney
09f89ff49d
[autoscaler] Improve SSH Command Failure Logging ( #8751 )
2020-06-04 12:38:20 -07:00
SangBin Cho
e372c06257
Hotfix dashboard broken tests. ( #8757 )
2020-06-04 09:44:00 -07:00