Commit graph

4951 commits

Author SHA1 Message Date
Eric Liang
536795ef79
[autoscaler] Initial support for multiple worker types (#9096)
* wip

* fix

* update

* debug state

* fix

* update

* update

* fix test

* fix

* fix

* update

* fix

* types and docs

* update
2020-06-24 23:08:30 -07:00
Eric Liang
0ff24ec8dc
Add "ray status" debug tool for autoscaler. (#9091) 2020-06-24 18:22:03 -07:00
Siyuan (Ryans) Zhuang
80bcbe20c7
[Core] Remove object notification IPC between Plasma and Raylet (initial step) (#8939)
* initial refactoring

redirect notifications to eventloop

implement direct notifications

* protect vector with mutex
2020-06-24 13:54:40 -07:00
ElektroChan89
4bc1d7c043
[docs] Update grid_random.rst (#9102) 2020-06-24 12:36:42 -07:00
Ian Rodney
67e049bc7b
[autoscaler/docker] Relax missing docker section (#9105) 2020-06-24 12:35:42 -07:00
mehrdadn
0487c250e8
Windows compatibility (#93)
Co-authored-by: mehrdadn <mehrdadn@users.noreply.github.com>
Co-authored-by: Mehrdad <noreply@github.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-06-24 11:27:28 -07:00
SongGuyang
cbf38573bd
C++ worker API refactoring (#9053) 2020-06-24 19:33:46 +08:00
Sven Mika
aa231799ed
Dyna test: small -> medium. (#9118) 2020-06-24 12:02:44 +02:00
Siyuan (Ryans) Zhuang
613abdf1b6
Remove arrow macros in plasma store (#9115) 2020-06-23 23:34:44 -07:00
mehrdadn
98d68b31c3
Suppress GRPC SO_REUSADDR warning on Linux (#7972)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-23 21:58:43 -07:00
Alex Wu
c152730e4a
[Core] Log output from different jobs to different drivers. (#8885)
* .

* .

* Correct now

* No interactivity errors

* format

* Filtering

* lint

* .

* No more filtering

* Removed interactivity

* .

* .

* .

* .

* .

* .

* Redirection works

* formatting

* something broken?

* .

* Works

* formatting

* redirect output

* formatting

* formatting

* Fix file descriptor leakage

* format

* .

* .

* .

* .

* .

* Refactor

* .

* Only run on job switch

* .

* cleanup

* .

* ...

* Review

* .

* .

* .

* .

* whoops

* .

* Should fix bug

* .

* .

* addressed comments

* formatting

* formatting

* Fix typo

* .

* .

* .

* .

Co-authored-by: Ubuntu <ubuntu@ip-172-31-14-33.us-west-2.compute.internal>
2020-06-23 18:45:32 -07:00
henktillman
acb3280235
GCP credentials (#9052) 2020-06-23 17:11:18 -07:00
goulou
9b4428c668
[ASHA][docs] Change default value of "brackets" (#9101)
Closes #9058
2020-06-23 14:43:27 -07:00
Siyuan (Ryans) Zhuang
acb7270bd7
Adopt upstream plasma changes (#9061)
* adopt upstream plasma changes
2020-06-23 14:19:57 -07:00
Amog Kamsetty
649a08926d
Fix link to serve (#9109) 2020-06-23 16:06:24 -05:00
Max Fitton
7904235517
Node Info Functional Components (#9073) 2020-06-23 13:11:32 -07:00
Tanay Wakhare
f77c638d6d
Pytorch AttentionNet (#9088) 2020-06-23 20:42:30 +02:00
Edward Oakes
c9010eb8ad
[serve] Add serve.shutdown() (#8766) 2020-06-23 13:42:03 -05:00
Simon Mo
b6d425526d
Move actor task submission to io service (#9093) 2020-06-23 10:07:33 -07:00
Siyuan (Ryans) Zhuang
306ca75737
Fix ray arrow logs (#9097)
* convert arrow logs to ray logs

* remove extra plasma tests and modules
2020-06-23 10:02:30 -07:00
Michael Luo
cf0894d396
[rllib] MAML Agent (#8862)
* Halfway done with transferring MAML to new Ray

* MAML Beta Out

* Debugging MAML atm

* Distributed Execution

* Pendulum Mass Working

* All experiments complete

* Cleaned up codebase

* Travis CI

* Travis CI

* Tests

* Merged conflicts

* Fixed variance bug conflict

* Comment resolved

* Apply suggestions from code review

fixed test_maml

* Update rllib/agents/maml/tests/test_maml.py

* asdf

* Fix testing

Co-authored-by: Sven Mika <sven@anyscale.io>
2020-06-23 09:48:23 -07:00
Xianyang Liu
b449ece2ea
[SGD] Variable worker CPU requirements (#8963) 2020-06-23 00:43:27 -07:00
chaokunyang
acd765cb22
[Streaming] fix source loop (#9085) 2020-06-23 11:57:06 +08:00
Zhilei Chen
8f2564f1a6
fix a bug that move a const variable (#9080) 2020-06-23 11:54:18 +08:00
Alex Wu
40c15b1ba0
[ParallelIterator] Fix for_each concurrent test cases/bugs (#8964)
* Everything works

* Update python/ray/util/iter.py

Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>

* .

* .

* removed print statements

Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2020-06-22 18:26:45 -07:00
Edward Oakes
b88059326d
[docs] Rotate serve to top of sidebar (#9090) 2020-06-22 17:52:11 -07:00
Richard Liaw
e2330ffc35
[sgd] Cleanup code from last PR (#9076) 2020-06-22 15:17:07 -07:00
Ian Rodney
f76552d8db
[autoscaler] Expose cache_stopped_nodes in Example YAML (#8981) 2020-06-22 15:02:45 -07:00
Ian Rodney
b942bcd798
[Tune] remove whitelist from deep_copy (#8997) 2020-06-22 15:02:27 -07:00
Amog Kamsetty
f95ab4f506
[Testing] Multi-node Training+Tune Long Running Test (#8966) 2020-06-22 14:49:16 -07:00
Siyuan (Ryans) Zhuang
7a110b9401
[Core] Remove digests in plasma (4x performance improvement) (#8980)
* remove digest in plasma

* totally remove list
2020-06-22 14:24:32 -07:00
mehrdadn
275da2e400
Fix Google log directory again (#9063) 2020-06-22 14:56:28 -05:00
SangBin Cho
2154f38ae5
[Dashboard] Update the Ray dashboard documentation to explain memory view. (#8945) 2020-06-22 13:50:32 -05:00
mehrdadn
1a40d24174
Handle loop_ NULL case (#9067)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-22 11:05:29 -07:00
chaokunyang
b5fafe1fce
[Java] fix java test (#9079) 2020-06-22 16:56:47 +08:00
Max Fitton
1d5c123c81
[Hotfix] Fix lint mistake accidentally pushed to master (#9077)
Co-authored-by: Max Fitton <max@semprehealth.com>
2020-06-21 19:19:07 -07:00
Richard Liaw
acdd873481
[docs/sgd] Fix test failure + make slack link large (#9051) 2020-06-21 15:55:06 -07:00
Simon Mo
2b5119218e
[Serve] Raise exception when _scale_replicas is infeasible (#9005) 2020-06-21 15:38:58 -07:00
SangBin Cho
e254dd3115
Do not add reference count when it is local mode. (#8979) 2020-06-21 16:01:06 -05:00
mehrdadn
fc4684d3ca
Update pandas to 1.0.5 (#9065)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-21 14:35:25 -05:00
Ameer Haj Ali
76583d4a3a
Creating head node might take more than 50-30 seconds to show up. (#9066) 2020-06-21 11:12:37 -07:00
Vishnu Deva
432ce1be50
[tune] fix for sync_on_checkpoint bug (#9057)
* #9056 fix for sync_on_checkpoint bug

* fix for failing checks

* update help string
2020-06-21 01:07:11 -07:00
Richard Liaw
e6ee39a6a3
[tune] checkpoint_dir test (#8024) 2020-06-20 17:56:24 -07:00
Allen
8fa584a445
[autoscaler] Run ray stop on cluster before tearing it down (#8922)
Co-authored-by: Allen Yin <allenyin@anyscale.io>
2020-06-20 17:02:34 -07:00
mehrdadn
981f67bfb0
Fix more Windows issues (#9011)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-19 18:51:45 -07:00
mehrdadn
f8d49d69c1
Fix and merge asio client read/write operations (#9026)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-19 18:49:55 -07:00
Amit Sadaphule
f0b7de7cfe
Fix '//:redis_client' build on RHEL 7.6 ppc64le (#9035)
Co-authored-by: Mehrdad <noreply@github.com>
2020-06-19 16:28:37 -07:00
Sven Mika
2589309cf0
[RLlib] Make sure torch and tf behave the same wrt conv2d nets. (#8785) 2020-06-20 00:05:19 +02:00
Max Fitton
ca66f88b96
Sortable/Groupable Memory Dashboard (#9014) 2020-06-19 16:26:35 -05:00
Max Fitton
ad09aa985c
Make Dashboard Port Configurable (#8999) 2020-06-19 16:26:22 -05:00