Commit graph

8847 commits

Author SHA1 Message Date
Simon Mo
4a4210a083
Support streaming output of runtime env setup to logger/driver (#17306) 2021-07-27 16:39:15 -07:00
Edward Oakes
7225f28fff
[serve] Add Ray API stability annotations (#17295) 2021-07-27 16:00:15 -05:00
DK.Pino
2699b0f3ab
[Placement Group] Fix resource index assignment between with bundle index and without bundle index pg (#17318) 2021-07-27 13:51:02 -07:00
SangBin Cho
e1cd8580a0
[Test] Add various fixes to the nightly dashboard to improve signals (#17351)
* Add various fixes to the nightly dashboard to improve signals

* Fix issues
2021-07-27 12:37:11 -07:00
Alex Wu
5879e3132e
[Dataset] Support compressed files (#17355)
* .

* lint

* .

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-07-27 12:35:16 -07:00
Sven Mika
90b21ce27e
[RLlib] De-flake 3 test cases; Fix config.simple_optimizer and SampleBatch.is_training warnings. (#17321) 2021-07-27 14:39:06 -04:00
Eric Liang
e70d84953e
[hotfix] Dataset tests accidentally disabled 2021-07-27 10:40:15 -07:00
Jiao
9eb1bcd061
[serve] Multi & single deployment large scale test (#17310) 2021-07-27 10:46:45 -05:00
Frank Luan
a6e8497dc9
[Dataset] Sort (#17142) 2021-07-27 01:53:53 -07:00
fyrestone
57b9b1bb0f
[Dashboard] Use a dedicated RPC to check the GCS is alive (#16330)
* Dashboard check gcs is alive

* Fix dashboard hangs at exit

* ray health-check call GCS CheckAlive

* Minor fixes

Co-authored-by: 刘宝 <po.lb@antfin.com>
2021-07-27 14:05:44 +08:00
Richard Liaw
597dc08dfe
Revert "Revert "[core] remove opencensus/prometheus_exporter dependencies"" (#17254)
* Revert "Revert "[core] remove opencensus/prometheus_exporter dependencies" (#17251)"

This reverts commit 7b44dd8ecb.

* Lint

* Fix more imports

Co-authored-by: Kai Fricke <kai@anyscale.com>
2021-07-26 21:09:25 -07:00
DK.Pino
684e2b28e9
Placement group bug fix (#17320) 2021-07-26 21:03:35 -07:00
Stefan Schneider
489febc6b2
[RLlib] Better example scripts: Description --no-tune and --local-mode CLI options (#17038) 2021-07-26 22:25:48 -04:00
Dmitri Gekhtman
d0e58af075
[autoscaler] Avoid race in no-updaters logic (#17328)
* Extra logic and test

* anglish
2021-07-26 16:05:33 -04:00
architkulkarni
bcb3a6789b
[Core] [runtime env] Cache created runtime envs (#17342) 2021-07-26 14:37:40 -05:00
dependabot[bot]
4bf377ee4b
[tune](deps): Bump gym[atari] in /python/requirements/tune (#17199)
Bumps [gym[atari]](https://github.com/openai/gym) from 0.18.0 to 0.18.3.
- [Release notes](https://github.com/openai/gym/releases)
- [Commits](https://github.com/openai/gym/compare/0.18.0...0.18.3)

---
updated-dependencies:
- dependency-name: gym[atari]
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-07-26 10:53:41 -07:00
architkulkarni
756a4e7a90
[Core] [runtime env] update tests to use ray.init(runtime_env=...) and add e2e test (#17232) 2021-07-26 11:21:30 -05:00
Edward Oakes
58423e6018
[serve] Improve nightly release test (#17277) 2021-07-26 11:15:46 -05:00
Julius Frost
16be091702
[RLlib] Refactor if __name__ == "__main__" into main() method in rollout/train.py for better reusability (#17315) 2021-07-26 11:12:59 -04:00
Sven Mika
5231fdd996
[Testing] Split RLlib example scripts CI tests into 4 jobs (from 2). (#17331) 2021-07-26 10:52:55 -04:00
Tao Wang
d98ec7fc4d
Remove libray_redis_module (#17283) 2021-07-25 23:15:29 -07:00
matthewdeng
fdbeef6046
[SGD] RaySGD v2 skeleton code (#17300)
* [SGD] RaySGD v2 skeleton code

* add build file

* move file

* empty

* rename

* address comments

* add method interfaces

* move BUILD file out of tests dir

Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-07-25 17:39:24 -07:00
Sven Mika
0c5c70b584
[RLlib] Discussion 247: Allow remote sub-envs (within vectorized) to be used with custom APIs. (#17118) 2021-07-25 16:55:51 -04:00
Chris Bamford
29768a7c01
[RLLib] (P1 regression) Fixing view requirements in compute actions (#15856) 2021-07-25 14:25:07 -04:00
ddworak94
fba8461663
[RLlib] Add RNN-SAC agent (#16577)
Shoutout to @ddworak94 :)
2021-07-25 10:04:52 -04:00
Richard Liaw
d723ea8ef2
fix (#17311)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-07-24 10:38:46 -07:00
Antoni Baum
b500a651b7
[docs] Add LightGBM Tune integration to docs (#17304)
* Add LightGBM integration to docs

* Fix
2021-07-23 21:21:13 -07:00
Yi Cheng
93be44eebf
[workflow] Fix more usability issues (#17305)
* up

* up

* up

* up

* up

* up

* fix test error

* up
2021-07-23 20:26:44 -07:00
Edward Oakes
2142abae57
[Serve] Properly support runtime_env working_dir (#16480) 2021-07-23 17:35:32 -07:00
Jiao
9b6be6f1c8
update dask compatibility for 1.5.0 (#17302)
* update dask compatibility for 1.5.0

* change to right file

* add pip install pytest

Co-authored-by: Jiao Dong <jiaodong@anyscale.com>
2021-07-23 17:31:42 -07:00
Kai Fricke
8db61569f1
[tune] Fix HDFS sync down template (#17291) 2021-07-23 13:01:14 -07:00
Yi Cheng
29352e7fa3
[workflow] Fix some usability issues (#17284) 2021-07-23 11:39:49 -07:00
Edward Oakes
cc38ffb0a0
[debugger] Add doc entry for the --ray-debugger-external flag (#17294) 2021-07-23 12:00:30 -05:00
Eric Liang
df7fe8dd6d
[data] Cleanup Block type by dropping Generic[T] (#17276)
* wip

* update

* update

* quotes
2021-07-23 09:23:06 -07:00
Lixin Wei
ded239205f
[Core] Close RPC Server After GcsHearbeatManager (#17238) 2021-07-23 09:12:13 -07:00
Dmitri Gekhtman
e701ded54f
[autoscaler] Tweaks to support remote (K8s) operators (#17194)
* node provider hooks

* disable node updaters

* pending means not completed

* draft wip

* add flag to autoscaler initialization

* Explain

* terminate unhealthy nodes

* fix, add event summarizer message

* Revert node provider

* remove hooks from autoscaler.py

* avert indent apocalypse

* wip

* copy-node-termination-logic

* Added a test

* Finish tests

* test cleanup

* Move disable node updaters to config yaml

* fix

* Drop arg
2021-07-23 11:30:18 -04:00
Edward Oakes
811eb4b092
[debugger] Enable attaching to breakpoints on remote nodes (off by default) (#17275) 2021-07-23 09:37:40 -05:00
Siyuan (Ryans) Zhuang
57b2328e7b
[workflow] Virtual actor writer - Part I (#17256)
* update readonly virtual actor

use signature module

refactoring workflow

new execution interface

advance progress of a workflow

update storage

last_step_of_workflow

prevent setting dynamic output of "output.json" in workflow directory

use alternative exception

* fix

* fix comments

* better step names

* add TODO

* fix comments

* log errors when retry

* fix storage test
2021-07-22 22:53:04 -07:00
lantian-xu
daf37b7621
[Streaming] Fix illegal cast when rollbacking. (#17195)
Co-authored-by: yz54123 <57480840+yz54123@users.noreply.github.com>
2021-07-23 13:08:34 +08:00
Clark Zinzow
1ab4f0def7
[Datasets] Port read_binary_files to Datasource API. (#17225) 2021-07-22 19:03:10 -07:00
Yi Cheng
5f4d9085d2
[workflow] workflow ci enable (#17255)
* Enable workflow tests

* update

* Fix one bug
2021-07-22 17:59:24 -07:00
Simon Mo
b9b79cd5f4
[Runtime Env] Support per task/actor uri override job_config (#17252) 2021-07-22 16:37:43 -07:00
Edward Oakes
f6375cbb7c
[core] Fix bazel test sizes for C++ unit tests (#17272) 2021-07-22 17:38:56 -05:00
mwtian
b8e71f641c
[Build] Ray Docker image for Python 3.9. (#16571) 2021-07-22 13:38:57 -07:00
Richard Liaw
2ce73ce843
[docs] Revert #16919 and fix documentation build (#17270) 2021-07-22 13:03:38 -07:00
Dmitri Gekhtman
e5925daed3
fix (#17268) 2021-07-22 15:22:52 -04:00
Clark Zinzow
cff7596ea1
[Core] Update locality protocol comment. (#17267) 2021-07-22 11:43:01 -07:00
Simon Mo
aaf8afb78d
[Runtime Env] Add a test for working_dir inheritance (#17245) 2021-07-22 10:48:25 -07:00
Chen Shen
c691f73d87
[core][usability] fix noisy push related log (#17250) 2021-07-22 09:33:08 -07:00
Chen Shen
7736d06399
[core][easy] remove unused code in buffer_pool 2021-07-22 09:31:20 -07:00