Commit graph

8058 commits

Author SHA1 Message Date
Dmitri Gekhtman
95c3d88cac
[autoscaler][kubernetes] Helm chart (#15614) 2021-05-17 16:55:10 -07:00
Amog Kamsetty
c97594aca3
[CI] Update dependencies on travis flaky build (#15858) 2021-05-17 16:07:00 -07:00
Dmitri Gekhtman
c2b8381015
[autoscaler][gcp] Migrate GCP config to available node types (#15805) 2021-05-17 15:45:47 -07:00
dependabot[bot]
434465e477
[tune](deps): Bump gpy from 1.9.9 to 1.10.0 in /python/requirements/tune (#15850)
Bumps [gpy](https://github.com/SheffieldML/GPy) from 1.9.9 to 1.10.0.
- [Release notes](https://github.com/SheffieldML/GPy/releases)
- [Changelog](https://github.com/SheffieldML/GPy/blob/devel/CHANGELOG.md)
- [Commits](https://github.com/SheffieldML/GPy/compare/v1.9.9...v1.10.0)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-05-17 11:33:11 -07:00
Alex Wu
3744026897
Fix test_scheduling (#15823)
* done

* Update python/ray/tests/test_scheduling.py

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>

* Update python/ray/tests/test_scheduling.py

Co-authored-by: Ian Rodney <ian.rodney@gmail.com>

* lint

Co-authored-by: Alex Wu <alex@anyscale.com>
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
2021-05-17 10:08:01 -07:00
Alex Wu
3e94114336
Namespaces (#15774) 2021-05-17 10:04:22 -07:00
Sven Mika
f25d58492d
[Testing] Dependabot for RLlib. (#15812) 2021-05-17 18:24:13 +02:00
architkulkarni
78c26ac9fe
[runtime_env] Use sys executable in shim if conda not specified (#15834) 2021-05-17 11:19:58 -05:00
lanlin
5d2ed47978
[tune] Allow to set buffer_length via tune.run (#15810) 2021-05-17 13:11:26 +01:00
wzl
5247c0a5b8
[doc] Fix typo (#15828) 2021-05-16 16:08:14 -07:00
Sven Mika
d89fb82bfb
[RLlib] Add simple curriculum learning API and example script. (#15740) 2021-05-16 17:35:10 +02:00
Sven Mika
ebc6d8692a
[RLlib] Docs: Example scripts and blogs documentation update. (#15763) 2021-05-16 15:24:38 +02:00
Sven Mika
469f5227da
[RLlib] CQL bug fix: Normalize actions for atanh in BC part of the CQL loss. (#15814) 2021-05-16 15:21:06 +02:00
Sven Mika
bc09e75b78
[RLlib] Fix 3 flakey test cases. (#15785) 2021-05-16 12:20:33 +02:00
Edward Oakes
cd32a92edc
[serve] Avoid exporting actor class for every replica (#15788) 2021-05-15 09:04:09 -05:00
fcardoso75
b3428bd09e
Adjust bazel number of build jobs (#15784) 2021-05-14 21:33:14 -07:00
Dmitri Gekhtman
d1b1ae0f45
[test][client][dask] Run dask tests in client mode. (#15806) 2021-05-14 17:15:59 -07:00
Edward Oakes
f6be6dbcdc
[Serve] batch slow warning for multiple replicas (#15798) 2021-05-14 13:12:32 -07:00
Ian Rodney
00c913cbc6
[Flaky] Mark test_nested_observation_spaces as Flaky (#15794) 2021-05-14 12:08:52 -07:00
Ian Rodney
7b1c5dbe0a
[Hotfix][Lint] Pin other ESlint Deps (#15816) 2021-05-14 09:18:43 -07:00
Ian Rodney
ec5322a463
[Client] ray.client.connect() and ray.ClientBuilder (#15706) 2021-05-14 00:08:39 -07:00
Ian Rodney
395c0ea03c
[Testing] Pin Tensorflow Version in requirements.txt (#15799) 2021-05-13 17:09:15 -07:00
Edward Oakes
28f2962bb2
[serve] Add helpful log messages when deploying (#15689) 2021-05-13 18:10:23 -05:00
Ian Rodney
42f99541d4
[Tests] Mark test_scheduling & test_memstat as Flaky (#15789) 2021-05-13 15:46:12 -07:00
Edward Oakes
6a0f087643
[serve] Randomly shuffle replicas to avoid cross-handle synchronization (#15792) 2021-05-13 17:19:27 -05:00
Richard Liaw
c624e89483
[tune] Support numpy types in TBXlogger (#15760) 2021-05-13 14:54:47 -07:00
Edward Oakes
77d713ac78
[serve] Fix shutdown logic + add test (#15790) 2021-05-13 16:43:07 -05:00
Edward Oakes
d107cca1aa
[serve] Don't deserialize backend classes in the controller (#15741) 2021-05-13 16:01:09 -05:00
Ian Rodney
859703e993
[RuntimeEnv] Log which file caused an Exception (#15772) 2021-05-13 13:48:59 -07:00
mwtian
5462c6e7de
Fix link to release checklist from release process doc. (#15793) 2021-05-13 13:34:54 -07:00
Simon Mo
838cfec122
[Tracing] Fix kwargs replacement (#15742) 2021-05-13 12:44:35 -07:00
Ian Rodney
82876ecc2a
[rllib] [testing] make kill failure non fatal (#15771) 2021-05-13 12:24:49 -07:00
mwtian
dce13d3a81
Explicitly set protobuf dependency version to allow building ray with bazel 4.0.0 (#15756)
Java protobuf dependency version is made to be consistent as well.
2021-05-13 10:34:09 -07:00
SangBin Cho
259fcbd5bd
[Pubsub] Generalize the pubsub interface and adapt it for ref counting protocol (#15446)
* Add mock code first

* In the initial progress.

* Fix the number error

* In progress.

* in more pgoress.

* in progress.

* lint.

* Prototype done.

* Fix compilation bug.

* Now it is working with reference counting.

* Remove template.

* lint.

* Fixed issues.

* Fix reference count test.

* Reference count test passes now.

* Fixed the test array problem

* Addressed code review.

* lint.

* Addressed half of code review.

* Fix tests.

* Addressed the most critical issue.

* Make subscriber thread-safe.

* Revert "Make subscriber thread-safe."

This reverts commit 9a6a52197cfa8463ab60dfaae9530ad3c0ed8790.

* Fixed test failures. The only failure now is the asan failure.

* Reset test suites and see if it fixes the issue.

* Fix a flaky test

* Addressed code review.
2021-05-13 09:29:02 -07:00
architkulkarni
a0c1cfe034
[Core] Pass RuntimeEnv as opaque string in the task spec (#15658) 2021-05-13 10:32:00 -05:00
Simon Mo
d97ff80149
[Core] Fix global state keep initializing in busy loop (#15777) 2021-05-13 10:30:49 -05:00
SongGuyang
40b2face74
Fix std::atomic compiling error (#15781) 2021-05-13 10:27:45 -05:00
Tao Wang
19462e43d6
[large scale]use proxy to track gcs server address in core worker (#15714) 2021-05-13 19:26:01 +08:00
fcardoso75
c877da4c19
create_and_mmap_buffer() - In case CreateFileMapping() fails, GetLastError() return code is printed (#15773)
* Enabling all test cases on test_client.py

* Moving test_client.py to a large CI py_test_module_list

* Disabling test_client::test_remote_functions

* Divide Run CI script action into separete Build action and Test action

* Reverting test_client.py to separate work for different tickes

* Reverting python\ray\tests\BUILD to separate work for different tickets

* create_and_mmap_buffer() - In case CreateFileMapping() fails, GetLastError() return code is printed

* Addressed lint comments

Co-authored-by: Philipp Moritz <pcmoritz@gmail.com>
2021-05-13 00:31:33 -07:00
Richard Liaw
ec27f9cc6b
[tune] avoid system exit error (#15687)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-05-13 00:24:30 -07:00
Sven Mika
c4a3e1589b
[RLlib] CQL: Bug fixes and OPE example added to test and offline_rl.py example. (#15761) 2021-05-13 09:17:23 +02:00
Philipp Moritz
78d0ed3503
[Windows] Fix Test Log Upload (#15769) 2021-05-12 17:57:49 -07:00
Ian Rodney
cdf93930f3
Revert "[Core] Fix event loop instrumentation causing Java segfaults in tests. (#15349)" (#15727)
This reverts commit edb0d1b376.
2021-05-12 15:49:06 -07:00
SangBin Cho
2fc892166f
[Test] unskip global gc (#15767) 2021-05-12 15:45:59 -07:00
Ian Rodney
6e3ec98b72
[test] Clean-Up Test-Top-Level API (#15713) 2021-05-12 13:50:17 -07:00
mwtian
6a044f4f30
[Test] Ensure output params are initialized before calling IsPlasmaObjectPinnedOrSpilled() (#15758) 2021-05-12 10:22:35 -07:00
Dmitri Gekhtman
052d2acaee
[autoscaler][kubernetes] Restart after head failure, more consistent operator restart behavior. (#15655) 2021-05-12 11:49:11 -05:00
Sven Mika
16ddab49f5
[RLlib] Trainer._evaluate -> Trainer.evaluate; Also make evaluation possible w/o evaluation worker set. (#15591) 2021-05-12 12:16:00 +02:00
fyrestone
56c309416e
[Job submission] Basic job submission structure (#15103) 2021-05-12 15:08:20 +08:00
fcardoso75
fcf56fb970
Divide run ci script action (#15657)
* Enabling all test cases on test_client.py

* Moving test_client.py to a large CI py_test_module_list

* Disabling test_client::test_remote_functions

* Divide Run CI script action into separete Build action and Test action

* Reverting test_client.py to separate work for different tickes

* Reverting python\ray\tests\BUILD to separate work for different tickets
2021-05-11 21:39:18 -07:00