Commit graph

7269 commits

Author SHA1 Message Date
dependabot[bot]
323c7da70c
[tune](deps): Bump matplotlib from 3.3.3 to 3.3.4 in /python/requirements (#14087)
Bumps [matplotlib](https://github.com/matplotlib/matplotlib) from 3.3.3 to 3.3.4.
- [Release notes](https://github.com/matplotlib/matplotlib/releases)
- [Commits](https://github.com/matplotlib/matplotlib/compare/v3.3.3...v3.3.4)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-02-17 19:31:07 -08:00
Amog Kamsetty
be7114639d
[Tune] Update Transformers Example (#14150)
Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-151.us-west-2.compute.internal>
2021-02-17 18:37:27 -08:00
qicosmos
2307f54e88
[C++ API] simplify code of Arguments; avoid memory copy (#13978) 2021-02-18 10:33:18 +08:00
EscapeReality846089495
5ce1d262a3
[tune] Fixed atomic_save w/ os.replace (#14089)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-17 15:48:39 -08:00
SangBin Cho
9451b4ea86
[Object Spilling] Fix the race condition. (#14149)
* Fix the race condition.

* done.

* Fix the lint issu.e

* fix issues. addressed comments.
2021-02-17 14:35:22 -08:00
Kai Fricke
a0f73cf3f7
[xgboost] Update XGBoost release test configs (#13941)
* Update XGBoost release test configs

* Use GPU containers

* Fix elastic check

* Use spot instances for GPU

* Add debugging output

* Fix success check, failure checking, outputs, sync behavior

* Update release checklist, rename mounts
2021-02-17 23:00:49 +01:00
Antoni Baum
58d7398246
[Tune] Add HEBOSearch Searcher (#13863)
* HEBO first pass

* Fix bad quotes

* Fixes

* Reproductibility

* Update python/ray/tune/suggest/hebo.py

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>

* Add hebo_example.py to BUILD

* Nit

* Update to pypi package

* Alphabetical HEBO requirement

* Fix syntax error

* Fix wrong space in hebo example

* Move validate_warmstart to utils

* Space assertion in HEBO

* Comment

* Apply suggestions from code review

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>

* Formatting

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-02-17 22:53:10 +01:00
Eric Liang
ee584e8328
Fix OSX error and re-merge unhandled exceptions handling (#14138) 2021-02-17 13:35:07 -08:00
dependabot[bot]
67bdccca41
[tune](deps): Bump smart-open from 4.0.1 to 4.2.0 in /python/requirements (#14158)
Bumps [smart-open](https://github.com/piskvorky/smart_open) from 4.0.1 to 4.2.0.
- [Release notes](https://github.com/piskvorky/smart_open/releases)
- [Changelog](https://github.com/RaRe-Technologies/smart_open/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/piskvorky/smart_open/compare/4.0.1...v4.2.0)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-02-17 12:03:22 -08:00
Sumanth Ratna
c1d68d7dd0
[docs] Remove sphinx-gallery example runtimes (#14141)
e7f65d9b21/doc/conf.py (L340)
2021-02-17 11:07:16 -08:00
Alex Wu
753083c617
[docs][autoscaler] Update AWS node config link (#14125) 2021-02-17 10:44:10 -08:00
Kathryn Zhou
f6b5e838fe
Add disk and network metrics to Prometheus and fix dashboard (#14144) 2021-02-17 10:27:14 -08:00
SangBin Cho
3a6a977803
Revert "[Ownership based object directory] Turn on by default. (#13964)" (#14148)
This reverts commit 04d2df40cd.
2021-02-16 22:42:58 -08:00
architkulkarni
d9124e9329
Revert "[Core]Fix ray.kill doesn't cancel pending actor bug (#14025)" (#14146)
This reverts commit 1754359281.
2021-02-16 17:22:25 -08:00
SangBin Cho
04d2df40cd
[Ownership based object directory] Turn on by default. (#13964) 2021-02-16 17:16:44 -08:00
SangBin Cho
1b1420e069
[Scheduler] Fix spillback is done deterministically. (#14096)
* update.

* Fix comments.

* Addressed code review.

* fix a test.

* Addressed last code review.

* d.

* done.
2021-02-16 16:46:16 -08:00
SangBin Cho
4d7ab3c886
[Doc] Ray logging document. (#14102)
* Initial draft done.

* Addressed code review.
2021-02-16 15:27:30 -08:00
SangBin Cho
5135661bdf
[Metrics] Add spilling stats (#14103)
* Add stats for object spilling.

* Formatting.

* addressed code review.
2021-02-16 15:26:04 -08:00
Barak Michener
edf24580a6
[ray_client]: Set gRPC max message size to 4GiB (#14063)
* [ray_client]: Set gRPC max message size to 4GiB

Change-Id: Id4d6887cdd90dd761dd25248f10f104701462667

* reduce size

Change-Id: I71625ed3cffd9d8b3d7d3d7a981bb4dda00ed0a1

* Update test_basic_2.py

* Update test_advanced.py

* Update test_basic.py

Co-authored-by: Eric Liang <ekhliang@gmail.com>
2021-02-16 14:32:23 -08:00
Simon Mo
33316d4f8f
Revert "Export additional metrics to Prometheus (#14061)" (#14134)
This reverts commit 82539f2da4.
2021-02-16 12:49:12 -08:00
Edward Oakes
019d84a9f3
[serve] [docs] High-level reorganization of the docs (#14120) 2021-02-16 14:07:56 -06:00
architkulkarni
3ce03a52bc
Revert "Revert "Revert "Unhandled exception handler based on local ref counti… (#14113)" (#14136)
This reverts commit e457872fe1.
2021-02-16 11:47:09 -08:00
SangBin Cho
b05f87d7b2
[Object Spilling] Share the same S3 session for smart_open spilling. (#13904) 2021-02-16 10:40:55 -08:00
Barak Michener
c43a64230e
[ray_client]: Fix mutual recursion (#14122) 2021-02-16 10:37:58 -08:00
SangBin Cho
684bb32cdf
Fix assert get_outer_ref None failed + Support better traceback. (#14126)
* in progress.

* Better exception handling & stacktrace.

* done.
2021-02-16 10:09:01 -08:00
Richard Liaw
864956f817
fix-skopt (#14116)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-16 14:36:19 +01:00
Eric Liang
e434ffe06c
[tune] Avoid crash in client mode when return results creating logdir (#14115) 2021-02-15 19:25:14 -08:00
Ian Rodney
350fb5b9d1
[autoscaler] Remove Hardcoded 8265 (#14112) 2021-02-15 18:04:00 -08:00
Patrick Ames
da0c2c99a0
[autoscaler] Fix bad reference error when specifying IamInstanceProfile by name in config. (#14083) 2021-02-15 16:29:36 -08:00
Jack Parker-Holder
ebb6e552d2
[tune] PB2 - add small constant (#14118) 2021-02-15 16:04:10 -08:00
Edward Oakes
5e763893ea
[serve] Don't overwrite self.handle in StarletteEndpoint (#14111) 2021-02-15 17:51:54 -06:00
SangBin Cho
4ad79ca963
[Object Spilling] Remove LRU eviction (#13977)
* done.

* formatting.

* done.

* done.
2021-02-15 14:24:53 -08:00
Eric Liang
e457872fe1
Revert "Revert "Unhandled exception handler based on local ref counti… (#14113)
* Revert "Revert "Unhandled exception handler based on local ref counting (#14049)" (#14099)"

This reverts commit b45ae76765.

* reomve test

* fix

* fix
2021-02-15 14:11:11 -08:00
Alex Wu
4846a6c2d0
Release process update (#13798) 2021-02-15 11:40:49 -08:00
architkulkarni
496dd297e5
skip test_basic_reconstruction_actor_task on win (#14110) 2021-02-15 10:17:33 -08:00
architkulkarni
0fb96a61fc
[Serve] Add support for variable routes (#13968) 2021-02-15 11:42:42 -06:00
Richard Liaw
4d727e4cdf
[tune] enable more tests (#13969)
* try-this

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* fix

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* test

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* fix-tests

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* address

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* fix

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* real-ray

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* fix-client

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* fix-race-condition

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* revert-new-tune-tests

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* Revert "revert-new-tune-tests"

This reverts commit 3866b920bc47ac4b5cb9dab8f7b9d50e4acdb27a.

* format

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* update

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* build

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-15 09:19:55 -08:00
architkulkarni
bcb51a27c6
[Serve] [Doc] Add version warning (#14001) 2021-02-15 11:16:01 -06:00
javi-redondo
b8b2d6410d
[docs] new Ray Cluster documentation (#13839)
Co-authored-by: Javier Redondo <javier@anyscale.com>
Co-authored-by: AmeerHajAli <ameerh@berkeley.edu>
2021-02-15 00:47:14 -08:00
Kathryn Zhou
82539f2da4
Export additional metrics to Prometheus (#14061) 2021-02-14 23:16:26 -08:00
SangBin Cho
b45ae76765
Revert "Unhandled exception handler based on local ref counting (#14049)" (#14099)
This reverts commit 9dc671ae02.
2021-02-14 22:08:32 -08:00
architkulkarni
75568f856c
skip restart and multi restart test on win (#14084) 2021-02-14 15:17:54 -08:00
Alex Wu
5636af8084
[hotfix] Fix mac build (#14075)
* .

* done?

* .

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-02-14 14:26:51 -08:00
Eric Liang
9dc671ae02
Unhandled exception handler based on local ref counting (#14049) 2021-02-12 22:58:38 -08:00
Erik Erlandson
ff1b26274e
[operator] expose RAY_CONFIG_DIR env var (fix #14074) (#14076) 2021-02-12 17:47:00 -08:00
architkulkarni
20f6cc2cb2
skip test_basic_reconstruction_put on win (#14082) 2021-02-12 15:47:00 -08:00
Clark Zinzow
c9a9d422c7
[OBOD] Disable the ownership-based object directory for all tests that use ray.objects(). (#14065) 2021-02-12 12:12:57 -08:00
Clark Zinzow
c7ff69f4bf
[OBOD] Add support for ownership-based object directory object recovery. (#14066) 2021-02-12 11:58:31 -08:00
Sven Mika
936cb5929c
[RLlib] Issue #13646: Rewards still not available in loss/json-output in certain situations when using the traj. view API. (#14036) 2021-02-12 10:07:44 +01:00
Dmitri Gekhtman
6644a0fe50
[autoscaler][kubernetes][docs] Updated Kubernetes Documentation (#14016)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-11 23:00:25 -08:00