7309 commits

Author SHA1 Message Date
Michael Luo
ec2c10309b
[RLlib] CQL for HalfCheetah-Random-v0 + Hopper-Random-v0 + CQL Bug Fixes (#14243) 2021-02-22 17:30:18 +01:00
Sven Mika
95ef04b71a
[RLlib] Implement TorchPolicy.export_model. (#13989) 2021-02-22 17:09:40 +01:00
Antoni Baum
ffbba8e699
[Tune] Batch suggestions for HEBO (#14246)
* Batch suggestions for HEBO

* Better documentation
2021-02-22 14:24:37 +01:00
DK.Pino
7647d60fa9
[Placement Group] Support named placement group java api & Refactor construct method (#13821) 2021-02-22 20:12:09 +08:00
Sven Mika
3d20d58c90
[RLlib] Tune trial + checkpoint selection example. (#14209) 2021-02-22 12:52:37 +01:00
SangBin Cho
de8d9d3e44
[Test] Skip test_load_balancing_under_constrained_memory on Windows (#14242)
* Skip the window test.

* Remove unrelated changes.

* Remove unrelated changes.
2021-02-21 23:32:48 -08:00
Kai Yang
e75b143faf
[Core] Some small fixes and improvements (#14210) 2021-02-22 12:02:30 +08:00
Ryan Sander
8b5310a4e6
Fixed "multit-threaded" --> "multi-threaded" (#14236) 2021-02-21 19:25:51 -08:00
Dmitri Gekhtman
090970bdf5
[autoscaler] Max worker default infinity (#14201)
* random doc typo

* max-worker-default-inf

* fix

* -1 means infinity

* doc

* comment tweak

* fix random typo

* Cluster max-worker default

* fix

* typo

* test

* Git add the test

* doc-tweak

* rest of the test logistics

* periods in doc

* Address comments

* docstring
2021-02-22 05:14:00 +02:00
Richard Liaw
9eb79727aa
[tune] Support extending BOHB/Hyperband runs past max_t (#14171)
* initial-commit-to-support

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* basic-test

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* ok

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>

* smoke-test

Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-20 14:28:59 -08:00
Kai Yang
d8c32be449
[Core] Simplify system config passing from Raylet to workers (#13860) 2021-02-20 20:20:13 +08:00
Stephanie Wang
a4d7792c0e
[core] Fix bugs in admission control again (#14222)
* Track which pull bundle requests are ready to run

* Regression test

* Reset retry timer on pull activation, don't count created objects towards memory usage, abort objects on pull deactivation

* Revert "Track which pull bundle requests are ready to run"

This reverts commit b5d0714783fa2fc842bdd4e2d2802228e25f03c2.

* Check object active before receiving chunk

* lint

* debug, unit test, fix race condition

* lint

* update

* lint

* fix

* fix build

* fix test

* remove print

* Fix bug in bytes accounting

* Split
2021-02-19 18:07:57 -08:00
SangBin Cho
5fcbf02bae
Fix. (#14218) 2021-02-19 18:06:34 -08:00
Qing Wang
b4f1cb86d3
[Java] Make options.resources copyable. (#14207) 2021-02-20 10:00:09 +08:00
Eric Liang
58f8c4b23a
Handle unhandled exception handler == nullptr in Java (#14221) 2021-02-19 16:54:41 -08:00
SangBin Cho
5740b2391e
Add multi node data processing cluster.yaml (#14198) 2021-02-19 16:16:55 -08:00
SangBin Cho
296792f963
Revert "[core] Fix bugs in admission control (#14157)" (#14217)
This reverts commit 94a819d00e.
2021-02-19 11:58:17 -08:00
Eric Liang
6a0b306221
fix stack (#14193) 2021-02-19 11:52:40 -08:00
Eric Liang
cc156f7b3c
Fix deadlock in unhandled exception handler and re-merge (#3) (#14192) 2021-02-19 11:52:09 -08:00
Amog Kamsetty
3ffe375a09
[Tune] Raise error when PBT is used with search algorithm (#14176) 2021-02-19 09:41:30 -08:00
Kai Yang
9eb45fd1c4
[Java] Simplify Java unit test configuration (#12823) 2021-02-19 21:42:31 +08:00
chaokunyang
f8a36eb350
[Java] Add java api overload doc and test (#14204) 2021-02-19 19:46:35 +08:00
Kai Yang
ec344b87c7
[Core] Fix grpc server is started check (#14183) 2021-02-19 16:48:28 +08:00
Stephanie Wang
94a819d00e
[core] Fix bugs in admission control (#14157)
* Track which pull bundle requests are ready to run

* Regression test

* Reset retry timer on pull activation, don't count created objects towards memory usage, abort objects on pull deactivation

* Revert "Track which pull bundle requests are ready to run"

This reverts commit b5d0714783fa2fc842bdd4e2d2802228e25f03c2.

* Check object active before receiving chunk

* lint

* debug, unit test, fix race condition

* lint

* update

* lint

* fix

* fix build

* fix test

* remove print

* Fix bug in bytes accounting
2021-02-18 20:39:00 -08:00
Oscar Bonilla
632023b825
[docs] Fix broken link in README.md (#14147) 2021-02-18 19:48:51 -08:00
Kai Yang
66f6c3944d
[Java] Re-enable remaining skipped Java test cases (#13979)
Co-authored-by: loushang.ls <loushang.ls@antfin.com>
2021-02-19 10:57:28 +08:00
SangBin Cho
8b9e0d1e6c
Add tqdm to windows build. (#14197) 2021-02-18 16:01:04 -08:00
Simon Mo
3fb6b07aea
[Buildkite] Add wheels, jars, and docker builds. (#14190) 2021-02-18 14:19:28 -08:00
Clark Zinzow
c092a5d184
Cancel object location long-poll on object free. (#14165) 2021-02-18 14:09:43 -08:00
Stephanie Wang
dfb86e0a8f
[core] Push object chunks with multiple threads (#14191)
* Push object chunks with multiple threads

* fix build
2021-02-18 14:09:23 -08:00
Kai Fricke
a3dc92ead6
[tune] fix specifying nested metrics in progress reporter (#14189) 2021-02-18 22:26:03 +01:00
Sven Mika
775e685531
[RLlib] Issue #13824: compress_observations=True crashes for all algos not using a replay buffer. (#14034) 2021-02-18 21:36:32 +01:00
Barak Michener
50ccd41cbf
fix and test the errors, limited to pickling (#14174)
Change-Id: I95c4715c0f54b1d5909aeb8eb96403db22aa0f07
2021-02-18 11:13:15 -08:00
SangBin Cho
3ad05337f7
[Shuffle] Use progress bar for experimental.shuffle (#14179)
* done.

* Add time.
2021-02-18 11:05:54 -08:00
architkulkarni
6d88036340
[ray_client]: Skip flaky test_cancel_chain on Windows (#14167)
* skip test_cancel_chain on windows

* lint

* lint
2021-02-18 10:43:15 -08:00
SangBin Cho
66f93a3d63
Revert "Fix OSX error and re-merge unhandled exceptions handling (#14138)" (#14180)
This reverts commit ee584e8328.
2021-02-18 10:35:38 -08:00
Sven Mika
929946271d
[RLlib] Issue #14022: Trajectory View API fails in MA-env where a new agent terminates right away (done=True right after initial obs). (#14031) 2021-02-18 14:07:49 +01:00
qicosmos
bc46fc5fce
Cpp worker refactor serializer (#14027) 2021-02-18 15:37:58 +08:00
Qing Wang
b579186791
Fix reset load_code_from_local in 2nd session. (#13985)
Co-authored-by: Qing Wang <jovany.wq@antgroup.com>
2021-02-18 13:52:36 +08:00
Siyuan (Ryans) Zhuang
af8c0c1add
fix numpy ufunc serialization failures (#14143) 2021-02-17 21:28:21 -08:00
dependabot[bot]
323c7da70c
[tune](deps): Bump matplotlib from 3.3.3 to 3.3.4 in /python/requirements (#14087)
Bumps [matplotlib](https://github.com/matplotlib/matplotlib) from 3.3.3 to 3.3.4.
- [Release notes](https://github.com/matplotlib/matplotlib/releases)
- [Commits](https://github.com/matplotlib/matplotlib/compare/v3.3.3...v3.3.4)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-02-17 19:31:07 -08:00
Amog Kamsetty
be7114639d
[Tune] Update Transformers Example (#14150)
Co-authored-by: Ubuntu <ubuntu@ip-172-31-6-151.us-west-2.compute.internal>
2021-02-17 18:37:27 -08:00
qicosmos
2307f54e88
[C++ API] simplify code of Arguments; avoid memory copy (#13978) 2021-02-18 10:33:18 +08:00
EscapeReality846089495
5ce1d262a3
[tune] Fixed atomic_save w/ os.replace (#14089)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2021-02-17 15:48:39 -08:00
SangBin Cho
9451b4ea86
[Object Spilling] Fix the race condition. (#14149)
* Fix the race condition.

* done.

* Fix the lint issue.

* fix issues. addressed comments.
2021-02-17 14:35:22 -08:00
Kai Fricke
a0f73cf3f7
[xgboost] Update XGBoost release test configs (#13941)
* Update XGBoost release test configs

* Use GPU containers

* Fix elastic check

* Use spot instances for GPU

* Add debugging output

* Fix success check, failure checking, outputs, sync behavior

* Update release checklist, rename mounts
2021-02-17 23:00:49 +01:00
Antoni Baum
58d7398246
[Tune] Add HEBOSearch Searcher (#13863)
* HEBO first pass

* Fix bad quotes

* Fixes

* Reproducibility

* Update python/ray/tune/suggest/hebo.py

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>

* Add hebo_example.py to BUILD

* Nit

* Update to pypi package

* Alphabetical HEBO requirement

* Fix syntax error

* Fix wrong space in hebo example

* Move validate_warmstart to utils

* Space assertion in HEBO

* Comment

* Apply suggestions from code review

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>

* Formatting

Co-authored-by: Kai Fricke <krfricke@users.noreply.github.com>
2021-02-17 22:53:10 +01:00
Eric Liang
ee584e8328
Fix OSX error and re-merge unhandled exceptions handling (#14138) 2021-02-17 13:35:07 -08:00
dependabot[bot]
67bdccca41
[tune](deps): Bump smart-open from 4.0.1 to 4.2.0 in /python/requirements (#14158)
Bumps [smart-open](https://github.com/piskvorky/smart_open) from 4.0.1 to 4.2.0.
- [Release notes](https://github.com/piskvorky/smart_open/releases)
- [Changelog](https://github.com/RaRe-Technologies/smart_open/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/piskvorky/smart_open/compare/4.0.1...v4.2.0)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-02-17 12:03:22 -08:00
Sumanth Ratna
c1d68d7dd0
[docs] Remove sphinx-gallery example runtimes (#14141)
e7f65d9b21/doc/conf.py (L340)
2021-02-17 11:07:16 -08:00