Commit graph

4721 commits

Author SHA1 Message Date
Amog Kamsetty
7318a212fb
[Serve] Skip test_redeploy_multiple_replicas on OSX (#16915) 2021-07-06 18:58:36 -07:00
Eric Liang
7e52fde8a3
Fix num returns error message (#16865) 2021-07-06 14:57:26 -07:00
Stefan Schneider
d4babd69c1
[windows] correct symlinks for files (node.py) (#16817) 2021-07-06 10:01:13 -07:00
Dmitri Gekhtman
a27a8172cc
[autoscaler] Handle node type key change/deletion (#16691) 2021-07-06 09:06:58 -07:00
Kai Fricke
4178655ba7
[tune] Pass custom sync_to_cloud templates to durable trainables (#16739) 2021-07-06 09:50:59 +01:00
Eleven Liu
e250abf689
[tune] Sort top results by metric (#16576) 2021-07-06 08:59:31 +01:00
Eric Liang
4af36faea1
[docs] Cleanup workflow api.py pydoc and spell out ObjectRef for clarity (#16857)
* cleanup types

* docs

* clarify
2021-07-06 00:59:06 -07:00
Kai Yang
7c21be5450
[Object spilling] Clean up spilled objects on disk when Raylet starts (#16669) 2021-07-05 12:01:25 +08:00
Vince Jankovics
63ce4b4e97
[tune] Fix step for MLflow log_trial_result (#16840)
* Fix step for MLflow log_trial_result

* fix test

* lint

Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-07-03 10:06:45 -07:00
Yi Cheng
4bb3883a73
[dataset] deduce filesystem automatically (#16762) 2021-07-03 00:50:59 -07:00
Siyuan (Ryans) Zhuang
122bf309fa
[Workflow] Workflow management - Part I (#16838)
* refactoring

* share fate with the driver

* move TODOs to correct locations

* disable objectref test

* test that an exception is raised when object refs are used as inputs
2021-07-02 22:12:45 -07:00
SangBin Cho
61451af06b
[OBOD] Bug fix from test_scheduling.py (#16791) 2021-07-02 19:26:31 -07:00
architkulkarni
f02e41a822
[Core] [runtime env] Add RuntimeEnvHash and JobID to SchedulingKey (#16766)
* add python integration test

* improve readability

* remove unnecessary ray start --head

* add shutdown_only

* move RuntimeEnvHash from worker_pool to task_spec

* lint

* Add runtimeEnvHash and JobID to SchedulingKey

* remove JobID from key and hopefully fix compile

* add test for same env

* lint
2021-07-02 18:15:28 -07:00
Eric Liang
f2f6176057
Deflake test_failure_3 (#16854) 2021-07-02 18:00:42 -07:00
loicsacre
b7be848f07
[tune] Allow the logging of wandb.Image in tune.integration.wandb (#16839) 2021-07-02 17:14:23 -07:00
Charles
2526299eba
[Autoscaler] Use NetworkInterfaces parameter to launch instances. (#16816) 2021-07-02 14:01:02 -07:00
dependabot[bot]
ba13c2f7dd
[RLlib](deps): Bump gym[atari] in /python/requirements/rllib (#15992)
Bumps [gym[atari]](https://github.com/openai/gym) from 0.18.0 to 0.18.3.
- [Release notes](https://github.com/openai/gym/releases)
- [Commits](https://github.com/openai/gym/compare/0.18.0...0.18.3)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-07-02 09:58:18 -07:00
dependabot[bot]
ee9a1b022e
[tune](deps): Bump kubernetes in /python/requirements/tune (#16002)
Bumps [kubernetes](https://github.com/kubernetes-client/python) from 12.0.1 to 17.17.0.
- [Release notes](https://github.com/kubernetes-client/python/releases)
- [Changelog](https://github.com/kubernetes-client/python/blob/master/CHANGELOG.md)
- [Commits](https://github.com/kubernetes-client/python/compare/v12.0.1...v17.17.0)

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-07-02 09:48:02 -07:00
dependabot[bot]
81866a754a
[tune](deps): Bump optuna in /python/requirements/tune (#16376)
Bumps [optuna](https://github.com/optuna/optuna) from 2.6.0 to 2.8.0.
- [Release notes](https://github.com/optuna/optuna/releases)
- [Commits](https://github.com/optuna/optuna/compare/v2.6.0...v2.8.0)

---
updated-dependencies:
- dependency-name: optuna
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-07-02 09:46:52 -07:00
dependabot[bot]
0959fbca96
[tune](deps): Bump transformers in /python/requirements/tune (#16672)
Bumps [transformers](https://github.com/huggingface/transformers) from 4.3.2 to 4.8.1.
- [Release notes](https://github.com/huggingface/transformers/releases)
- [Commits](https://github.com/huggingface/transformers/compare/v4.3.2...v4.8.1)

---
updated-dependencies:
- dependency-name: transformers
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-07-02 09:45:23 -07:00
Kai Fricke
6c29f2bed6
[tune] Use fork for wandb, disable ipython in trainables (#16675) 2021-07-02 15:52:24 +01:00
Antoni Baum
cabd159423
Fix PlacementGroupFactory hashing (#16842) 2021-07-02 15:44:39 +01:00
Eric Liang
e77a964640
[dataset] Add datasource API (#16826) 2021-07-01 23:44:30 -07:00
Ian Rodney
f9daf7fa2c
[Connect] Separate Specific Server logs into different files (#16534)
* better debug logging

* log to separate files

* use getter/setter methods

* actually use type for ray.node.Node

* update docstring

* use public method

* add a test

* fix glob glob

* fix test again
2021-07-01 21:27:03 -07:00
Kai Fricke
521a05834d
[tune] Fix resume=REMOTE documentation and log more details (#16709) 2021-07-01 18:40:15 -07:00
Eric Liang
7eec197d31
[data] Support block metadata and lazy loading of blocks from the filesystem (#16689) 2021-07-01 17:25:20 -07:00
Siyuan (Ryans) Zhuang
b570606438
[Workflow] Ensure asynchronous execution of "workflow.run" & fix TODOs (#16727)
* initial commit

* code refactoring; non-blocking "workflow.run"

* Rename "workflow_root_dir" to "storage_url"

* readable workflow_id and step_id

* verbose logging

* remove unused code

* slugify

* storage -> storage_or_url

* fix comments

* storage_or_url -> storage

* get workflow outputs without extra object copy & transfer

* update tests

* tests workflow outputs resolving
2021-07-01 15:53:22 -07:00
Edward Oakes
ff37bdcbd9
[serve] Fix bazel test sizes now that we're on buildkite (#16785) 2021-07-01 16:27:31 -05:00
Chen Shen
34cd6a862a
[dataset] implements split (#16765) 2021-07-01 13:03:34 -07:00
Alex Wu
c27a02a807
[Dataset] read binary files (#16717)
* done

* done

* done

* .

* .

* lint

* .

Co-authored-by: Alex Wu <alex@anyscale.com>
2021-07-01 13:00:10 -07:00
Clark Zinzow
52da2cce68
[Dataset] Adds JSON, CSV, Pandas, and Dask IO layers, and adds the write side of the Parquet IO layer. (#16724) 2021-07-01 11:57:40 -07:00
Edward Oakes
a6051ea3dc
[serve] Rolling updates for user_config changes (#15909) 2021-07-01 13:31:59 -05:00
Clark Zinzow
096b38df8b
[Dataset] Miscellaneous hotfixes. (#16730) 2021-07-01 10:50:06 -07:00
Edward Oakes
691f0a1dd1
[serve] Remove unnecessary STOPPED ReplicaState (#16783) 2021-07-01 12:35:00 -05:00
Kai Fricke
6f48e4f566
[tune/wandb] Fix parameter lookup for sequences (#16806) 2021-07-01 15:22:25 +01:00
mwtian
7669708237
Create a wait_for_num_nodes() function, and use it in train_small (#16784) 2021-07-01 10:17:53 +01:00
Antoni Baum
2cec40d0f5
[tune] LightGBM integration callbacks (#16713) 2021-07-01 10:12:57 +01:00
Siyuan (Ryans) Zhuang
18232bb808
[Core][Serialization] Fix recursive resolve/deserialization (#16794)
* fix recursive resolve

* fix threading issues
2021-07-01 01:44:24 -07:00
Amog Kamsetty
2c17f4907e
[Tune] Add pbt_transformers Client test (#16793) 2021-06-30 23:10:16 -07:00
chenk008
06c7db7dca
[Core] Rename container option and ray-nest-container (#16771)
* rename container_option to container

* rename ray-nest-container to ray-worker-container

* lint

Co-authored-by: wuhua.ck <wuhua.ck@alibaba-inc.com>
2021-07-01 13:12:26 +08:00
SangBin Cho
0ad2dac3ca
[Dataset] Map batch implementation (#16729) 2021-06-30 21:51:04 -07:00
SangBin Cho
94149ef50c
[Test] Skip some flaky tests (#16768)
* done

* Add test gcs fault tolerance to the flaky test

* Addressed code review

* Addressed the last code review.
2021-06-30 20:48:09 -07:00
Dmitri Gekhtman
0cbeefdadf
[autoscaler] GPU=0 resource tweak (#16761)
* tweak

* use >, some fields don't admit an order
2021-06-30 16:51:25 -07:00
Ian Rodney
1f14f366fd
[Client] Make Client_Hook per-thread (#16731) 2021-06-30 11:48:55 -07:00
architkulkarni
3962ce286b
[Core] Bump override_environment_variables removal date from Ray 1.5 to 1.6 (#16755) 2021-06-30 09:31:08 -05:00
architkulkarni
077c04b06c
[Serve] Fix fastapi tutorial and update doc with deprecation warning (#16759) 2021-06-30 09:30:49 -05:00
SongGuyang
1d5c72ebde
[C++ API] rename 'ray_redis_address' to 'ray_address' (#16772) 2021-06-30 20:50:28 +08:00
Kai Fricke
4b6f8a1ece
[cli] use shutil.move in ray cluster-dump and allow passing of tempfile (#16738) 2021-06-30 07:49:03 +01:00
Amog Kamsetty
69507f53db
[Horovod] Add Horovod example (#16742)
* wip

* updates

* updates

* update

* formatting

* updates

* updates

* update

* fix

* add timeout
2021-06-29 19:15:15 -07:00
Simon Mo
2ac8a197db
[Serve] Copy FastAPI ResponseModel field (#16760) 2021-06-29 16:28:08 -07:00