Jae Sim
d042aa6d73
[serve] Add optional prev_version
check to .deploy()
for users to avoid race conditions ( #15821 )
2021-05-20 09:43:22 -05:00
Sven Mika
03c7c530a9
[RLlib] Issue 15483: Wrong init states (should be non-zero if ModelV2.get_initial_state
returns non-zero values). ( #15733 )
2021-05-20 09:28:09 +02:00
Sven Mika
2d34216660
[RLlib] APEX-DQN: Bug fix for torch and add learning test. ( #15762 )
2021-05-20 09:27:03 +02:00
dependabot[bot]
dde7cbd288
[tune](deps): Bump tune-sklearn from 0.2.1 to 0.3.0 in /python/requirements/tune ( #15852 )
...
* [tune](deps): Bump tune-sklearn in /python/requirements/tune
Bumps [tune-sklearn](https://github.com/ray-project/tune-sklearn ) from 0.2.1 to 0.3.0.
- [Release notes](https://github.com/ray-project/tune-sklearn/releases )
- [Commits](https://github.com/ray-project/tune-sklearn/compare/v0.2.1...v0.3.0 )
Signed-off-by: dependabot[bot] <support@github.com>
* split test_torch
* lint
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-05-19 16:01:35 -07:00
dependabot[bot]
493dbd1602
[tune](deps): Bump mlflow in /python/requirements/tune ( #15853 )
...
Bumps [mlflow](https://github.com/mlflow/mlflow ) from 1.16.0 to 1.17.0.
- [Release notes](https://github.com/mlflow/mlflow/releases )
- [Changelog](https://github.com/mlflow/mlflow/blob/master/CHANGELOG.rst )
- [Commits](https://github.com/mlflow/mlflow/compare/v1.16.0...v1.17.0 )
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-05-19 15:59:45 -07:00
Sven Mika
eaa7f6696d
[RLlib] Issue 15887: MARWIL adv norm update mismatch for tf (static-graph) vs torch versions. ( #15898 )
2021-05-19 15:44:11 -07:00
Simon Mo
7a5981f244
[Serve] Feature flag and turn off placement group usage. ( #15865 )
2021-05-19 15:43:46 -07:00
Ian Rodney
4825f1b2a5
[client] One Driver per RayClient Server ( #15923 )
2021-05-19 15:40:49 -07:00
Eric Liang
52ae4b4b4e
Disallow ignoring failures on osx flaky test build ( #15863 )
2021-05-19 15:22:29 -07:00
dependabot[bot]
8a9bebb5e4
[tune](deps): Bump timm from 0.3.2 to 0.4.5 in /python/requirements ( #15824 )
...
* [tune](deps): Bump timm from 0.3.2 to 0.4.5 in /python/requirements
Bumps [timm](https://github.com/rwightman/pytorch-image-models ) from 0.3.2 to 0.4.5.
- [Release notes](https://github.com/rwightman/pytorch-image-models/releases )
- [Changelog](https://github.com/rwightman/pytorch-image-models/blob/master/docs/changes.md )
- [Commits](https://github.com/rwightman/pytorch-image-models/commits/v0.4.5 )
Signed-off-by: dependabot[bot] <support@github.com>
* updates
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-05-19 14:29:34 -07:00
architkulkarni
c3d06697bb
[Core] Add dynamic conda env install in shim process ( #15881 )
2021-05-19 15:46:42 -05:00
Edward Oakes
a116875abc
[serve] Add properties + docstring + test for Deployment class ( #15917 )
2021-05-19 14:44:00 -05:00
Eric Liang
836c739fe5
Revert "[client] One Driver per RayClient Server ( #15875 )" ( #15922 )
...
This reverts commit 97d1414f23
.
2021-05-19 11:58:29 -07:00
Edward Oakes
5243e8776b
[Docs] update serve logo ( #15914 )
2021-05-19 11:57:54 -07:00
Chris K. W
df58c9c7f7
[autoscaler][aws] deprecate worker_nodes and head_node ( #15584 )
...
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
Co-authored-by: Chris Wong <cwong@anyscale.com>
Co-authored-by: Dmitri Gekhtman <dmitri.m.gekhtman@gmail.com>
2021-05-19 11:54:29 -07:00
Dmitri Gekhtman
a7a5a2b2b7
[autoscaler][kubernetes][minor][hotfix] Fix havoc-wreaking typo ( #15916 )
2021-05-19 13:52:26 -05:00
dependabot[bot]
c164e73c7c
[tune](deps): Bump gluoncv in /python/requirements/tune ( #15845 )
...
Bumps [gluoncv](https://github.com/dmlc/gluon-cv ) from 0.9.1 to 0.10.1.post0.
- [Release notes](https://github.com/dmlc/gluon-cv/releases )
- [Commits](https://github.com/dmlc/gluon-cv/commits )
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-05-19 10:35:30 -07:00
Edward Oakes
2267befe27
[serve] Fix bug where placement group was always detached even in non-detached instances ( #15885 )
2021-05-19 12:22:58 -05:00
Eric Liang
2dc4198210
Increase the raylet start wait timeout to accomodate plasma preallocation ( #15860 )
...
* update
* add doc
* update
* quick fix
* no spam
* fix
2021-05-19 09:39:25 -07:00
Ian Rodney
97d1414f23
[client] One Driver per RayClient Server ( #15875 )
2021-05-19 09:03:09 -07:00
architkulkarni
c636bc3065
[Serve] [Core] Fix serve on Windows by disabling runtime env on Windows ( #15838 )
2021-05-19 10:58:40 -05:00
Stefan Schneider
55709bac7a
[RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib ( #15897 )
2021-05-19 16:36:59 +02:00
Michael Luo
474f04e322
[RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup ( #14707 )
2021-05-19 16:32:29 +02:00
qicosmos
8790bb465b
[C++ worker] Remove func ptr offset ( #15809 )
2021-05-19 18:03:39 +08:00
Steven Morad
d8eed68af2
[RLlib] Add differentiable neural computer example ( #14844 )
2021-05-19 09:15:39 +02:00
SangBin Cho
29e9ad01e2
Fix ray memory command broken with RAY_ADDRESS='auto'. ( #15892 )
2021-05-18 22:06:17 -07:00
Kathryn Zhou
891648ea9e
Bump OpenTelemetry version & bug fix ( #15791 )
2021-05-18 17:06:43 -07:00
Michael Galarnyk
cedd348650
[DOCS] Added link to ray pulse community survey ( #15891 )
2021-05-18 15:25:36 -07:00
Farrukh Ali
dbdc8459ad
[rllib][docs] Update rllib-dev.rst ( #15864 )
2021-05-18 11:28:57 -07:00
Rick Lan
3b1b1d74fe
[rllib] Read "logger_config" first before "prefix". ( #15871 )
2021-05-18 10:50:46 -07:00
Sven Mika
7e260edb07
[RLlib] Fix small memory leak in SimpleListCollector (already superseeded by Bam4d's PR + small fix in error message). ( #15783 )
2021-05-18 16:02:03 +02:00
Chris Bamford
0be83d9a95
[RLlib] Fixing Memory Leak In Multi-Agent environments. Adding tooling for finding memory leaks in workers. ( #15815 )
2021-05-18 13:23:00 +02:00
Sven Mika
d2c755ccef
[RLlib] Examples scripts add argparse help and replace --torch
with --framework
. ( #15832 )
2021-05-18 13:18:12 +02:00
Sven Mika
2303851c3c
[RLlib] Torch multi-GPU + LSTM/RNN bug fix. ( #15492 )
2021-05-18 11:51:05 +02:00
Sven Mika
4e9555cad3
[RLlib] Issue 15724: Breaking example script in docs due to outdated eager
config flag (use framework='tf2|tfe' instead). ( #15736 )
2021-05-18 11:34:46 +02:00
dependabot[bot]
4c8813f2e8
[RLlib](deps): Bump pettingzoo in /python/requirements/rllib ( #15846 )
...
Bumps [pettingzoo](https://github.com/PettingZoo-Team/PettingZoo ) from 1.8.1 to 1.8.2.
- [Release notes](https://github.com/PettingZoo-Team/PettingZoo/releases )
- [Commits](https://github.com/PettingZoo-Team/PettingZoo/compare/1.8.1...1.8.2 )
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-05-18 11:25:52 +02:00
Sven Mika
839fc59224
[RLlib] CQL TensorFlow support ( #15841 )
2021-05-18 11:10:46 +02:00
Sven Mika
a36b9305d4
[RLlib] Better error message when deep-learning framework not installed. ( #15735 )
2021-05-18 11:06:05 +02:00
Sven Mika
6f4d988713
[RLlib] Issue 15556: Fix R2D2 using chunks from previous episodes in the "burn-in" window. ( #15737 )
2021-05-18 11:05:42 +02:00
Sven Mika
308ea62430
[RLlib] Fix "seed" setting to work in all frameworks and w/ all CUDA versions. ( #15682 )
2021-05-18 11:00:24 +02:00
architkulkarni
194c5e3a96
[Core] Cache workers by runtime_env in worker pool ( #15782 )
...
* pass RuntimeEnv in task spec as opaque string
* lint
* set correct empty value for json: "{}" not ""
* add comment for field in proto
* fix worker pool test by checking both "" and "{}"
* add RAY_CHECK todo
* make dict empty if all values null
* remove unnecessary ser/de
* fix
* address comments
* add WorkerCacheKey with hash function
* clean up
* add naive impl., dedicated workers never killed
* put dedicated workers in idle_of_all_languages
* pipe env hash from worker.py -> Worker
* fully pipe through hash, basic cache test passing
* use int type for runtime env hash
* convert Worker env hash type from size_t to int
* fix
* add method to MockWorker to fix cpp tests
* make compatible with java streaming test
* restore old dynamic_options code to fix java test
* address comments
* add comment about sorting before hash
* add comments for private members of WorkerCacheKey
2021-05-18 00:19:27 -07:00
Yi Cheng
863532af0a
[core] API for pre-run customized functions ( #15749 )
...
* run customer setup fn
* fix
* lint
* skip on w32
* fix comment
* up
* up
2021-05-17 22:52:36 -07:00
Alex Wu
69f228d22d
[core] Record actor+job start/end times and metadata ( #15803 )
2021-05-17 21:38:39 -07:00
Frank Luan
0dc34566fe
Refactor raylet to allocate+write+seal one return object at a time ( #15757 )
...
* Refactor raylet to allocate+write+seal one return object at a time
* Fix build
* Fix C++ and Java runtime
* Skip Windows testing
* Fix java and cpp runtime
* Fix warnings
* Fix cpp and java tests
* Fix cpp and java runtime
Co-authored-by: Stephanie Wang <swang@cs.berkeley.edu>
2021-05-17 20:06:08 -07:00
SangBin Cho
ff461634b0
[Core] Improved bad error message. ( #15663 )
...
* Improved bad error message.
* Update src/ray/raylet/node_manager.cc
Co-authored-by: Clark Zinzow <clarkzinzow@gmail.com>
* lint.
* Add a pid
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
Co-authored-by: Clark Zinzow <clarkzinzow@gmail.com>
2021-05-17 19:38:05 -07:00
Dmitri Gekhtman
95c3d88cac
[autoscaler][kubernetes] Helm chart ( #15614 )
2021-05-17 16:55:10 -07:00
Amog Kamsetty
c97594aca3
[CI] Update dependencies on travis flaky build ( #15858 )
2021-05-17 16:07:00 -07:00
Dmitri Gekhtman
c2b8381015
[autoscaler][gcp] Migrate GCP config to available node types ( #15805 )
2021-05-17 15:45:47 -07:00
dependabot[bot]
434465e477
[tune](deps): Bump gpy from 1.9.9 to 1.10.0 in /python/requirements/tune ( #15850 )
...
Bumps [gpy](https://github.com/SheffieldML/GPy ) from 1.9.9 to 1.10.0.
- [Release notes](https://github.com/SheffieldML/GPy/releases )
- [Changelog](https://github.com/SheffieldML/GPy/blob/devel/CHANGELOG.md )
- [Commits](https://github.com/SheffieldML/GPy/compare/v1.9.9...v1.10.0 )
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-05-17 11:33:11 -07:00
Alex Wu
3744026897
Fix test_scheduling ( #15823 )
...
* done
* Update python/ray/tests/test_scheduling.py
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
* Update python/ray/tests/test_scheduling.py
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
* lint
Co-authored-by: Alex Wu <alex@anyscale.com>
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
2021-05-17 10:08:01 -07:00