Alex Wu
f080911d9b
[dashboard] include worker id in actor snapshot ( #15967 )
...
Co-authored-by: Alex Wu <alex@anyscale.com>
2021-05-21 09:26:37 -07:00
Dominic Ming
43be599a9a
[Dashboard] Actor Table UI Optimize ( #15802 )
2021-05-21 09:23:32 -07:00
architkulkarni
0a7ba95e42
[Core] minor refactor for usability of runtime_env ( #15965 )
...
* minor refactor for usability of runtime_env
* remove job_config changes
2021-05-21 09:02:32 -07:00
Edward Oakes
b6a79445fe
[serve] Fix some test sizes to avoid bazel warnings ( #15959 )
2021-05-21 10:11:46 -05:00
Ian Rodney
6add438929
[client] Start Specific Server's in separate Conda environments ( #15926 )
...
* conda support
* test multiple ray.init called
* additional testing
* test for proxy_manager
* better error message
* pass in session_dir
* unit tests
* fix test_runtime_env_complicated
* clean up proxier
* respond to comments
* try finally blocks
* fix up test_client_proxy
* small modifications to tests
* additional test
* fix tests
* lintfix
2021-05-21 01:01:57 -07:00
Eric Liang
29aa336a4d
Revert "[Object spilling] Avoid worker crash when an object is spille… ( #15964 )
...
This reverts commit 061e3fbde3
.
2021-05-20 21:17:59 -07:00
Eric Liang
86cb3e28cd
[hotfix] Somehow the pong_plot_example is failing to gym.make("Pong-v0") ( #15961 )
2021-05-20 20:56:39 -07:00
Edward Oakes
d339c8734e
Fix duplicate log messages when ray.shutdown() and ray.init() are called repeatedly ( #15957 )
2021-05-20 22:27:54 -05:00
Edward Oakes
82410f20b2
[serve] Add warning + docstring for anonymous namespaces ( #15921 )
2021-05-20 22:27:15 -05:00
Steven Morad
581d63e607
[RLlib] Fix dnc input shape ( #15939 )
...
Co-authored-by: Steven Morad <sm2558@cam.ac.uk>
2021-05-20 19:06:02 -07:00
architkulkarni
64fdac83a7
[Core] Add minimal support for pip in runtime env ( #15927 )
2021-05-20 20:47:16 -05:00
SangBin Cho
a1375a955b
Pubsub registration / unregistration idempotency ( #15896 )
...
* Make AddEntry idempotent.
* Done.
2021-05-20 18:40:06 -07:00
Kai Yang
061e3fbde3
[Object spilling] Avoid worker crash when an object is spilled right after being restored ( #15903 )
...
* Fix check failure when memory pressure is high
* Add test
* lint
2021-05-20 18:36:11 -07:00
Simon Mo
32f9d2287b
[Core] Fix asyncio actor exit. ( #15925 )
2021-05-20 17:21:58 -07:00
Kai Fricke
12418a2f69
[xgboost] Update documentation ( #15900 )
2021-05-20 17:16:45 -07:00
Simon Mo
cce5007285
Revert "[CI] Remove wheel renaming code path. ( #15952 )" ( #15954 )
...
This reverts commit 42bbde2987
.
2021-05-20 15:44:53 -07:00
Simon Mo
b130613143
[Serve] Latency improvement by using pickle ( #15945 )
2021-05-20 15:20:58 -07:00
Frank Luan
c87b76632d
[plasma] Reset OOM timer as objects are being spilled ( #15431 )
...
* Fix deserializer in metrics.Counter
* Fix restore_spilled_objects() for external object spilling
* WIP reset OOM timer
* Add test
* Revert style change
* pytest
* Simplify test
* Fix test
* Make tests faster
2021-05-20 13:13:54 -07:00
Alex Wu
ec997c0145
[client] Client builder API namespace support ( #15934 )
...
* add namespace to client
* done?
* address comments
Co-authored-by: Alex <alex@anyscale.com>
2021-05-20 12:36:05 -07:00
Simon Mo
42bbde2987
[CI] Remove wheel renaming code path. ( #15952 )
...
pypa/manylinux2014_x86_64 was updated 05-20-2021 and the wheels
produced already have manylinux in them. So the renaming will
only change the name to `manymanylinux20142014`.
2021-05-20 12:21:46 -07:00
Richard Liaw
2cb0ad4fb1
[docs] improve banner for ray summit ( #15947 )
2021-05-20 11:56:12 -07:00
Richard Liaw
5f595cbb0f
[docs] add summit banner ( #15938 )
2021-05-20 10:35:45 -07:00
Micah Yong
52eb41d881
[core] Use immutable keys for _future_to_actor in ActorPools.py ( #15402 )
...
* Use immutable keys for _future_to_actor in ActorPools.py
* Add corresponding test for multiple returns
* Lint and format
2021-05-20 10:05:42 -07:00
Sven Mika
e80095591c
[RLlib] Entropy coeff schedule bug fix and git bisect script. ( #15937 )
2021-05-20 18:15:10 +02:00
YeahNew
9a93dd9682
Adding a RaySGD and DGL ( Deep Graph Library) integration example(gat… ( #15718 )
...
* Adding a RaySGD and DGL ( Deep Graph Library) integration example(gat_dgl.py)
* Update gat_dgl.py
* Update gat_dgl.py
* Update gat_dgl.py
* the gat_dgl.py has been formated by the format.sh script
* delet useless code in the gat_dgl.py
* add 'import numpy as np', modified the output form of accuracy in the validate method
* Modified the code for better readability and added the README.md file
* Update README.md
* Update README.md
* Update README.md
* updates
* formatting
Co-authored-by: YeahNew <1650996069@qq.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-05-20 08:47:19 -07:00
Alex Wu
cd2fc7792f
[dashboard] Snapshot of cluster state ( #15868 )
2021-05-20 08:10:32 -07:00
Yi Cheng
874558e813
[runtime env] Put runtime env into runtime context; ( #15895 )
2021-05-20 08:08:45 -07:00
Jae Sim
d042aa6d73
[serve] Add optional prev_version
check to .deploy()
for users to avoid race conditions ( #15821 )
2021-05-20 09:43:22 -05:00
Sven Mika
03c7c530a9
[RLlib] Issue 15483: Wrong init states (should be non-zero if ModelV2.get_initial_state
returns non-zero values). ( #15733 )
2021-05-20 09:28:09 +02:00
Sven Mika
2d34216660
[RLlib] APEX-DQN: Bug fix for torch and add learning test. ( #15762 )
2021-05-20 09:27:03 +02:00
dependabot[bot]
dde7cbd288
[tune](deps): Bump tune-sklearn from 0.2.1 to 0.3.0 in /python/requirements/tune ( #15852 )
...
* [tune](deps): Bump tune-sklearn in /python/requirements/tune
Bumps [tune-sklearn](https://github.com/ray-project/tune-sklearn ) from 0.2.1 to 0.3.0.
- [Release notes](https://github.com/ray-project/tune-sklearn/releases )
- [Commits](https://github.com/ray-project/tune-sklearn/compare/v0.2.1...v0.3.0 )
Signed-off-by: dependabot[bot] <support@github.com>
* split test_torch
* lint
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-05-19 16:01:35 -07:00
dependabot[bot]
493dbd1602
[tune](deps): Bump mlflow in /python/requirements/tune ( #15853 )
...
Bumps [mlflow](https://github.com/mlflow/mlflow ) from 1.16.0 to 1.17.0.
- [Release notes](https://github.com/mlflow/mlflow/releases )
- [Changelog](https://github.com/mlflow/mlflow/blob/master/CHANGELOG.rst )
- [Commits](https://github.com/mlflow/mlflow/compare/v1.16.0...v1.17.0 )
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-05-19 15:59:45 -07:00
Sven Mika
eaa7f6696d
[RLlib] Issue 15887: MARWIL adv norm update mismatch for tf (static-graph) vs torch versions. ( #15898 )
2021-05-19 15:44:11 -07:00
Simon Mo
7a5981f244
[Serve] Feature flag and turn off placement group usage. ( #15865 )
2021-05-19 15:43:46 -07:00
Ian Rodney
4825f1b2a5
[client] One Driver per RayClient Server ( #15923 )
2021-05-19 15:40:49 -07:00
Eric Liang
52ae4b4b4e
Disallow ignoring failures on osx flaky test build ( #15863 )
2021-05-19 15:22:29 -07:00
dependabot[bot]
8a9bebb5e4
[tune](deps): Bump timm from 0.3.2 to 0.4.5 in /python/requirements ( #15824 )
...
* [tune](deps): Bump timm from 0.3.2 to 0.4.5 in /python/requirements
Bumps [timm](https://github.com/rwightman/pytorch-image-models ) from 0.3.2 to 0.4.5.
- [Release notes](https://github.com/rwightman/pytorch-image-models/releases )
- [Changelog](https://github.com/rwightman/pytorch-image-models/blob/master/docs/changes.md )
- [Commits](https://github.com/rwightman/pytorch-image-models/commits/v0.4.5 )
Signed-off-by: dependabot[bot] <support@github.com>
* updates
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-05-19 14:29:34 -07:00
architkulkarni
c3d06697bb
[Core] Add dynamic conda env install in shim process ( #15881 )
2021-05-19 15:46:42 -05:00
Edward Oakes
a116875abc
[serve] Add properties + docstring + test for Deployment class ( #15917 )
2021-05-19 14:44:00 -05:00
Eric Liang
836c739fe5
Revert "[client] One Driver per RayClient Server ( #15875 )" ( #15922 )
...
This reverts commit 97d1414f23
.
2021-05-19 11:58:29 -07:00
Edward Oakes
5243e8776b
[Docs] update serve logo ( #15914 )
2021-05-19 11:57:54 -07:00
Chris K. W
df58c9c7f7
[autoscaler][aws] deprecate worker_nodes and head_node ( #15584 )
...
Co-authored-by: Ian Rodney <ian.rodney@gmail.com>
Co-authored-by: Chris Wong <cwong@anyscale.com>
Co-authored-by: Dmitri Gekhtman <dmitri.m.gekhtman@gmail.com>
2021-05-19 11:54:29 -07:00
Dmitri Gekhtman
a7a5a2b2b7
[autoscaler][kubernetes][minor][hotfix] Fix havoc-wreaking typo ( #15916 )
2021-05-19 13:52:26 -05:00
dependabot[bot]
c164e73c7c
[tune](deps): Bump gluoncv in /python/requirements/tune ( #15845 )
...
Bumps [gluoncv](https://github.com/dmlc/gluon-cv ) from 0.9.1 to 0.10.1.post0.
- [Release notes](https://github.com/dmlc/gluon-cv/releases )
- [Commits](https://github.com/dmlc/gluon-cv/commits )
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-05-19 10:35:30 -07:00
Edward Oakes
2267befe27
[serve] Fix bug where placement group was always detached even in non-detached instances ( #15885 )
2021-05-19 12:22:58 -05:00
Eric Liang
2dc4198210
Increase the raylet start wait timeout to accomodate plasma preallocation ( #15860 )
...
* update
* add doc
* update
* quick fix
* no spam
* fix
2021-05-19 09:39:25 -07:00
Ian Rodney
97d1414f23
[client] One Driver per RayClient Server ( #15875 )
2021-05-19 09:03:09 -07:00
architkulkarni
c636bc3065
[Serve] [Core] Fix serve on Windows by disabling runtime env on Windows ( #15838 )
2021-05-19 10:58:40 -05:00
Stefan Schneider
55709bac7a
[RLlib] Examples for training, saving, loading, testing an agent with SB & RLlib ( #15897 )
2021-05-19 16:36:59 +02:00
Michael Luo
474f04e322
[RLlib] DDPG/TD3 + A3C/A2C + MARWIL/BC Annotation/Comments/Code Cleanup ( #14707 )
2021-05-19 16:32:29 +02:00