Dmitri Gekhtman
25ebefafc8
[autoscaler][aws][test] Validate current state of subnet-specification ( #14859 )
...
* This PR adds a test that validates that adding head_nodes and worker_nodes fields with subnet data to a multi-node-type config leads to a correct configuration of a security group.
2021-03-25 01:40:16 +02:00
Yi Cheng
f427801c10
Revert "[core] Fix worker type in python ( #14823 )" ( #14910 )
...
This reverts commit 9ccf291f4d
.
2021-03-24 13:27:56 -07:00
Amog Kamsetty
233f174984
Update release instructions ( #14882 )
2021-03-24 12:41:50 -07:00
Simon Mo
d57808d007
[Serve] Add support for handle.method_name.remote ( #14831 )
2021-03-24 12:10:14 -07:00
Edward Oakes
59e231818d
[serve] Add Deployment.delete() and un-skip relevant tests ( #14898 )
2021-03-24 13:40:30 -05:00
SangBin Cho
ec3cfef883
Revert "[Core] Implement long polling-based pubsub to reduce number of WaitForObjectEviction requests in flight. ( #14638 )" ( #14905 )
...
This reverts commit 35ec91c4e0
.
2021-03-24 11:22:48 -07:00
Clark Zinzow
ed46d8bf45
[Core] Added ownership-based object directory metrics, fixed raylet metric bug. ( #14855 )
...
* Added ownership-based object directory metrics.
* Updated OBOD metric descriptions.
* Dump OBOD metrics in debug string.
* Added e2e tests for metrics.
2021-03-24 10:53:22 -07:00
Edward Oakes
ade5857aa3
[serve] Duplicate test_api for new declarative codepath ( #14877 )
2021-03-24 11:29:35 -05:00
mvindiola1
5e350ceaa2
[RLlib] Issue 14119: Fix TD3 policy delay for torch. ( #14840 )
2021-03-24 16:26:22 +01:00
Raoul Khouri
c37fa3f389
[RLlib] Example and test for custom Trainer wrapper experiments ( #14652 )
2021-03-24 16:22:46 +01:00
Jack Parsons
3df7a010b1
[RLlib] Fixing conv filters config for ComplexInputNetwork ( #14749 )
2021-03-24 16:15:36 +01:00
astronauti
8874ccec2d
[RLlib] Update sac_tf_policy.py (add tf.cast to float32 for rewards) ( #14843 )
2021-03-24 16:12:55 +01:00
Sven Mika
6708211b59
[RLlib] JSONReader: Mix files if > 1 at beginning (each worker should start with different file). ( #14865 )
2021-03-24 16:07:40 +01:00
Sven Mika
3e389d5812
[RLlib] Issue 14533: tf.enable_eager_execution()
must be called at beginning. ( #14737 )
2021-03-24 12:54:27 +01:00
SangBin Cho
35ec91c4e0
[Core] Implement long polling-based pubsub to reduce number of WaitForObjectEviction requests in flight. ( #14638 )
...
* in progress.
* IN progress.
* lint.
* Updated code
* lint.
* In progress of writing tets.
* Finished implementation. Need cleanup & refactoring.
* fixing tests...
* Finish the impl.
* Fix typo.
* impl done. Only cleanup left.
* done.
* Finished clean up.
* Fix issues.
* Add a stronger consistency check.
* Addressed code review.
* lint.
* done.
* Addressed more.
* addressed all reviews.
* Addressed code review.
* lint.
* Added unit tests to assert no leak.
2021-03-23 23:47:08 -07:00
Stephanie Wang
201ebc3f92
Revert "[core] Set a configurable max memory for fetched objects ( #14817 )" ( #14887 )
...
This reverts commit 8769953474
.
2021-03-23 21:58:11 -07:00
Dmitri Gekhtman
10458564a3
[autoscaler][aws] Use subnets in only one VPC ( #14868 )
2021-03-23 21:16:25 -07:00
fyrestone
52cfa1cdd7
Fix load code from local ( #12102 )
2021-03-24 11:49:58 +08:00
Kai Fricke
898243d538
[tune] Limit maximum number of pending trials. Add convergence test. ( #14835 )
2021-03-23 18:19:41 -07:00
Ian Rodney
5d763b3f49
[Autoscaler][Docker] Mention nightly images in addition to nightly wheels ( #14871 )
2021-03-23 17:13:36 -07:00
Ian Rodney
3869c9d342
[Test] Fix Test-Dashboard ( #14874 )
2021-03-23 15:50:15 -07:00
Edward Oakes
36bce2d3af
[flaky_test][windows] Disable test_actor_resources::test_creating_more_actors_than_resources on Windows ( #14879 )
2021-03-23 15:45:10 -07:00
Yi Cheng
41d4d9fc94
Increase asan build timeout ( #14876 )
2021-03-23 17:11:35 -05:00
Stephanie Wang
8769953474
[core] Set a configurable max memory for fetched objects ( #14817 )
...
* Set threshold, tests
* comment
* move max to pull manager
* unit test
* fix plasma
* comment
2021-03-23 13:55:02 -07:00
Edward Oakes
7a1f85ab38
[serve] Add initial support for @serve.deployment syntax ( #14869 )
2021-03-23 15:07:43 -05:00
Ian Rodney
9775ad2f25
[Autoscaler] Correct typing for Event Handler execute_callback ( #14718 )
2021-03-23 12:57:49 -07:00
Ian Rodney
4fb4b96a4f
[Autoscaler] Make setup_commands in defaults.yaml Docker Compatible ( #14804 )
2021-03-23 12:52:15 -07:00
Xianyang Liu
a3eb64311e
[sgd] Fixes TrainingOperator wrap model incorrect ( #14353 )
...
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-03-23 12:19:32 -07:00
Sven Mika
78c64ca151
[RLlib] Attention net example script: Clarifications on how to use with Trainer.compute_action. ( #14864 )
2021-03-23 19:33:01 +01:00
SangBin Cho
5f7ce293fe
[Test] Large scale dask on ray test ( #14340 )
...
* Add a test.
* Add a test.
* d
* Modify the release doc.
* Addressed code review.
2021-03-23 11:00:35 -07:00
Sven Mika
04bc0a9828
[RLlib] Remove all non-trajectory view API code. ( #14860 )
2021-03-23 09:50:18 -07:00
Ian Rodney
e0f31bf035
[Disable Tests] New Dashboard Tests ( #14845 )
2021-03-23 07:49:54 -07:00
Hao Chen
9053be0e63
Fix 'dictionary changed size during iteration' in _available_resources_per_node ( #14839 )
2021-03-23 17:44:05 +08:00
Sven Mika
3e7899d251
[RLlib] Issue 14653: Empty env steps cause key error in SimpleListCollector. ( #14765 )
2021-03-23 10:30:53 +01:00
Sven Mika
f859ebb99f
[RLlib] Fix env rendering and recording options (for non-local mode; >0 workers; +evaluation-workers). ( #14796 )
2021-03-23 10:06:06 +01:00
Yi Cheng
9ccf291f4d
[core] Fix worker type in python ( #14823 )
...
* Fix
* format
* more
* format
2021-03-23 00:58:57 -07:00
Sven Mika
e7557ae433
[RLlib] Issue 13132: DQN does not update target net after restore ( #14838 )
2021-03-23 08:30:37 +01:00
DK.Pino
ef59c145e2
[Java][Placement Group] Move related API of Placement Group from Ray to PlacementGroups. ( #14729 )
2021-03-23 12:34:12 +08:00
Ian Rodney
95e8e8c049
[Dependencies] Install Ray Full in Autoscaler ( #14851 )
2021-03-22 20:57:16 -07:00
Ian Rodney
ce18427017
[HotFix] Fix Colorful mock for with_style ( #14848 )
2021-03-22 14:11:03 -07:00
Simon Mo
2f25c7d32c
[Core] Remove legacy code path for unhandled exception in asyncio ( #14828 )
2021-03-22 13:09:48 -07:00
SangBin Cho
87877cdfbf
[Test] Fix flaky object spilling test ( #14722 )
...
* start
* done.
* d
* d
* Push the fix.
* done.
* Enable test.
2021-03-22 12:51:47 -07:00
Simon Mo
21b5ab6739
[Serve] Raise error when endpoint fail to register ( #14827 )
2021-03-22 11:40:58 -07:00
andrew-rosenfeld-ts
d6933da423
[tune] fixing Trainable.__init__ docstring reference to build() #14824 ( #14850 )
...
Co-authored-by: Andrew Rosenfeld <rosnfeld@gmail.com>
2021-03-22 18:53:49 +01:00
Clark Zinzow
1a9ba19012
[Core] Adds deprecation decorator and fixes privatization of a few APIs. ( #14811 )
2021-03-22 10:31:50 -07:00
Simon Mo
51828a42dc
[Serve] Pin Pydantic version to >=1.8 ( #14830 )
2021-03-22 12:20:11 -05:00
Kai Fricke
be30b784a4
Amend #14308 (fix for post_fcnet_hiddens) ( #14354 )
2021-03-22 15:44:18 +01:00
Kai Fricke
7364a7a327
[tune] Move Optuna to ask(fixed_distributions) interface ( #14731 )
...
Adjusting to changes in Optuna 2.6.0. Old interface was marked as deprecated.
2021-03-22 12:25:37 +01:00
chaokunyang
c8ab3ca84f
fix install jdk8 on mac ( #14725 )
2021-03-22 14:57:45 +08:00
Alex Wu
7b80023656
[Autoscaler] Improved observability ( #14816 )
...
Co-authored-by: Alex <alex@anyscale.com>
2021-03-21 21:04:05 -07:00