Sven Mika
95175a822f
[RLlib] Issue 11974: Traj view API next-action (shift=+1) not working. ( #12407 )
...
* WIP.
* Fix and LINT.
2020-11-25 11:26:29 -08:00
Max Fitton
2e95552f0c
[Dashboard] Defensive change to make sure we do not iterate over "None" in the case that workers is not present in node physical stats for a given node ( #12358 )
2020-11-25 11:06:45 -08:00
ZhuSenlin
dc55f6ba3a
skip gcs fault tolerance test for the time being when new scheduler is enabled ( #12393 )
...
Co-authored-by: senlin.zsl <senlin.zsl@antfin.com>
2020-11-25 10:40:47 -08:00
Gekho457
2b293832e7
[Docker][minor] Update path in base-deps Dockerfile ( #12391 )
...
* Update path in base-deps Dockerfile
* development Dockerfile
2020-11-25 10:30:10 -08:00
SangBin Cho
2e4e285ef0
[Object Spilling] Fusion small objects ( #12087 )
2020-11-25 10:13:32 -08:00
karstenddwx
09d5413f70
[RLlib] rollout batch, handle rewards that are None (unknown) in a multi-agent env ( #11858 ) ( #11911 )
2020-11-25 13:39:22 +01:00
danuo
c009c178f6
[RLlib] Closes #11924 : Add support for custom/ray environments in rollouts.py for agents without workers ( #11926 )
...
* Closes #11924
Formerly, rollout.py would only load environments from gym (with
gym.make() ) , if an agent without workers is employed (such as ES or
ARS). This will result in an error, if a custom environment is used. This
PR adds the possibility to load environments from the ray registry,
while maintaining the support for gym environments.
* Update rllib/rollout.py
Co-authored-by: Sven Mika <sven@anyscale.io>
2020-11-25 08:43:17 +01:00
Tomasz Wrona
82852f0ed2
[RLlib] Add ResetOnExceptionWrapper with tests for unstable 3rd party envs ( #12353 )
2020-11-25 08:41:58 +01:00
Ian Rodney
c5845c3a4e
[docker] Docker stop on each node ( #12357 )
2020-11-24 23:15:53 -08:00
Hao Chen
20eb217c55
Fix a zsh compatiblity issue in java/BUILD.bazel ( #12199 )
2020-11-25 14:49:49 +08:00
Barak Michener
4066056a0d
[ray_client] Support calling functions from other functions and correct the tests ( #12141 )
...
* Add test mode and fix f calling g
* formatting
* remove unused functions
* fix tests -- which will be better in actor PR
2020-11-24 22:19:20 -08:00
Tao Wang
4dd0aa7822
[GCS]make thread number of gcs rpc server configurable ( #12257 )
2020-11-25 11:40:29 +08:00
Tao Wang
5d47d02f81
[GCS]add callback for RegisterSelf api, make it done first ( #12252 )
2020-11-25 11:36:44 +08:00
Tao Wang
e025b9e788
[TEST]Move all WaitReady together ( #12254 )
2020-11-25 11:21:24 +08:00
Tao Wang
2af10c1b78
[GCS]Add new message ReportResourceUsage ( #11848 )
2020-11-25 11:18:26 +08:00
Tao Wang
e1075c0a82
[GCS]Fill resource fields when re-report heartbeat after gcs restarted ( #12097 )
2020-11-25 11:07:02 +08:00
fangfengbin
1d909321c9
[PlacementGroup]Fix node manager release unused bundles bug ( #12346 )
2020-11-25 11:02:43 +08:00
fangfengbin
5934b20b96
[PlacementGroup]Fix destroy bundle resources bug ( #12336 )
...
* [PlacementGroup]Fix destroy bundle resources bug
* revert AddBundleLocations code change
* add comment
* fix review comments
Co-authored-by: 灵洵 <fengbin.ffb@antgroup.com>
2020-11-25 09:45:26 +08:00
Eric Liang
9f322db71d
Add many_ppo long running test ( #12364 )
...
* add new tes
* update
* update
2020-11-24 16:00:33 -08:00
Edward Oakes
dae137b919
Don't allow 'optional' files in setup.py ( #12359 )
2020-11-24 17:41:58 -06:00
Sven Mika
4afaa46028
[RLlib] Increase the scope of RLlib's regression tests. ( #12200 )
2020-11-24 22:18:31 +01:00
Eric Liang
5895554555
[autoscaler] Raise node "start" deadline to 900s, make configurable ( #12316 )
2020-11-24 12:16:01 -08:00
Edward Oakes
4ada3e4c99
[serve] Incremental change towards async control loop for replica startup ( #12281 )
2020-11-24 13:06:08 -06:00
roireshef
888357d251
added address resolution fix for running in docker containers ( #11944 )
...
* added address resolution fix for running in docker containers
* added address resolution fix for running in docker containers (java)
* Update RayNativeRuntime.java
Co-authored-by: Eric Liang <ekhliang@gmail.com>
2020-11-24 10:34:56 -08:00
Edward Oakes
be0fa7b8b4
Properly specify kubectl-rsync.sh in setup.py ( #12356 )
2020-11-24 12:13:29 -06:00
Gekho457
e66ddab190
[autoscaler/k8s] Handle unavailable k8s API ( #12283 )
2020-11-24 11:13:15 -06:00
Lixin Wei
462c7fb575
[streaming] export aligned_ symbols from raylet.so ( #12345 )
2020-11-24 10:16:12 -06:00
ZhuSenlin
1ae4d2873a
[GCS] refactor gcs initialization ( #11890 )
2020-11-24 21:11:18 +08:00
fangfengbin
be7938ee09
[PlacementGroup]Fix AddBundleLocations bug ( #12330 )
...
Co-authored-by: 灵洵 <fengbin.ffb@antgroup.com>
2020-11-24 16:57:17 +08:00
dHannasch
2c4514a2c0
[minor] Refactor to expose RedisContext::PingPort ( #12022 )
2020-11-23 20:39:50 -08:00
dHannasch
40cbc638ee
A bit more explanation about worker ports ( #12185 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-23 20:24:15 -08:00
Kai Fricke
b94bfdfa99
[tune] use default anonymous metric _metric
if at least a mode is set ( #12159 )
...
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-23 20:09:33 -08:00
Edward Oakes
135f2e0602
[serve] Only use typed BackendConfig in controller ( #12275 )
2020-11-23 20:56:04 -06:00
Edward Oakes
be2187f942
Remove use_legacy_config tests ( #12272 )
2020-11-23 20:30:50 -06:00
Edward Oakes
1b3c48b922
Remove unused _post_get_hooks ( #12270 )
2020-11-23 20:30:30 -06:00
Edward Oakes
a93ed4583d
Remove unused print_failed_task ( #12267 )
2020-11-23 16:33:10 -06:00
Richard Liaw
e59fe65d3d
[tune] Fix logging for dockersyncer ( #12196 )
2020-11-23 14:29:41 -08:00
Edward Oakes
4e43650851
Remove deprecated f._submit API ( #12268 )
2020-11-23 16:20:22 -06:00
Tomasz Wrona
822d8eb535
Removed double "install" statement ( #12261 )
2020-11-23 13:36:01 -08:00
Sumanth Ratna
6346ea0be6
[tune] Allow None values in TBX logger ( #12262 )
2020-11-23 12:57:23 -08:00
Max Fan
c816a44667
[docs] Insert missing article in sentence ( #12245 )
2020-11-23 12:53:11 -08:00
Bart Broere
ddd7a18cb9
[docs] Fix dead links to cluster docs by making dashes slashes ( #12260 )
2020-11-23 12:52:36 -08:00
viotemp1
4c4f189f97
[tune] nevergrad add points_to_evaluate ( #12207 )
2020-11-23 12:51:04 -08:00
Ian Rodney
1cf95cb081
[Serve] Remove old report-queue-code ( #12211 )
2020-11-23 12:42:05 -08:00
Ian Rodney
7f1f16d99e
[Core] Force get_actor(name)'s name to be non-empty string ( #12218 )
2020-11-23 12:38:16 -08:00
Edward Oakes
32d159a2ed
Fix release directory & RELEASE_PROCESS.md ( #12269 )
2020-11-23 14:28:59 -06:00
Richard Liaw
40428c9b05
[tune] Make tf distributed testing smaller ( #12173 )
2020-11-23 12:15:10 -08:00
Edward Oakes
c99c376d66
Remove unused logging job id field ( #12266 )
2020-11-23 13:00:15 -06:00
fangfengbin
084f03797b
[Placement Group]Placement Group supports gcs failover(Part3) ( #12036 )
2020-11-23 16:57:58 +08:00
chaokunyang
9d133e874c
[Java] support java actor class inheritance ( #12001 )
2020-11-23 14:07:10 +08:00