Commit graph

3436 commits

Author SHA1 Message Date
Siyuan (Ryans) Zhuang
b8dda0e3d0
[Serialization] Fix buffer alignment issues (#11888)
* fix buffer alignment issues

* remove unused fields

* aligned memory allocation

* windows compat

* license. fix compiler warnings

* fix compilation error

* reinterpret_cast
2020-11-10 23:44:16 -08:00
Alex Wu
8afd2acdc1
[Autoscaler] simulator placement groups (#11777) 2020-11-10 18:10:36 -08:00
Eric Liang
46f3652102
Remove repeat push timeout from object manager (#11874) 2020-11-10 16:26:53 -08:00
Keqiu Hu
0c1bdaef59
[tune] TensorFlow Distributed Trainable (#11876)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-10 14:59:08 -08:00
Richard Liaw
50dbf1a307
[core] Support configurable number of "check for redis" attempts (#11902)
Signed-off-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-10 14:57:57 -08:00
Ian Rodney
1d158dda32
[serve] Rename to use replicas, not workers (#11822) 2020-11-10 11:36:15 -08:00
Eric Liang
9b8218aabd
[docs] Move all /latest links to /master (#11897)
* use master link

* remae

* revert non-ray

* more

* mre
2020-11-10 10:53:28 -08:00
Nikita Vemuri
aba9288615
[Autoscaler] Introduce callback system (#11674)
Co-authored-by: Nikita Vemuri <nikitavemuri@Nikitas-MacBook-Pro.local>
Co-authored-by: Xiayue Charles Lin <xcl@anyscale.com>
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-09 20:03:15 -08:00
Eric Liang
ee2da0cf45
[Core] PushManager for reliable broadcast (#11869) 2020-11-09 18:01:47 -08:00
Benjamin Black
1999266bba
Updated pettingzoo env to acomidate api changes and fixes (#11873)
* Updated pettingzoo env to acomidate api changes and fixes

* fixed test failure

* fixed linting issue

* fixed test failure
2020-11-09 16:09:49 -08:00
Eric Liang
a9cf0141a0
[autoscaler] Fix semantics of request_resources (#11820) 2020-11-09 14:57:40 -08:00
Edward Oakes
1c132f2ff8
[serve] Improve DEBUG logging for understanding perf (#11838) 2020-11-09 14:10:42 -06:00
architkulkarni
adcaabcd64
[Serve] Reconfigure backend class at runtime (#11709) 2020-11-09 14:04:51 -06:00
Kai Fricke
287aba6dc3
[tune] schedulers: Add test for context finalization (#11889) 2020-11-09 11:37:05 -08:00
Richard Liaw
a09e49ee94
[core] Add retry for reading session name (#11844) 2020-11-09 11:22:50 -08:00
Kai Fricke
88be1ea20b
[tune] Handle infinite and NaN values (#11835) 2020-11-09 11:18:31 -08:00
Eric Liang
0932320eb3
Move test_joblib back to new_scheduler_broken category (#11872) 2020-11-07 20:08:41 -08:00
Stephanie Wang
61e41257e7
[Object spilling] Queue failed object creation requests until objects have been spilled (#11796)
* Queue creation requests

* Cleanup disconnected clients

* Remove unused

* todo

* FIFO order for create requests, remove warmup for IO workers

* test and lint

* disable test

* lint

* Skip on windows
2020-11-06 18:22:19 -05:00
Amog Kamsetty
900a48c19c
[Tune] Better warnings/exceptions for fail_fast='raise' (#11842) 2020-11-06 15:01:55 -08:00
Aaron Miller
045fed5cd2
[examples] comment out rsync_ settings for K8S (#11862) 2020-11-06 14:35:21 -08:00
Simon Mo
871cde989a
Re-Revert: [Serialization] Update CloudPickle to 1.6.0 (#9694) (#11837) 2020-11-06 12:24:36 -08:00
Kishan Sagathiya
c5e6c90e1e
[Core] Add name of actor in the result of ray.actors() (#11828)
Added name field to `actor_info`

Fixes #11112
2020-11-06 10:45:44 -08:00
Philipp Moritz
28e7439cf0
[doc] Add documentation for Ray debugger (#11815)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-05 16:25:27 -08:00
Barak Michener
27c810a97e
Basic protos for ray client (#11762) 2020-11-05 16:23:54 -08:00
Eric Liang
f86c4f992c
Fix RAY_ENABLE_NEW_SCHEDULER=1 pytest test_advanced_2.py::test_zero_cpus_actor (#11817) 2020-11-05 16:02:04 -08:00
architkulkarni
347e871409
[Serve] Add dependency management (#11743) 2020-11-05 16:39:37 -06:00
Kai Yang
ffc267f94b
[Test] Ignore setproctitle for local mode (#11819) 2020-11-05 11:07:34 -08:00
Kai Fricke
603accf1c2
[tune] logger refactor part 3: Add ExperimentLogger class (#11749) 2020-11-05 08:55:38 -08:00
Richard Liaw
f6717b8b03
[autoscaler] Support empty node list for kill node (#11810) 2020-11-04 22:40:07 -08:00
Richard Liaw
efa07d5403
Revert "Revert "[tune] PB2 (#11466)" (#11795)" (#11812) 2020-11-04 20:47:12 -08:00
Eric Liang
69145d6215
[hotfix] Bazel candidates not found due to raising too early 2020-11-04 16:08:51 -08:00
Ian Rodney
22bbbc3171
[wheel] Fix Manylinux2014 Build (#11811) 2020-11-04 14:50:38 -08:00
Amog Kamsetty
92718de40c
[SGD] Better support for custom DDP (#11771) 2020-11-04 13:58:51 -08:00
Ameer Haj Ali
ebdf8ba3fa
[autoscaler] Support legacy cluster configs with the new resource demand scheduler (#11751) 2020-11-04 12:05:48 -08:00
Kai Yang
31598338b3
[Core] Fix ray start failure to due to bug of redis address detection (#11735)
* Fix ray start failure to due redis address detection bug

* Address comment
2020-11-04 12:04:44 -08:00
Alex Wu
53aac55739
[autoscaler] Autoscaler simulator (#11690) 2020-11-04 12:04:11 -08:00
Akash Patel
b7531fb4f5
[redis-py] change redis-py deprecated hmset usage to hset (#11776) 2020-11-03 22:23:02 -08:00
Amog Kamsetty
7248d5f4ae
Revert "[tune] PB2 (#11466)" (#11795)
This reverts commit e7aafd7d24.
2020-11-03 21:05:00 -08:00
Kai Fricke
007634fd1b
[tune] logger refactor part 2: Add SyncerCallback (#11748)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-03 21:04:40 -08:00
Barak Michener
05c4e3fb2a
[build] Build wheels with manylinux2014 (#11621)
* necessary changes

* Split bazel install

* manylinux2014

* change references to manylinux2014

* Fix lint

* port alex's docker build changes

* fix config issue

* remove extra manylinux2010 requirement script

* revert SHA overwrite

* wip

* incompatible_linklibs

* fix nits
2020-11-03 19:36:32 -08:00
Ian Rodney
9527220a86
[serve] Fix Controller Crashes on Win (#11792) 2020-11-03 16:54:16 -08:00
Ian Rodney
c3074f559c
[serve] Split out metadata for checkpointing (#11533) 2020-11-03 12:41:24 -08:00
Philipp Moritz
39ce0eadbe
Ray PDB support (#11739) 2020-11-03 09:49:23 -08:00
Ameer Haj Ali
08e0e8311a
[autoscaler] Fixing AWS instance types autofill (#11758) 2020-11-03 09:34:14 -08:00
Kai Fricke
f7b19c41e3
[tune] logger refactor part 1: move classes and utilities to own files (#11746)
* [tune] logger refactor part 1: move classes and utilities to own files

* Fix circular dependency

* Remove uneeded pretty print copy

* Apply suggestions from code review
2020-11-03 07:48:09 -08:00
Maksim Smolin
0a6d24a727
[cli] Remove the deprecated old_style logging calls (#10776)
Co-authored-by: Richard Liaw <rliaw@berkeley.edu>
2020-11-02 23:40:18 -08:00
Stephanie Wang
0ba777af99
[Object spilling] Add policy to automatically spill objects on OutOfMemory (#11673) 2020-11-02 12:42:02 -08:00
Ameer Haj Ali
8d74a04a42
[autoscaler] Flag flip for resource_demand_scheduler should take into account queue (#11615) 2020-11-02 12:41:22 -08:00
Ian Rodney
171e02c684
[serve] re-enable serve-controller-crash test (#11579) 2020-11-02 11:22:09 -08:00
Eric Liang
48dee789b3
Add random actor placement; fix cancellation callback; update test skips (#11684) 2020-10-30 18:36:35 -07:00