daiyaanarfeen
8f6d73a93a
[sgd] Extend distributed pytorch functionality ( #5675 )
...
* raysgd
* apply fn
* double quotes
* removed duplicate TimerStat
* removed duplicate find_free_port
* imports in pytorch_trainer
* init doc
* ray.experimental
* remove resize example
* resnet example
* cifar
* Fix up after kwargs
* data_dir and dataloader_workers args
* formatting
* loss
* init
* update code
* lint
* smoketest
* better_configs
* fix
* fix
* fix
* train_loader
* fixdocs
* ok
* ok
* fix
* fix_update
* fix
* fix
* done
* fix
* fix
* fix
* small
* lint
* fix
* fix
* fix_test
* fix
* validate
* fix
* fi
2019-11-05 11:16:46 -08:00
Mitchell Stern
82be14f943
Move gRPC calls outside of Raylet stats lock ( #6090 )
2019-11-05 00:47:15 -08:00
mehrdadn
e312f3d282
Compatibility issues ( #6071 )
...
* Pass -f - to tar to force stdin on Windows
* Quote paths that may contain spaces (causes issues on Windows)
* Copy over Windows code from Arrow for glog signal handle uninstall
* Add missing COPTS to build rules since we'll need them for Windows compatibility
* Begin adding COPTS for Windows compatibility
* Disable glog on Arrow until we change WIN32 to _WIN32 there
* Missing header files that cause problems on Windows
* WORD typedef conflicts with Windows; remove it
* uint -> unsigned int wherever we're dealing with milliseconds (signed version is already int)
* uint -> unsigned int for enums
* uint -> size_t, wherever we're dealing with sizes or indices into arrays
* Work around Boost 1.68 bug in detecting clang-cl (revert this after upgrading)
* Missing #include <unistd.h>
* Add check for signal handler uninstallation failure
* Linting issue
2019-11-05 00:08:14 -08:00
Philipp Moritz
fefe050a58
Fix running out of file descriptors in the WebUI ( #6086 )
2019-11-04 21:17:36 -08:00
Edward Oakes
043d1f4094
Return RayObjects to core worker ( #6052 )
2019-11-04 20:27:57 -08:00
visatish
18241f4a2d
[tune] Added resources_per_trial arg to validate_save_restore u… ( #6032 )
2019-11-04 13:24:46 -08:00
Simon Mo
c23eae5998
[Serve] Fix router-worker communication ( #5961 )
...
* Half way there, needs the strict queuing fix
* Fix scale down, use callback
* Cleanup
* Address commments
* Comment, nit
* Fix docstring
2019-11-04 11:29:21 -08:00
Eric Liang
8485304e83
Support concurrent Actor calls in Ray ( #6053 )
2019-11-04 01:14:35 -08:00
Eric Liang
fbad6f543b
Try fixing actor handle destruction on py2 ( #6076 )
2019-11-03 22:46:40 -08:00
Philipp Moritz
1c5446851a
Use Plasma with LRU refreshing integrated ( #6050 )
2019-11-03 16:19:05 -08:00
Philipp Moritz
894885593c
Fix prometheus-cpp failure ( #6073 )
2019-11-03 15:05:47 -08:00
Eric Liang
1a3e97cf23
Revert "Enable the Ray dashboard by default ( #5976 )" ( #6068 )
...
This reverts commit 6166ef3e09
.
2019-11-01 17:08:37 -07:00
Richard Liaw
e94bebb1de
[tune] Fix Jenkins tests ( #6028 )
2019-11-01 16:42:04 -07:00
Eric Liang
fb34928a2a
[minor] Perf optimizations for direct actor task submission ( #6044 )
...
* merge optimizations
* fix
* fix memory err
* optimize
* fix tests
* fix serialization of method handles
* document weakref
* fix check
* bazel format
* disable on 2
2019-11-01 14:41:14 -07:00
Eric Liang
eef4ad3bba
Report census view data as part of raylet node stats ( #6060 )
2019-11-01 14:26:09 -07:00
Simon Mo
c8d7065bf3
[CI] Use rerunfailures instead of flaky ( #6061 )
...
* Use rerunfailures instead of flaky
* Lint
2019-11-01 13:59:03 -07:00
Eric Liang
6166ef3e09
Enable the Ray dashboard by default ( #5976 )
2019-11-01 12:19:01 -07:00
Simon Mo
7f5b3502da
Implement Detached Actor ( #6036 )
...
* Arg propagation works
* Implement persistent actor
* Add doc
* Initialize is_persistent_
* Rename persistent->detached
* Address comment
* Make test passes
* Address comment
* Python2 compatiblity
* Fix naming, py2
* Lint
2019-11-01 10:28:23 -07:00
Philipp Moritz
f7455839bf
Expose raylet info to dashboard ( #6045 )
2019-10-31 17:36:59 -07:00
Eric Liang
c86f945520
Support pass by ref args in for direct actor calls ( #6040 )
2019-10-31 16:55:10 -07:00
Eric Liang
16891e9379
[rllib] Don't use flat weights in non-eager mode ( #6001 )
2019-10-31 15:16:02 -07:00
Edward Oakes
16e9dfd2e1
Exit workers when raylet dies unexpectedly ( #6014 )
2019-10-30 20:29:25 -07:00
Edward Oakes
e9e78871b9
Remove unused function definition caching ( #6042 )
2019-10-30 16:41:18 -07:00
Simon Mo
56f3e96887
[Serve] Use ray's cloudpickle ( #6051 )
...
* Revert "Add cloudpickle as doc requirements (#6037 )"
This reverts commit 03ce3b7c5b
.
* Use ray's vendored cloudpickle
2019-10-30 15:21:09 -07:00
Qing Wang
4636fc2b78
Fix java ci ( #5964 )
2019-10-30 14:50:53 -07:00
Eric Liang
8ebba202df
[minor] Reduce perf overhead of object ref tracking ( #6041 )
2019-10-29 18:14:51 -07:00
Eric Liang
b89cac976a
Basic direct actor call support in Python ( #5991 )
2019-10-28 22:09:04 -07:00
Simon Mo
4c4342c165
Bring back pytest-sugar ( #6038 )
...
* Add cloudpickle as doc requirements
* Bring back pytest-sugar
* Revert "Add cloudpickle as doc requirements"
This reverts commit 2206e9e62ee20d93638e115f07a3fc933cbad9a3.
2019-10-28 20:24:28 -07:00
Simon Mo
03ce3b7c5b
Add cloudpickle as doc requirements ( #6037 )
2019-10-28 18:25:02 -07:00
Simon Mo
9e2c5f8218
[Serve] Put global state in remote actor ( #5937 )
...
* Making progress
* Impl done, start debugging
* Tests all pass
* Add test, fix
* Update doc
* Fix type
2019-10-28 11:43:47 -07:00
Edward Oakes
c1418b04df
Remove CoreWorkerObjectInterface ( #6023 )
2019-10-28 10:48:41 -07:00
Simon Mo
e08b5d0cae
[Serve] Add a minimal cli ( #5854 )
...
* Add a minimal cli
* Integrate serve_cli with ray scripts
2019-10-28 09:51:31 -07:00
Richard Liaw
085a6713a0
[docs] Add documentation for Dynamic Custom Resources ( #6000 )
2019-10-27 17:58:04 -07:00
Philipp Moritz
80c01617a3
Optimize python task execution ( #6024 )
2019-10-27 00:43:34 -07:00
mehrdadn
e706cb63cc
Fix missing double quotes for spaces in paths ( #6026 )
2019-10-26 20:46:55 -07:00
Stephanie Wang
eb41c945a1
Add gRPC endpoint to raylet to expose metrics ( #6005 )
2019-10-26 16:37:39 -07:00
Philipp Moritz
010270b3dc
Cleanup left over shell scripts in build process ( #6017 )
2019-10-26 15:46:46 -07:00
Eric Liang
a0dcb45dc3
[rllib] Fix APEX priorities returning zero all the time ( #5980 )
...
* fix
* move example tests to end
* level err
* guard against none
* no trace test
* ignore thumbs
* np
* fix multi node
* fix
2019-10-26 13:23:42 -07:00
Philipp Moritz
0bb922c29f
Revert "Use plasma with batched CreateAndSeal implemented ( #5864 )" ( #6022 )
...
This reverts commit 875c84ed63
.
2019-10-25 23:02:21 -07:00
Eric Liang
a5523466a2
Enable memstore by default ( #6003 )
2019-10-25 21:59:12 -07:00
Simon Mo
f1d2eb5247
Apply shallow-since and sha256 ( #6019 )
2019-10-25 19:48:04 -07:00
Edward Oakes
d4055d70e3
Remove CoreWorkerTaskExecutionInterface ( #6009 )
2019-10-25 16:33:44 -07:00
Edward Oakes
e6141a0b8b
Remove UsePush logic from raylet ( #6015 )
2019-10-25 14:52:19 -07:00
Edward Oakes
f8a6ed7832
Spawn processes in background sessions ( #6008 )
...
Allows us to properly handle KeyboardInterrupts in interactive python interpreters.
2019-10-25 13:01:35 -07:00
Edward Oakes
1ce521a7f3
Remove task context from python worker ( #5987 )
...
Removes duplicated state between the python and C++ workers. Also cleans up the serialization codepaths a bit.
2019-10-25 07:38:33 -07:00
Ujval Misra
cf16b2f0c4
Add timesteps and remove ID from progress output ( #5999 )
2019-10-25 00:48:42 -07:00
Eric Liang
4edae7ea2b
Speed up task submissions a bit ( #5992 )
2019-10-25 00:10:37 -07:00
Edward Oakes
6f27d881bd
Fix core worker shutdown errors ( #6004 )
2019-10-24 22:29:05 -07:00
Edward Oakes
71a2f4c63d
fix comment ( #6006 )
2019-10-24 18:07:49 -07:00
Edward Oakes
436dd936d2
Update profiling numbers ( #5989 )
2019-10-24 18:02:44 -07:00