chenk008
c318293d9f
[Core] start worker in container ( #16671 )
2021-06-29 10:12:47 -07:00
matthewdeng
b0f304a1b5
[release] add golden notebook release test for torch/tune/serve ( #16619 )
...
* [release] add golden notebook release test for torch/tune/serve
* start serve on all nodes so remote localhost works
2021-06-29 09:13:23 -07:00
Ian Rodney
b3532cc2d1
[Client][Test] Avoid Port-Reuse to DeFlake ( #16697 )
...
Co-authored-by: mwtian <81660174+mwtian@users.noreply.github.com>
2021-06-28 23:54:06 -07:00
Ian Rodney
a9df1b7a67
[Test][Modin] Actually run test_modin ( #16719 )
2021-06-28 20:39:30 -07:00
SangBin Cho
804a867b3d
Revert revert OBOD pubsub PR ( #16487 )
...
* Revert "Revert "[Pubsub] Use a pubsub module for Ownership based object directory (#16407 )" (#16486 )"
This reverts commit b986938f0f.
* revert the obod problem.
* Add stats.
* Fix a possible regression.
* in another progress
* debugging
* Fix stats bug
* update
* Add more stats.
* Add stats
* lint
* Fix issue
* remove spammy logs
* lint
* better error msg for debugging
* Add even more logging
* Remove spammy logs
* Fix iterator invalidation issue
* more debugging info
* fix
* Add more debug logs
* add debug logs
* Remove the problematic line for confirmation
* Completed
* Fixed a broken test.
* experiment
* Lint
* Add a better error message
* try out
* revert the build file.
* In progress again
* IP
* Formatting
* Revert the log level
* Unskip test array
* final clean up.
* fix a build issue
* debug logs
* remove
* .
* Add more critical logs.
* format
* tmp
* log
* log
* issue fix
* Upgrade
* test experiment
* Fix an issue
* Fix issues.
* Lint
* remove unnecessary code
* last clean up.
Co-authored-by: Stephanie Wang <swang@cs.berkeley.edu>
2021-06-28 20:30:31 -07:00
SongGuyang
41b9a5102b
[C++ worker] support build C++ worker during python setup ( #16636 )
2021-06-29 10:29:47 +08:00
Amog Kamsetty
322b9531f6
[SGD] Add __init__ file to tf.examples ( #16726 )
2021-06-28 19:23:22 -07:00
Ian Rodney
1a357a7e4f
[Client] Auto-Run ray.client().connect() ( #16259 )
2021-06-28 17:01:26 -07:00
Travis Addair
e5dfa4cfb9
[tune] Only use TBXLoggerCallback when torch is installed ( #16695 )
...
* [tune] Only use TBXLoggerCallback when torch is installed
* Fix lint
* fix
* Update python/ray/tune/utils/callback.py
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2021-06-28 16:34:20 -07:00
Alex Wu
0811ae4231
Use the same worker id in python and C++ ( #16712 )
...
Co-authored-by: Alex <alex@anyscale.com>
2021-06-28 15:42:37 -07:00
Jiao
6aeda62d40
[Serve] Add serve test config files and wrk dependency ( #16631 )
2021-06-28 10:01:55 -07:00
Amog Kamsetty
be1f6d59fa
[CI] Re-try Tag rllib flaky tests ( #16680 )
2021-06-28 18:42:54 +02:00
architkulkarni
b9f6132c08
skip flaky conda env fixture on MacOS ( #16710 )
2021-06-28 09:38:17 -07:00
Tao Wang
38157a3166
[Core] Support external redis address when starting ray processes ( #13170 )
...
* support external redis address when starting ray processes
* use a more general name
* add cli option
* handle some details
* fix set shards logic
* reuse --address instead of introducing a new one
* lint
* tiny
* lint and fix
2021-06-28 09:22:40 -07:00
Kai Fricke
04bfba1274
[tune] Move reporter detection to utility function ( #16673 )
...
Test failures seem unrelated
2021-06-28 12:55:05 +01:00
qicosmos
500891c1e0
[C++ Worker] Support Windows ( #16700 )
2021-06-28 17:45:20 +08:00
Amog Kamsetty
54ce8092ab
[Tune] Update transformers to 4.6.1 ( #16397 )
...
* add examples
* update dask docs
* add build file
* formatting
* fix ci command
* fix
* Update python/ray/util/dask/BUILD
* newline
* fix pytest fixtures
* fixes
* formatting
* fix shuffle example
* update
* don't log to wandb
2021-06-26 14:10:47 -07:00
AnnaKosiorek
1e709771b2
[rllib][minor] clarification of the softmax axis in dqn_torch_policy ( #16311 )
...
PyTorch's nn.functional.softmax (unlike tf.nn.softmax) does not default to the last dimension; with dim omitted it can compute softmax along the zeroth dimension
2021-06-26 11:19:54 -07:00
Eric Liang
aa882ed52d
Make it more convenient to develop ray.data by setting RAY_EXPERIMENTAL_DATA_API=1 ( #16685 )
...
* make it convenient to import ray.data
* update
* Update python/ray/experimental/data/read_api.py
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
2021-06-26 09:17:30 -07:00
Eric Liang
6bfa97eed7
Check in the first iteration of an Arrow-based dataset api ( #16648 )
2021-06-25 18:45:13 -07:00
Eric Liang
3f5ce01949
Address leftover comments from https://github.com/ray-project/ray/pull/16394/files ( #16684 )
2021-06-25 16:45:50 -07:00
Dmitri Gekhtman
7b58ec9ae5
[autoscaler] rsync bootstrap flag ( #16667 )
2021-06-25 15:26:47 -07:00
Eric Liang
9b17c35bee
Fix PullManager handling of get requests and liveness issues ( #16394 )
2021-06-25 13:01:46 -07:00
Kai Fricke
696334ff08
[tune] Fix Tee utility class properties ( #16674 )
2021-06-25 18:19:01 +01:00
architkulkarni
06dfd8dddb
Revert "[Dashboard][event] Basic event module ( #16283 )" ( #16676 )
...
This reverts commit 5afa53aa64.
2021-06-25 09:38:18 -07:00
architkulkarni
35039869ee
Revert "[RLlib] Add some learning tests to rllib-flaky ( #16604 )" ( #16677 )
...
This reverts commit d1510911e0.
2021-06-25 09:37:58 -07:00
Lixin Wei
a9d6e93977
[scheduler] Rename TaskRequest to ResourceRequest ( #16649 )
2021-06-25 08:50:20 -07:00
architkulkarni
503641c2c2
[Core] [runtime env] add C++ test for caching workers by runtime env hash ( #16664 )
2021-06-25 09:38:37 -05:00
architkulkarni
b15ab2d60b
[Core] [runtime env] Support specifying runtime env in @ray.remote decorator ( #16660 )
2021-06-25 09:37:40 -05:00
SongGuyang
e74d9d3ded
[runtime env] Download runtime env(conda) in agent instead of setup_worker ( #16525 )
2021-06-25 19:39:05 +08:00
dependabot[bot]
2e3771cc29
[tune](deps): Bump tensorflow-probability in /python/requirements/tune ( #16561 )
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-06-25 11:50:35 +01:00
fyrestone
5afa53aa64
[Dashboard][event] Basic event module ( #16283 )
2021-06-25 13:59:02 +08:00
mwtian
49b8b86488
Remove empty ClusterTaskManager::ScheduleInfeasibleTasks() ( #16665 )
2021-06-24 22:34:57 -07:00
Eric Liang
1c709cbeb3
Fix typing ( #16668 )
2021-06-24 22:06:33 -07:00
Chen Shen
c4d7b31a79
[Test] Placement group stress test ( #16633 )
2021-06-24 21:35:55 -07:00
Qing Wang
89b07572da
[Java] Upgrade log4j ( #16657 )
2021-06-24 21:01:27 -07:00
Alex Wu
bfe85326f2
[core] Cleanup dead pubsub related code ( #16629 )
2021-06-24 19:36:56 -07:00
Dmitri Gekhtman
ea23382919
[autoscaler][docs] Doc tweak ( #16663 )
...
* doc-tweak
* fix
2021-06-24 16:25:00 -07:00
Alex Wu
8ffaa8d3fa
Refactor pubsub to support GCS publisher/raylet client ( #16624 )
...
* .
* .
* .
* .
* .
* import error :(
* boop
* .
* fix tests
* fix tests
* .
* cleanup
Co-authored-by: Alex Wu <alex@anyscale.com>
2021-06-24 15:30:42 -07:00
Amog Kamsetty
d1510911e0
[RLlib] Add some learning tests to rllib-flaky ( #16604 )
2021-06-25 00:28:54 +02:00
architkulkarni
8587f9d738
[Core] [runtime env] Fix conda/pip filepaths relative to working_dir ( #16186 )
2021-06-24 16:43:25 -05:00
Qing Wang
3272997b0d
[Java] Upgrade some deps to fix CVEs ( #16650 )
2021-06-24 10:56:20 -07:00
architkulkarni
4637298d36
Delete conda env before creating to deflake test_runtime_env_complicated ( #16628 )
2021-06-24 12:13:26 -05:00
architkulkarni
e8c25a2fa4
[Core] [runtime env] Merge child's runtime_env["env_vars"] with that of parent ( #16553 )
2021-06-24 12:13:13 -05:00
Simon Mo
aabdfe2989
[Serve] Fix HTTP headers ( #16647 )
2021-06-24 11:59:43 -05:00
Amog Kamsetty
53d16365b0
[Release] Convert Horovod and SGD release tests ( #15999 )
2021-06-24 15:56:02 +01:00
Kai Fricke
ef97bdd407
[release] Fix app config: Install latest releases. Bump xgboost-ray version ( #16581 )
2021-06-24 12:56:21 +01:00
Gabriele Oliaro
3e2f608145
Work stealing! ( #15475 )
...
* work_stealing one commit squash
* using random task id to request workers
* inlining methods in direct_task_transport.h
* faster checking for presence of stealable tasks in RequestNewWorkerIfNeeded
* linting
* fixup! using random task id to request workers
* estimating number of tasks to steal based only on tasks in flight
* linting
* fixup! linting
* backup of changes
* fixed issue in scheduling queue test after merge
* linting
* redesigned work stealing. compiles but not tested
* all tests passing locally
* fixup! all tests passing locally
* fixup! fixup! all tests passing locally
* fixed big bug in StealTasksIfNeeded
* rev1
* rev2 (before removing the work_stealing param)
* removed work_stealing flag, fixed existing unit tests
* added unit tests; need to figure out how to assign distinct worker ids in GrantWorkerLease
* fixed work stealing test
* revisions, added two more unit/regression tests
* test
2021-06-23 17:08:28 -07:00
Frank Luan
9249287a36
Object spilling threshold ( #16558 )
...
* Object spilling threshold
* clang-format
* Make tests more lenient
* Fix tests
* Fix tests
* Address comments
* Fix tests lint
* Refactor
* Fix tests
* Fix cpp tests
* Address comments
2021-06-23 16:54:41 -07:00
SangBin Cho
f816f613c7
[Test] Handle flaky tests ( #16602 )
...
* Handle flaky tests.
* lint
* tag more
* add test_scheduling
* Remove global gc
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
2021-06-23 16:24:12 -07:00