Commit graph

8511 commits

Author SHA1 Message Date
Sven Mika
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes (#16531) 2021-06-30 12:32:11 +02:00
Kai Fricke
4b6f8a1ece
[cli] use shutil.move in ray cluster-dump and allow passing of tempfile (#16738) 2021-06-30 07:49:03 +01:00
architkulkarni
13a133817b
[Core] [runtime env] [Tests] Add C++ unit test for dispatch queue nonblocking behavior (#16751) 2021-06-29 20:16:17 -07:00
Amog Kamsetty
69507f53db
[Horovod] Add Horovod example (#16742)
* wip

* updates

* updates

* update

* formatting

* updates

* updates

* update

* fix

* add timeout
2021-06-29 19:15:15 -07:00
Alex Wu
d89f148fbf
[Pubsub] Don't depend on subscriber address (#16752)
* remove subscriber address

* .

* lint

* test

* done

* lint

* .

* Update BUILD.bazel

Co-authored-by: Alex <alex@anyscale.com>
2021-06-29 17:34:37 -07:00
SangBin Cho
3cde8c36c9
Properly update the pinned object size (#16476) 2021-06-29 17:00:19 -07:00
Simon Mo
2ac8a197db
[Serve] Copy FastAPI ResponseModel field (#16760) 2021-06-29 16:28:08 -07:00
Patrick Ames
cf8785b0e1
[docs] Note that ordering of objects returned is preserved for ray.get. (#16763) 2021-06-29 16:17:16 -07:00
Richard Liaw
bcb73ed58b
finished impl (#16753) 2021-06-29 14:37:42 -07:00
Amog Kamsetty
abd16a8438
[RLlib] Skip two_step_game_qmix test (#16758) 2021-06-29 14:27:48 -07:00
Amog Kamsetty
56068f8f81
Skip test_component_failures_2 on Windows (#16745) 2021-06-29 14:06:09 -07:00
Ian Rodney
b8f950775e
[Client] Keep client_mode for dumps_from_client (#16732) 2021-06-29 13:30:10 -07:00
Amog Kamsetty
c0560dadef
[Docker] Pin Tensorflow (#16741) 2021-06-29 11:14:46 -07:00
Dmitri Gekhtman
257d072d13
[kubernetes][release] K8s release test instructions (#16662) 2021-06-29 10:57:35 -07:00
chenk008
c318293d9f
[Core] start worker in container (#16671) 2021-06-29 10:12:47 -07:00
matthewdeng
b0f304a1b5
[release] add golden notebook release test for torch/tune/serve (#16619)
* [release] add golden notebook release test for torch/tune/serve

* start serve on all nodes so remote localhost works
2021-06-29 09:13:23 -07:00
Ian Rodney
b3532cc2d1
[Client][Test] Avoid Port-Reuse to DeFlake (#16697)
Co-authored-by: mwtian <81660174+mwtian@users.noreply.github.com>
2021-06-28 23:54:06 -07:00
Ian Rodney
a9df1b7a67
[Test][Modin] Actually run test_modin (#16719) 2021-06-28 20:39:30 -07:00
SangBin Cho
804a867b3d
Revert revert OBOD pubsub PR (#16487)
* Revert "Revert "[Pubsub] Use a pubsub module for Ownership based object directory (#16407)" (#16486)"

This reverts commit b986938f0f.

* revert the obod problem.

* Add stats.

* Fix a possible regression.

* in another progress

* debugging

* Fix stats bug

* update

* Add more stats.

* Add stats

* lint

* Fix issue

* remove spammy logs

* lint

* better error msg for debugging

* Add even more logging

* Remove spammy logs

* Fix iterator invalidation issue

* more debugging info

* fix

* Add more debug logs

* add debug logs

* Remove the problematic line for confirmation

* Completed

* Fixed a broken test.

* experiment

* Lint

* Add a better error message

* try out

* revert the build file.

* In progress again

* IP

* Formatting

* Revert the log level

* Unskip test array

* final clean up.

* fix a build issue

* debug logs

* remove

* .

* Add more critical logs.

* format

* tmp

* log

* log

* issue fix

* Upgrade

* test experiment

* Fix an issue

* Fix issues.

* Lint

* remove unnecessary code

* last clean up.

Co-authored-by: Stephanie Wang <swang@cs.berkeley.edu>
2021-06-28 20:30:31 -07:00
SongGuyang
41b9a5102b
[C++ worker] support build C++ worker during python setup (#16636) 2021-06-29 10:29:47 +08:00
Amog Kamsetty
322b9531f6
[SGD] Add __init__ file to tf.examples (#16726) 2021-06-28 19:23:22 -07:00
Ian Rodney
1a357a7e4f
[Client] Auto-Run ray.client().connect() (#16259) 2021-06-28 17:01:26 -07:00
Travis Addair
e5dfa4cfb9
[tune] Only use TBXLoggerCallback when torch is installed (#16695)
* [tune] Only use TBXLoggerCallback when torch is installed

* Fix lint

* fix

* Update python/ray/tune/utils/callback.py

Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2021-06-28 16:34:20 -07:00
Alex Wu
0811ae4231
Use the same worker id in python and C++ (#16712)
Co-authored-by: Alex <alex@anyscale.com>
2021-06-28 15:42:37 -07:00
Jiao
6aeda62d40
[Serve] Add serve test config files and wrk dependency (#16631) 2021-06-28 10:01:55 -07:00
Amog Kamsetty
be1f6d59fa
[CI] Re-try Tag rllib flaky tests (#16680) 2021-06-28 18:42:54 +02:00
architkulkarni
b9f6132c08
skip flaky conda env fixture on MacOS (#16710) 2021-06-28 09:38:17 -07:00
Tao Wang
38157a3166
[Core]support external redis address when starting ray processes (#13170)
* support external redis address when starting ray processes

* use a more general name

* add cli option

* handle some details

* fix set shards logic

* reuse --address instead of introduce a new one

* lint

* tiny

* lint and fix
2021-06-28 09:22:40 -07:00
Kai Fricke
04bfba1274
[tune] Move reporter detection to utility function (#16673)
Test failures seem unrelated
2021-06-28 12:55:05 +01:00
qicosmos
500891c1e0
[C++ Worker]Support windows (#16700) 2021-06-28 17:45:20 +08:00
Amog Kamsetty
54ce8092ab
[Tune] Update transformers to 4.6.1 (#16397)
* add examples

* update dask docs

* add build file

* formatting

* fix ci command

* fix

* Update python/ray/util/dask/BUILD

* newline

* fix pytest fixtures

* fixes

* formatting

* fix shuffle example

* update

* dont log to wandb
2021-06-26 14:10:47 -07:00
AnnaKosiorek
1e709771b2
[rllib][minor] clarification of the softmax axis in dqn_torch_policy (#16311)
pytorch nn.functional.softmax (unlike tf.nn.softmax) calculates softmax along zeroth dimension by default
2021-06-26 11:19:54 -07:00
Eric Liang
aa882ed52d
Make it more convenient to develop ray.data by setting RAY_EXPERIMENTAL_DATA_API=1 (#16685)
* make it convenient to import ray.data

* update

* Update python/ray/experimental/data/read_api.py

Co-authored-by: Alex Wu <itswu.alex@gmail.com>

Co-authored-by: Alex Wu <itswu.alex@gmail.com>
2021-06-26 09:17:30 -07:00
Eric Liang
6bfa97eed7
Check in the first iteration of an Arrow-based dataset api (#16648) 2021-06-25 18:45:13 -07:00
Eric Liang
3f5ce01949
Address leftover comments from https://github.com/ray-project/ray/pull/16394/files (#16684) 2021-06-25 16:45:50 -07:00
Dmitri Gekhtman
7b58ec9ae5
[autoscaler] rsync bootstrap flag (#16667) 2021-06-25 15:26:47 -07:00
Eric Liang
9b17c35bee
Fix PullManager handling of get requests and liveness issues (#16394) 2021-06-25 13:01:46 -07:00
Kai Fricke
696334ff08
[tune] Fix Tee utility class properties (#16674) 2021-06-25 18:19:01 +01:00
architkulkarni
06dfd8dddb
Revert "[Dashboard][event] Basic event module (#16283)" (#16676)
This reverts commit 5afa53aa64.
2021-06-25 09:38:18 -07:00
architkulkarni
35039869ee
Revert "[RLlib] Add some learning tests to rllib-flaky (#16604)" (#16677)
This reverts commit d1510911e0.
2021-06-25 09:37:58 -07:00
Lixin Wei
a9d6e93977
[scheduler] Rename TaskRequest to ResourceRequest (#16649) 2021-06-25 08:50:20 -07:00
architkulkarni
503641c2c2
[Core] [runtime env] add C++ test for caching workers by runtime env hash (#16664) 2021-06-25 09:38:37 -05:00
architkulkarni
b15ab2d60b
[Core] [runtime env] Support specifying runtime env in @ray.remote decorator (#16660) 2021-06-25 09:37:40 -05:00
SongGuyang
e74d9d3ded
[runtime env] Download runtime env(conda) in agent instead of setup_worker (#16525) 2021-06-25 19:39:05 +08:00
dependabot[bot]
2e3771cc29
[tune](deps): Bump tensorflow-probability in /python/requirements/tune (#16561)
Signed-off-by: dependabot[bot] <support@github.com>

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2021-06-25 11:50:35 +01:00
fyrestone
5afa53aa64
[Dashboard][event] Basic event module (#16283) 2021-06-25 13:59:02 +08:00
mwtian
49b8b86488
Remove empty ClusterTaskManager::ScheduleInfeasibleTasks() (#16665) 2021-06-24 22:34:57 -07:00
Eric Liang
1c709cbeb3
Fix typing (#16668) 2021-06-24 22:06:33 -07:00
Chen Shen
c4d7b31a79
[Test] Placement group stress test (#16633) 2021-06-24 21:35:55 -07:00
Qing Wang
89b07572da
[Java] Upgrade log4j (#16657) 2021-06-24 21:01:27 -07:00