Sven Mika
7eb1a29426
[RLlib] Fix ModelV2 custom metrics for torch. ( #16734 )
2021-07-01 13:01:40 +02:00
mwtian
7669708237
Create a wait_for_num_nodes() function, and use it in train_small
( #16784 )
2021-07-01 10:17:53 +01:00
Antoni Baum
2cec40d0f5
[tune] LightGBM integration callbacks ( #16713 )
2021-07-01 10:12:57 +01:00
Siyuan (Ryans) Zhuang
18232bb808
[Core][Serialization] Fix recursive resolve/deserialization ( #16794 )
...
* fix recursive resolve
* fix threading issues
2021-07-01 01:44:24 -07:00
Sven Mika
ce3e550c43
[RLlib] Enhance comment in example script multi_agent_custom_policy. ( #16740 )
2021-07-01 10:28:38 +02:00
Amog Kamsetty
2c17f4907e
[Tune] Add pbt_transformers Client test ( #16793 )
2021-06-30 23:10:16 -07:00
chenk008
06c7db7dca
[Core] Rename container option and ray-nest-container ( #16771 )
...
* rename container_option to container
* rename ray-nest-container to ray-worker-container
* lint
Co-authored-by: wuhua.ck <wuhua.ck@alibaba-inc.com>
2021-07-01 13:12:26 +08:00
SangBin Cho
0ad2dac3ca
[Dataset] Map batch implementation ( #16729 )
2021-06-30 21:51:04 -07:00
Lixin Wei
e00d898b75
[Core] Lightweight Resource Report for New Scheduler ( #16527 )
...
* check resource diff
* fix
* fix
* comment modified
* fix
2021-06-30 21:27:29 -07:00
SangBin Cho
94149ef50c
[Test] Skip some flaky tests ( #16768 )
...
* done
* Add test gcs fault tolerance to the flaky test
* Addressed code review
* Addressed the last code review.
2021-06-30 20:48:09 -07:00
Dmitri Gekhtman
0cbeefdadf
[autoscaler] GPU=0 resource tweak ( #16761 )
...
* tweak
* use >, some fields don't admit an order
2021-06-30 16:51:25 -07:00
crdnb
113ed2a07c
[kubernetes] Adding cpu limit to make ray helm chart working in environments which require set resource limits ( #16701 )
2021-06-30 13:31:55 -07:00
Ian Rodney
1f14f366fd
[Client] Make Client_Hook
per-thread ( #16731 )
2021-06-30 11:48:55 -07:00
ellimac54
bf4fcb2460
[doc] updated Windows development build instructions ( #16778 )
2021-06-30 11:34:46 -07:00
architkulkarni
3962ce286b
[Core] Bump override_environment_variables removal date from Ray 1.5 to 1.6 ( #16755 )
2021-06-30 09:31:08 -05:00
architkulkarni
077c04b06c
[Serve] Fix fastapi tutorial and update doc with deprecation warning ( #16759 )
2021-06-30 09:30:49 -05:00
SongGuyang
1d5c72ebde
[C++ API] rename 'ray_redis_address' to 'ray_address' ( #16772 )
2021-06-30 20:50:28 +08:00
Sven Mika
53206dd440
[RLlib] CQL BC loss fixes; PPO/PG/A2|3C action normalization fixes ( #16531 )
2021-06-30 12:32:11 +02:00
Kai Fricke
4b6f8a1ece
[cli] use shutil.move in ray cluster-dump
and allow passing of tempfile ( #16738 )
2021-06-30 07:49:03 +01:00
architkulkarni
13a133817b
[Core] [runtime env] [Tests] Add C++ unit test for dispatch queue nonblocking behavior ( #16751 )
2021-06-29 20:16:17 -07:00
Amog Kamsetty
69507f53db
[Horovod] Add Horovod example ( #16742 )
...
* wip
* updates
* updates
* update
* formatting
* updates
* updates
* update
* fix
* add timeout
2021-06-29 19:15:15 -07:00
Alex Wu
d89f148fbf
[Pubsub] Don't depend on subscriber address ( #16752 )
...
* remove subscriber address
* .
* lint
* test
* done
* lint
* .
* Update BUILD.bazel
Co-authored-by: Alex <alex@anyscale.com>
2021-06-29 17:34:37 -07:00
SangBin Cho
3cde8c36c9
Properly update the pinned object size ( #16476 )
2021-06-29 17:00:19 -07:00
Simon Mo
2ac8a197db
[Serve] Copy FastAPI ResponseModel field ( #16760 )
2021-06-29 16:28:08 -07:00
Patrick Ames
cf8785b0e1
[docs] Note that ordering of objects returned is preserved for ray.get. ( #16763 )
2021-06-29 16:17:16 -07:00
Richard Liaw
bcb73ed58b
finished impl ( #16753 )
2021-06-29 14:37:42 -07:00
Amog Kamsetty
abd16a8438
[RLlib] Skip two_step_game_qmix test ( #16758 )
2021-06-29 14:27:48 -07:00
Amog Kamsetty
56068f8f81
Skip test_component_failures_2 on Windows ( #16745 )
2021-06-29 14:06:09 -07:00
Ian Rodney
b8f950775e
[Client] Keep client_mode for dumps_from_client
( #16732 )
2021-06-29 13:30:10 -07:00
Amog Kamsetty
c0560dadef
[Docker] Pin Tensorflow ( #16741 )
2021-06-29 11:14:46 -07:00
Dmitri Gekhtman
257d072d13
[kubernetes][release] K8s release test instructions ( #16662 )
2021-06-29 10:57:35 -07:00
chenk008
c318293d9f
[Core] start worker in container ( #16671 )
2021-06-29 10:12:47 -07:00
matthewdeng
b0f304a1b5
[release] add golden notebook release test for torch/tune/serve ( #16619 )
...
* [release] add golden notebook release test for torch/tune/serve
* start serve on all nodes so remote localhost works
2021-06-29 09:13:23 -07:00
Ian Rodney
b3532cc2d1
[Client][Test] Avoid Port-Reuse to DeFlake ( #16697 )
...
Co-authored-by: mwtian <81660174+mwtian@users.noreply.github.com>
2021-06-28 23:54:06 -07:00
Ian Rodney
a9df1b7a67
[Test][Modin] Actually run test_modin ( #16719 )
2021-06-28 20:39:30 -07:00
SangBin Cho
804a867b3d
Revert revert OBOD pubsub PR ( #16487 )
...
* Revert "Revert "[Pubsub] Use a pubsub module for Ownership based object directory (#16407 )" (#16486 )"
This reverts commit b986938f0f
.
* revert the obod problem.
* Add stats.
* Fix a possible regression.
* in another progress
* debugging
* Fix stats bug
* update
* Add more stats.
* Add stats
* lint
* Fix issue
* remove spammy logs
* lint
* better error msg for debugging
* Add even more logging
* Remove spammy logs
* Fix iterator invalidation issue
* more debugging info
* fix
* Add more debug logs
* add debug logs
* Remove the problematic line for confirmation
* Completed
* Fixed a broken test.
* experiment
* Lint
* Add a better error message
* try out
* revert the build file.
* In progress again
* IP
* Formatting
* Revert the log level
* Unskip test array
* final clean up.
* fix a build issue
* debug logs
* remove
* .
* Add more critical logs.
* format
* tmp
* log
* log
* issue fix
* Upgrade
* test experiment
* Fix an issue
* Fix issues.
* Lint
* remove unnecessary code
* last clean up.
Co-authored-by: Stephanie Wang <swang@cs.berkeley.edu>
2021-06-28 20:30:31 -07:00
SongGuyang
41b9a5102b
[C++ worker] support build C++ worker during python setup ( #16636 )
2021-06-29 10:29:47 +08:00
Amog Kamsetty
322b9531f6
[SGD] Add __init__ file to tf.examples ( #16726 )
2021-06-28 19:23:22 -07:00
Ian Rodney
1a357a7e4f
[Client] Auto-Run ray.client().connect() ( #16259 )
2021-06-28 17:01:26 -07:00
Travis Addair
e5dfa4cfb9
[tune] Only use TBXLoggerCallback when torch is installed ( #16695 )
...
* [tune] Only use TBXLoggerCallback when torch is installed
* Fix lint
* fix
* Update python/ray/tune/utils/callback.py
Co-authored-by: Amog Kamsetty <amogkamsetty@yahoo.com>
Co-authored-by: Amog Kamsetty <amogkam@users.noreply.github.com>
2021-06-28 16:34:20 -07:00
Alex Wu
0811ae4231
Use the same worker id in python and C++ ( #16712 )
...
Co-authored-by: Alex <alex@anyscale.com>
2021-06-28 15:42:37 -07:00
Jiao
6aeda62d40
[Serve] Add serve test config files and wrk dependency ( #16631 )
2021-06-28 10:01:55 -07:00
Amog Kamsetty
be1f6d59fa
[CI] Re-try Tag rllib flaky tests ( #16680 )
2021-06-28 18:42:54 +02:00
architkulkarni
b9f6132c08
skip flaky conda env fixture on MacOS ( #16710 )
2021-06-28 09:38:17 -07:00
Tao Wang
38157a3166
[Core]support external redis address when starting ray processes ( #13170 )
...
* support external redis address when starting ray processes
* use a more general name
* add cli option
* handle some details
* fix set shards logic
* reuse --address instead of introduce a new one
* lint
* tiny
* lint and fix
2021-06-28 09:22:40 -07:00
Kai Fricke
04bfba1274
[tune] Move reporter detection to utility function ( #16673 )
...
Test failures seem unrelated
2021-06-28 12:55:05 +01:00
qicosmos
500891c1e0
[C++ Worker]Support windows ( #16700 )
2021-06-28 17:45:20 +08:00
Amog Kamsetty
54ce8092ab
[Tune] Update transformers to 4.6.1 ( #16397 )
...
* add examples
* update dask docs
* add build file
* formatting
* fix ci command
* fix
* Update python/ray/util/dask/BUILD
* newline
* fix pytest fixtures
* fixes
* formatting
* fix shuffle example
* update
* dont log to wandb
2021-06-26 14:10:47 -07:00
AnnaKosiorek
1e709771b2
[rllib][minor] clarification of the softmax axis in dqn_torch_policy ( #16311 )
...
pytorch nn.functional.softmax (unlike tf.nn.softmax) calculates softmax along zeroth dimension by default
2021-06-26 11:19:54 -07:00
Eric Liang
aa882ed52d
Make it more convenient to develop ray.data by setting RAY_EXPERIMENTAL_DATA_API=1 ( #16685 )
...
* make it convenient to import ray.data
* update
* Update python/ray/experimental/data/read_api.py
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
Co-authored-by: Alex Wu <itswu.alex@gmail.com>
2021-06-26 09:17:30 -07:00