This PR adds GPU support for the PyTorch and TensorFlow predictors, as well as automatic setting of the `use_gpu` flag in `BatchPredictor`. Notable changes:

- Added a `use_gpu` flag to the constructors of `TorchPredictor` and `TensorflowPredictor` (note this differs slightly from our latest design doc, which puts the flag on the `predict()` call).
- Added a `use_gpu` flag to `SklearnPredictor` so its interface stays compatible with `BatchPredictor`.
- Added code to move both the model weights and the input tensors to the default visible GPU at index 0 when the flag is set (sketched at the end of this description).
- Parametrized the existing predictor tests to also run on GPU, giving both CPU and GPU coverage.
- Changed the BUILD CI tests with an added `gpu` tag (I'm not 100% sure this is the right way to do it, though).

Follow-ups: https://github.com/ray-project/ray/issues/26249 is created in case the host has multiple GPU devices. That is a bit out of scope for this PR, but for GPU batch inference we should ideally be able to use all GPU devices evenly on a host whose CPU and DRAM are busy with pre-fetching and data movement to the GPU. We might approximate this by scheduling the same number of Predictor instances on the host, but that's worth verifying once benchmarks are set up.
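For reference, here is a minimal sketch of the constructor-level `use_gpu` behavior described above. The class name mirrors the real `TorchPredictor`, but this simplified implementation is illustrative only, not the actual Ray code:

```python
import numpy as np
import torch
import torch.nn as nn


class TorchPredictor:
    """Illustrative sketch: move weights and inputs to GPU 0 when use_gpu is set."""

    def __init__(self, model: nn.Module, use_gpu: bool = False):
        self.use_gpu = use_gpu
        # Default visible GPU is index 0; fall back to CPU otherwise.
        self.device = torch.device("cuda:0" if use_gpu else "cpu")
        self.model = model.to(self.device).eval()

    def predict(self, data: np.ndarray) -> np.ndarray:
        # Input tensors follow the model weights onto the same device.
        batch = torch.as_tensor(data, dtype=torch.float32).to(self.device)
        with torch.no_grad():
            out = self.model(batch)
        return out.cpu().numpy()


# Usage: enable GPU only when one is actually visible.
predictor = TorchPredictor(nn.Linear(4, 1), use_gpu=torch.cuda.is_available())
preds = predictor.predict(np.random.rand(8, 4).astype(np.float32))
```

One consequence of putting the flag on the constructor rather than on each `predict()` call (as in the design doc) is that the weights are transferred to the device once at predictor creation instead of on every batch.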