ray/.buildkite
Jiao d95dc2f2e5
[AIR][GPU Batch Prediction] Add basic support for GPU batch prediction (#26251)
This PR adds GPU support for pytorch and tensorflow predictor, as well as automatic setting `use_gpu` flag in `BatchPredictor`.

Notable changes:
- Added `use_gpu` flag in the constructor of `TorchPredictor` and `TensorflowPredictor` (note it's slightly different from our latest design doc that puts this flag at `predict()` call)
- Added `use_gpu` flag to `SklearnPredictor` so its interface is compatible with `BatchPredictor`
- Code to move both model weights and input tensor to default visible GPU at index 0 if flag is set 
- parametrized existing predictor tests to use GPU for both CPU & GPU coverage
- Changed BUILD CI tests with an added `gpu` tag (I'm not 100% sure if that's a right way tho)

Follow ups:

https://github.com/ray-project/ray/issues/26249 is created in case our host has multiple GPU devices. It's a bit out of scope for this PR, but for GPU batch inference ideally we should be able to evenly use all GPU devices on host where CPU & DRAM are busy with pre-fetching + data movement to GPU. We might approximately do the same by scheduling same # of Predictor instances on the host, but that's worth verifying once benchmarks are set.
2022-07-11 13:04:15 -07:00
..
hooks [CI] Upload macOS bazel test files (#25744) 2022-06-15 10:09:04 -07:00
copy_files.py apply isort uniformly for a subset of directories (#25824) 2022-06-17 13:40:32 -07:00
Dockerfile [ci] Clean up ci/ directory (refactor ci/travis) (#23866) 2022-04-13 18:11:30 +01:00
Dockerfile.gpu [ci] Fix GPU docker builds (#24336) 2022-04-29 17:10:18 +01:00
pipeline.gpu.large.yml [AIR][GPU Batch Prediction] Add basic support for GPU batch prediction (#26251) 2022-07-11 13:04:15 -07:00
pipeline.gpu.yml [ci] Clean up ci/ directory (refactor ci/travis) (#23866) 2022-04-13 18:11:30 +01:00
pipeline.macos.yml [CI] Upload macOS bazel test files (#25744) 2022-06-15 10:09:04 -07:00
pipeline.ml.yml [RLlib] More Trainer -> Algorithm renaming cleanups. (#25869) 2022-06-20 15:54:00 +02:00
pipeline.windows.yml [ci] Clean up ci/ directory (refactor ci/travis) (#23866) 2022-04-13 18:11:30 +01:00
pipeline.yml [Docker] Add support for Cuda 11.3 (#26233) 2022-07-10 21:50:42 -07:00