mirror of
https://github.com/vale981/ray
synced 2025-03-05 10:01:43 -05:00
![]() Adds a streaming based reading option for Snappy-compressed files. Arrow doesn't support streaming Snappy decompression since the canonical C++ Snappy library doesn't natively support streaming decompression. This PR works around this by doing streaming reads of snappy-compressed files using the streaming decompression API provided in the [python-snappy](https://github.com/andrix/python-snappy) package. This commit supplies a custom datasource that uses Arrow + [python-snappy](https://github.com/andrix/python-snappy) to read and decompress Snappy-compressed files. Co-authored-by: siddharth.goel <siddharth.goel@bytedance.com> Co-authored-by: Chen Shen <scv119@gmail.com> |
||
---|---|---|
.. | ||
data_processing | ||
ml | ||
requirements_default.txt |