This repository has been archived by the owner on Feb 18, 2024. It is now read-only.

Does arrow2 support codecs when reading IPC files? #163

Closed
ghuls opened this issue Jun 23, 2021 · 4 comments · Fixed by #168
Labels
question Further information is requested

Comments

@ghuls
Contributor

ghuls commented Jun 23, 2021

Does arrow2 support codecs when reading IPC files (Feather v2)?

Feather v2 files created by pandas use lz4 compression by default. arrow-rs currently crashes on such files.
Does arrow2 support reading IPC files that use a compression codec (lz4, zstd)?

apache/arrow-rs#286

@jorgecarleitao
Owner

Not yet. I will take a look and check how difficult it is to implement.

@ghuls
Contributor Author

ghuls commented Jun 24, 2021

At least checking whether an IPC file uses a compression codec would already be a good start, even if actual decompression is not supported yet.
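A minimal sketch of what such a check could look like, assuming a simplified metadata struct. `RecordBatchMeta`, `ReadError` and `check_readable` are hypothetical names for illustration, not arrow2 or arrow-rs APIs. The two codec variants mirror the `CompressionType` values (LZ4_FRAME, ZSTD) that the Arrow IPC format defines for the optional compression field of a record batch message.

```rust
use std::fmt;

/// Codecs defined by the Arrow IPC format (`CompressionType` in Message.fbs).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum CompressionType {
    Lz4Frame,
    Zstd,
}

/// Hypothetical, simplified view of a record batch's metadata.
/// `compression` is `None` when the batch body is stored uncompressed.
#[derive(Debug)]
pub struct RecordBatchMeta {
    pub num_rows: usize,
    pub compression: Option<CompressionType>,
}

/// Hypothetical error type for a reader that cannot decompress yet.
#[derive(Debug, PartialEq, Eq)]
pub enum ReadError {
    UnsupportedCompression(CompressionType),
}

impl fmt::Display for ReadError {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            ReadError::UnsupportedCompression(codec) => write!(
                f,
                "IPC record batch is compressed with {:?}, which this reader cannot decompress",
                codec
            ),
        }
    }
}

impl std::error::Error for ReadError {}

/// Fails fast with a descriptive error instead of interpreting
/// compressed buffers as if they were raw column data.
pub fn check_readable(meta: &RecordBatchMeta) -> Result<(), ReadError> {
    match meta.compression {
        None => Ok(()),
        Some(codec) => Err(ReadError::UnsupportedCompression(codec)),
    }
}
```

A reader would call `check_readable` on each batch's metadata before touching the body buffers, so a compressed file yields one clear error naming the codec rather than a failure somewhere deep in buffer decoding.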

@jorgecarleitao
Owner

@ghuls, there is now a PR for it, #168 :)

@ghuls
Contributor Author

ghuls commented Jun 24, 2021

Wow, that was quite fast. Any chance this will also end up in arrow-rs (apache/arrow-rs#286)? Depending on how fast polars switches to arrow2, it might be less of an issue. Still, arrow-rs currently does not see that the IPC stream is compressed and tries to read at the wrong locations, resulting in weird error messages, so at least detection of compression would be good to have there too.

@jorgecarleitao jorgecarleitao added the question Further information is requested label Jul 23, 2021