-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Appending to a parquet file #47
Comments
Interesting proposal. I do not know whether other parquet implementations even support this. Valid proposal regardless. With
It does require some internal changes, though. |
Hello! I have a project where I am reading a large file into a Is my understanding of how the footer is being written is correct? Would the feature described here address the issue I am having? Thank you! |
@curtisalexander , do you have a minimal example of what you are doing? |
In developing a minimal example, I figured out that I had a bug in my project! The exercise of producing that example was most helpful. So please ignore my original question / comment. My apologies for reaching out prematurely without better testing. The bug I observed had to do with overwriting parquet files and the order in which they were overwritten. If the file was of a larger size (say I wrote 10 rows to the file) and then I tried to overwrite it with a smaller size (only 5 rows) then I would observe reading errors. It ultimately came down to the fact that I was not using Again, sorry for the distraction. Thanks for your willingness to help with what I reported. And thanks for this, and the |
Hi !
I was wondering if it would be possible to add support for appending data at the end of a parquet file ?
It would probably mean to truncate the footer, write the next
RowGroup
and rewrite the footer again.The text was updated successfully, but these errors were encountered: