You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What happened:
When writing a delta table with approx 10m records but around 35 partitions it hangs randomly due to many file handlers open.
I was able to narrow it down to the file handles since I set the max_files_open to 2. And then I looped the writing by 25 times which all worked fine.
When i removed this parameter it would randomly fail to write.
Some additional weird behavior, I could see some empty partition folders being created and then it hangs and doesn't do anything for many hours until I manually killed the pipeline.
What you expected to happen:
Write also with many file handles open at same time.
Environment
Delta-rs version: 0.13.0
Binding: Python
Environment:
Bug
What happened:
When writing a delta table with approx 10m records but around 35 partitions it hangs randomly due to many file handlers open.
I was able to narrow it down to the file handles since I set the max_files_open to 2. And then I looped the writing by 25 times which all worked fine.
When i removed this parameter it would randomly fail to write.
Some additional weird behavior, I could see some empty partition folders being created and then it hangs and doesn't do anything for many hours until I manually killed the pipeline.
What you expected to happen:
Write also with many file handles open at same time.
How to reproduce it:
Difficult, but the integration test @roeap triggered todat, since it didn't run for a while, was showing the same behaviour: https://github.com/delta-io/delta-rs/actions/runs/6812593991/job/18525273592
More details:
The text was updated successfully, but these errors were encountered: