Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Delta-rs writer hangs with to many file handles open (Azure) #1832

Closed
ion-elgreco opened this issue Nov 9, 2023 · 3 comments
Closed

Delta-rs writer hangs with to many file handles open (Azure) #1832

ion-elgreco opened this issue Nov 9, 2023 · 3 comments
Labels
bug Something isn't working

Comments

@ion-elgreco
Copy link
Collaborator

Environment

Delta-rs version: 0.13.0

Binding: Python

Environment:

  • Cloud provider: azure
  • OS:
  • Other:

Bug

What happened:
When writing a delta table with approx 10m records but around 35 partitions it hangs randomly due to many file handlers open.

I was able to narrow it down to the file handles since I set the max_files_open to 2. And then I looped the writing by 25 times which all worked fine.

When i removed this parameter it would randomly fail to write.

Some additional weird behavior, I could see some empty partition folders being created and then it hangs and doesn't do anything for many hours until I manually killed the pipeline.

What you expected to happen:
Write also with many file handles open at same time.

How to reproduce it:
Difficult, but the integration test @roeap triggered todat, since it didn't run for a while, was showing the same behaviour: https://github.com/delta-io/delta-rs/actions/runs/6812593991/job/18525273592

More details:

@ion-elgreco ion-elgreco added the bug Something isn't working label Nov 9, 2023
@stefnba
Copy link

stefnba commented Nov 9, 2023

Sounds like it’s the same issue reported by me #1770

@ion-elgreco
Copy link
Collaborator Author

@stefnba indeed seems to be the same issue! Sorry I missed that and also forget to respond there

@ion-elgreco
Copy link
Collaborator Author

Closing it in favor of discussion in #1770

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants