-
Notifications
You must be signed in to change notification settings - Fork 837
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Multipart upload can leave futures unpolled, leading to timeout #5366
Comments
Perhaps I am missing something, why would the future not be polled if there is work to do? This sounds like the issue is the user is multiplexing cpu bound tasks on the same threadpool/task, and therefore starving their write tasks. The conclusion of prior efforts in this space, e.g. #4040, is this is a user problem |
Okay, let me try creating a reproduction in Rust. |
Here is the reproduction:
I think we would agree that it might be expected behavior if the sleep in step (3) was synchronous/blocking. However, this will fail even if the sleep is async, which makes this feel like a real foot gun. use object_store::{aws::AmazonS3Builder, path::Path, ObjectStore};
use tokio::io::AsyncWriteExt;
#[tokio::main]
async fn main() {
// Create an S3 store configured from environment variables
let store = AmazonS3Builder::from_env().with_bucket_name("lance-performance-testing").build().unwrap();
let (_id, mut writer) = store.put_multipart(&Path::from("test")).await.unwrap();
// Upload 10 MB of data
let data_chunk = vec![0u8; 1024 * 1024];
for _ in 0..10 {
writer.write_all(&data_chunk).await.unwrap();
}
// Sleep for 30 seconds
tokio::time::sleep(tokio::time::Duration::from_secs(30)).await;
// try to upload 10 more (should fail here)
for _ in 0..10 {
writer.write_all(&data_chunk).await.unwrap();
}
// finish.
writer.shutdown().await.unwrap();
}
|
Hmm... I wonder if the issue here is that write_all is not waiting for in-progress writes to finish? Is it possible the poll_flush implementation isn't quite right? I would expect the above to work, without needing to spawn work. |
It looks like this is actually an issue with the underlying IO traits - tokio-rs/tokio#4296 TLDR is the user must explicitly flush when they've finished writing. I agree this is an unfortunate footgun |
That is correct, but it's due to the design of the multipart write implementation. By design, when I wrote it, |
Oh I see what you mean about flush. Basically, anytime a user expects there will be a significant gap in time between write calls, they must call flush. I think that could work. |
Yes this is an unfortunate property of the AsyncWriteExt trait, I agree it is unfortunate and the method is definitely confusing, we should definitely document this footgun and link back to the tokio ticket. |
|
@tustvold does this affect only MultiPart? Because I'm seeing since object store 0.9 quite some timeouts in ADLS and S3 with Delta-RS |
This ticket is specifically multipart.
What version were you running before, 0.8 set some default timeouts (previously they were unlimited). |
Describe the bug
Instead of waiting until the data passed to the writer is uploaded to return ready, we buffer it until there is enough data and then put the request future in
FuturesUnordered
. I thought this was pretty clever when I originally wrote it, but it has a big flaw: If a lot of time passes between write calls, the request futures can go all that time without being polled. This can cause timeouts such as:To Reproduce
I discovered this downstream in some code that uses multi part uploads: lancedb/lance#1878 (comment)
Expected behavior
I think the best solution is to run the upload tasks inside of a background task created with
tokio::task::spawn()
.Additional context
It's quite possible this is the same underlying cause of #5106
The text was updated successfully, but these errors were encountered: