-
Notifications
You must be signed in to change notification settings - Fork 194
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement caching of manifest-files #595
Comments
I am trying to working on this, is it possible to assign it to me? |
@swapdewalkar Thanks for picking this up! I've just assigned it to you |
Hi @swapdewalkar I wanted to check in and see if you have any updates on this task. If you need any assistance or if there are any obstacles, please let me know—I will be happy to help! |
Hi, can we increase the scope of this issue to cache/store all_manifests, data_manifests & delete_manifests? Or do I create a new issue for this? This feature would be useful for tasks like Incremental Scans (Append, Changelog, etc) where we frequently access manifest files. I imagine we would like this feature to be similar to the java implementation. Also, since @swapdewalkar hasn't responded yet and if they do not have the time/bandwidth for the issue, I'm happy to give this a shot! :) |
@chinmay-bhat I think we can generalize this quite easily, since from the spec:
I think we could go as easy to have a |
Thanks @Fokko for the quick response! Really appreciate it!
I'm not clear on this. Are you saying we can simply add For ex:
|
@chinmay-bhat I don't think it is as easy as that. We should ensure that the We could also make the FileIO part of the caching key. I don't think that's stricktly required, but if something changed in the FileIO we might want to invalidate the cache, but I'm open to arguments here. |
Thank you for clarifying! Here's how I imagine manifests() would look like :)
When we call |
Feature Request / Improvement
We currently loop over the manifests of a snapshot often just once. But now when we're compounding the operations (DELETE+APPEND), there is a fair chance that read a manifest more than once. The spec states that manifest files are immutable, this means that we can cache is locally using a method annotated with the
lru_cache
.The text was updated successfully, but these errors were encountered: