Improve Position Deletes in V3 #11122

Open
2 of 6 tasks
aokolnychyi opened this issue Sep 12, 2024 · 0 comments
Labels
proposal Iceberg Improvement Proposal (spec/major changes/etc)

Comments

@aokolnychyi
Contributor

Proposed Change

This proposal aims to improve the handling of position deletes in Iceberg. It builds on lessons learned from deploying the current approach at scale and addresses all unresolved questions from past community discussions and proposals. This effort primarily targets the V3 spec, but some parts can be completed independently.

Shortcomings of position deletes we observe today:

  • Having to choose between fewer delete files on disk (partition granularity) and targeted deletes (file granularity).
  • Dependence on external maintenance for consistent write and read performance.
  • Write and read overhead because the in-memory and on-disk representations differ.
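
To make the first trade-off concrete, here is a minimal sketch of the two granularities. It is not Iceberg code: the sample deletes, the file paths, and the helpers `group_deletes` and `rows_scanned_for` are all hypothetical, and a delete file is modeled as a plain list of (data file path, row position) pairs.

```python
from collections import defaultdict

# Hypothetical deletes: (partition, data_file_path, row_position).
deletes = [
    ("p1", "p1/a.parquet", 3),
    ("p1", "p1/a.parquet", 7),
    ("p1", "p1/b.parquet", 0),
    ("p1", "p1/c.parquet", 5),
    ("p2", "p2/d.parquet", 1),
]


def group_deletes(deletes, granularity):
    """Group (path, pos) pairs into delete files, one per partition or one per data file."""
    files = defaultdict(list)
    for partition, path, pos in deletes:
        key = partition if granularity == "partition" else path
        files[key].append((path, pos))
    return dict(files)


def rows_scanned_for(path, delete_files):
    """Count the delete rows a reader must open to find those that apply to one data file."""
    return sum(
        len(rows)
        for rows in delete_files.values()
        if any(p == path for p, _ in rows)
    )


by_partition = group_deletes(deletes, "partition")
by_file = group_deletes(deletes, "file")

print(len(by_partition), rows_scanned_for("p1/a.parquet", by_partition))
print(len(by_file), rows_scanned_for("p1/a.parquet", by_file))
```

In this toy model, partition granularity produces fewer delete files, but a reader of `p1/a.parquet` also has to scan delete rows that belong to other data files in the same partition. File granularity keeps the scan targeted at the cost of more files on disk.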

Proposal document

https://docs.google.com/document/d/18Bqhr-vnzFfQk1S4AgRISkA_5_m5m32Nnc2Cw0zn2XM

Specifications

  • Table
  • View
  • REST
  • Puffin
  • Encryption
  • Other
@aokolnychyi aokolnychyi added the proposal Iceberg Improvement Proposal (spec/major changes/etc) label Sep 12, 2024
@danielcweeks danielcweeks added this to the Iceberg V3 Spec milestone Sep 12, 2024

2 participants