Spark: Read DVs when reading from .position_deletes table #11657
base: main
spark/v3.5/spark/src/main/java/org/apache/iceberg/spark/source/DVIterable.java
    @Override
    public CloseableIterator<InternalRow> iterator() {
      PuffinReader reader = builder.build();
[optional] This might be too much, but can we have one reader per DV file? For this use case specifically, we will eventually have to read all the blobs in the DV file.
Do you have a particular use case in mind? This isn't currently needed when reading the PositionDeletesTable.
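For reference, the "one reader per DV file" idea could look roughly like the sketch below. It is only an illustration under stated assumptions: `BlobReader` and `PerFileReaders` are hypothetical names, not part of Iceberg or of this PR, and the sketch does not use the real Puffin API. It only shows reads being grouped by file path so that each file is opened once and all of its blobs go through the same reader.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;

/** Hypothetical stand-in for a reader opened on a single DV file (not Iceberg's PuffinReader). */
interface BlobReader extends AutoCloseable {
  byte[] readBlob(long offset, int length);

  @Override
  void close();
}

/** Hypothetical helper that opens at most one reader per file path and reuses it for every blob. */
final class PerFileReaders implements AutoCloseable {
  private final Function<String, BlobReader> opener;
  private final Map<String, BlobReader> readers = new LinkedHashMap<>();

  PerFileReaders(Function<String, BlobReader> opener) {
    this.opener = opener;
  }

  /** Reads one blob, opening the file's reader only on first use of that path. */
  byte[] read(String path, long offset, int length) {
    return readers.computeIfAbsent(path, opener).readBlob(offset, length);
  }

  /** Number of distinct files that currently have an open reader. */
  int openCount() {
    return readers.size();
  }

  /** Closes every reader that was opened and forgets them. */
  @Override
  public void close() {
    readers.values().forEach(BlobReader::close);
    readers.clear();
  }
}
```

Whether this is worth it depends on how many blobs per file are actually read in one task, which is the open question in this thread.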
This is part of #11122 and has been extracted from #11545.