Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow writing into Delta tables using deletion vectors #21784

Closed
wants to merge 1 commit into from

Conversation

ebyhr
Copy link
Member

@ebyhr ebyhr commented May 1, 2024

Description

This PR doesn't writing deletion vectors. Still rewriting whole files.
Relates to #17063

Release notes

(x) Release notes are required, with the following suggested text:

# Delta Lake
* TBD. ({issue}`issuenumber`)

@cla-bot cla-bot bot added the cla-signed label May 1, 2024
@github-actions github-actions bot added the delta-lake Delta Lake connector label May 1, 2024
@ebyhr ebyhr force-pushed the ebi/delta-write-without-dv branch from ba631f5 to ec8aadf Compare May 1, 2024 09:05
@ebyhr ebyhr force-pushed the ebi/delta-write-without-dv branch from ec8aadf to 71d0286 Compare May 1, 2024 09:30

assertUpdate("MERGE INTO " + tableName + " t USING " + tableName + " s " +
"ON (t.a = s.a) WHEN MATCHED THEN UPDATE SET b = -1", 2);
assertQuery("SELECT * FROM " + tableName, "VALUES (1, -1), (-3, -1)");
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Reading logic for deletion vectors might be broken though I'm not sure if Databricks writes such transaction logs:

java.lang.AssertionError: For query 20240501_092829_00018_hvqds: 
 SELECT * FROM deletion_vectorsg763ca8zs4
not equal
Actual rows (up to 100 of 1 extra rows shown, 3 rows in total):
    [2, 22]
Expected rows (up to 100 of 0 missing rows shown, 2 rows in total):

@ebyhr ebyhr closed this May 9, 2024
@ebyhr ebyhr deleted the ebi/delta-write-without-dv branch May 9, 2024 01:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cla-signed delta-lake Delta Lake connector
Development

Successfully merging this pull request may close these issues.

1 participant