Run optimize after Liquid Clustering #448
Comments
CC: @benc-db to see if you have any thoughts?
I need to give this some thought, as from the docs this is something that needs to be done on an ongoing basis. We could run optimize on every run, but is that too often?
Also, I need to dig a little more to understand the behavior today because, just from inspecting the code, I would expect this
or this (dbt-databricks/dbt/include/databricks/macros/materializations/incremental/incremental.sql, line 89 in a6ddca9)
to already be running optimize.
That's what I thought also, but it's not running the
Will investigate; changing the tag to 'bug' because I believe that the correct operation of this feature includes running optimize.
Disclaimer: this might be a question and not really a feature request.
Describe the feature
The liquid clustering docs (https://docs.databricks.com/en/delta/clustering.html#how-to-trigger-clustering) say we should run
OPTIMIZE table_name;
to trigger clustering. So I am thinking that if a model has liquid_clustered_by set, then we should also run OPTIMIZE table_name; after building it.
Describe alternatives you've considered
Somehow using this? https://github.com/databricks/dbt-databricks/blob/main/dbt/include/databricks/macros/adapters.sql#L368
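Another possible alternative, until the adapter handles this itself, might be a post-hook on the model. This is only a sketch: it assumes dbt's standard post_hook config and the {{ this }} relation variable, and the model name, the upstream ref stg_events, and the event_date column are hypothetical.

```sql
-- models/events_clustered.sql (hypothetical model name)
{{
  config(
    materialized='incremental',
    liquid_clustered_by='event_date',   -- hypothetical clustering column
    post_hook='OPTIMIZE {{ this }}'     -- runs after the model is built, on every run
  )
}}

select *
from {{ ref('stg_events') }}            -- hypothetical upstream model
{% if is_incremental() %}
-- on incremental runs, only pick up rows newer than what is already loaded
where event_date > (select max(event_date) from {{ this }})
{% endif %}
```

Note that this would issue OPTIMIZE on every run of the model, which may or may not be more often than is actually needed.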
Additional context
Please include any other relevant context here.
Who will this benefit?
Everyone using liquid_clustered_by
Are you interested in contributing this feature?
Let us know if you want to write some code, and how we can help.