-
Notifications
You must be signed in to change notification settings - Fork 406
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cleanup_metadata
doesn't remove .checkpoint.parquet
files
#1420
Comments
I think it is just missing the delta-rs/rust/src/checkpoints.rs Line 208 in b17f286
diff --git a/rust/src/checkpoints.rs b/rust/src/checkpoints.rs
index 98a4e79..41385bb 100644
--- a/rust/src/checkpoints.rs
+++ b/rust/src/checkpoints.rs
@@ -205,7 +205,7 @@ pub async fn cleanup_expired_logs_for(
) -> Result<i32, DeltaTableError> {
lazy_static! {
static ref DELTA_LOG_REGEX: Regex =
- Regex::new(r#"_delta_log/(\d{20})\.(json|checkpoint)*$"#).unwrap();
+ Regex::new(r#"_delta_log/(\d{20})\.(json|checkpoint).*$"#).unwrap();
}
let mut deleted_log_num = 0; |
Also, should this be advancing by 1? Wouldn't that also remove the latest checkpoint ? delta-rs/rust/src/checkpoints.rs Line 106 in b17f286
Edit: Removing the latest checkpoint seems to be the case on the table I just tested with. |
Environment
Delta-rs version: v0.12.0
Binding: rust
Environment:
Bug
What happened:
Running the following snippet does not remove
.checkpoint.parquet
files.What you expected to happen:
.checkpoint.parquet
files older thanlog_retention_timestamp
should be removed.How to reproduce it:
cleanup_expired_logs_for
settinglog_retention_timestamp
toUtc::now()
.json
files are removedMore details:
https://regex101.com/r/OmoZMz/1
The text was updated successfully, but these errors were encountered: