-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HBASE-28836 Parallize the file archival to improve the split times #6483
Conversation
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
hbase-server/src/main/java/org/apache/hadoop/hbase/backup/HFileArchiver.java
Show resolved
Hide resolved
d3703bb
to
f86aa9b
Compare
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
This comment has been minimized.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
boolean fileCleaned = fileFutureEntry.getValue().get(); | ||
if (!fileCleaned) { | ||
LOG.warn("Couldn't archive %s into backup directory: %s" | ||
.formatted(fileFutureEntry.getKey(), baseArchiveDir)); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't know all the jdk requirements, but looks like .formatted
was added only in JDK 15. If this is intended for backport, there may be earlier JDK versions to support.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for pointing to this. Yes i was using jdk 17 so missed this.
… object store (s3)
f86aa9b
to
925ff4a
Compare
🎊 +1 overall
This message was automatically generated. |
💔 -1 overall
This message was automatically generated. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It looks like until this change, only MasterFileSystem#checkTempDir
and DeleteTableProcedure#deleteFromFs
were using parallel archival using thread pool of size defined by hbase.hfilearchiver.thread.pool.max
.
With this change, we will use parallel archival for all use cases.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1
@mnpoonia could you also create branch-2 backport PR so that after clean build results, the changes can be ported to all active branches? |
@virajjasani Sure. It can be applied to all active branches. |
Sounds good, let me verify on branch-2 and then will merge it today. |
…6483) Signed-off-by: Viraj Jasani <[email protected]> Signed-off-by: David Manning <[email protected]> Signed-off-by: Umesh Kumar <[email protected]>
…6483) Signed-off-by: Viraj Jasani <[email protected]> Signed-off-by: David Manning <[email protected]> Signed-off-by: Umesh Kumar <[email protected]>
…6483) Signed-off-by: Viraj Jasani <[email protected]> Signed-off-by: David Manning <[email protected]> Signed-off-by: Umesh Kumar <[email protected]>
…6483) Signed-off-by: Viraj Jasani <[email protected]> Signed-off-by: David Manning <[email protected]> Signed-off-by: Umesh Kumar <[email protected]>
Thank you @virajjasani |
Thank you @mnpoonia for this nice improvement! |
…pache#6483) Signed-off-by: Viraj Jasani <[email protected]> Signed-off-by: David Manning <[email protected]> Signed-off-by: Umesh Kumar <[email protected]>
Jira: HBASE-28836