
Use SHA for BLOB update instead of modification time #3697

Merged
merged 7 commits into from
Oct 4, 2024

Conversation

paxadax
Contributor

@paxadax paxadax commented Oct 1, 2024

What is the purpose of the change

Issue[4077]

When deploying Nimbus or changing leadership within a high-availability Nimbus cluster, we've verified that topology workers are killed due to differing blob modification times.
Because the modTime is used as the blob version, we have found that, when using the LocalFsBlobStoreFile, the following occurs every time the Nimbus leader goes down:

  1. Nimbus (1), the leader, goes down and a new Nimbus (2) picks up the leadership.
  2. If the blobs in Nimbus (2) have a different modTime, the workers are restarted (even though the blob contents may be identical).
  3. Nimbus (1) comes back up, syncs the blobs on startup, and updates their modTime, as it downloads the blobs again.
  4. If Nimbus (2) then goes down, all the workers are restarted again, because Nimbus (1) has new modTimes.
  5. This can repeat endlessly, as the modTime will always differ on each Nimbus leader.

In this PR, we've introduced a feature to use the file's SHA hash as the version instead of the modification time. With this feature, when a Nimbus loses leadership the workers keep running: an unchanged blob yields an unchanged SHA, and therefore an unchanged version.
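
The idea can be sketched as follows; this is a minimal illustration, not Storm's actual implementation, and the class and method names (`BlobVersionSketch`, `contentVersion`) are hypothetical:

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

public class BlobVersionSketch {
    // Derives a stable long version from the blob's content: identical bytes
    // yield identical versions on every Nimbus replica, so a leadership
    // change no longer looks like a blob update.
    static long contentVersion(byte[] blobBytes) throws NoSuchAlgorithmException {
        byte[] digest = MessageDigest.getInstance("SHA-256").digest(blobBytes);
        // Use the leading 8 bytes of the digest as a long version number.
        return ByteBuffer.wrap(digest).getLong();
    }

    public static void main(String[] args) throws Exception {
        byte[] copyOnNimbus1 = "topology.jar contents".getBytes(StandardCharsets.UTF_8);
        byte[] copyOnNimbus2 = "topology.jar contents".getBytes(StandardCharsets.UTF_8);
        // Same bytes -> same version, regardless of when each replica downloaded them.
        System.out.println(contentVersion(copyOnNimbus1) == contentVersion(copyOnNimbus2)); // prints true
    }
}
```

With modTime, the two copies above would compare unequal whenever they were downloaded at different times; with a content-derived version they always compare equal.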

How was the change tested

Unit tests.
Tested locally with a specific jar on my local Storm: forced Nimbus to change leadership, and the workers on the topologies continued to run properly.

@paxadax paxadax marked this pull request as draft October 1, 2024 16:25
@paxadax paxadax marked this pull request as ready for review October 1, 2024 16:40
@Override
public long getVersion() throws IOException {
    try (FileInputStream fis = new FileInputStream(path)) {
        byte[] bytes = DigestUtils.sha1(fis);
        // derive a long version from the leading bytes of the digest
        return ByteBuffer.wrap(bytes).getLong();
    }
}
Contributor

What do you think about using something such as SHA-256 or SHA-512 to avoid (unlikely) collisions?

Contributor Author

Thank you for the suggestion, I've updated it to SHA-256.

Contributor

Do we have a feeling for how often getVersion(...) is called? Computing a SHA hash is rather expensive compared to reading the modification date (just think about whether we need caching after the first call, or the like).

Contributor

If it is called often, perhaps we can use something such as MurmurHash, which is used elsewhere in the code for the sharding of tuples.

This is run by the AsyncLocalizer at the interval defined by supervisor.localizer.update.blob.interval.secs, so the impact falls on the supervisor, not on the workers. We wouldn't need to cache it, but we can add a cache nevertheless.
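
For context, the setting mentioned above is supervisor-side configuration; a storm.yaml fragment might look like this (the value shown is illustrative, not a recommendation):

```yaml
# Supervisor setting: how often the AsyncLocalizer re-checks blob versions.
# 30 is an illustrative value, not necessarily the default.
supervisor.localizer.update.blob.interval.secs: 30
```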

Contributor Author

@paxadax paxadax Oct 2, 2024

Since this runs continuously, we can opt for a Murmur hash, which prioritises fast hashing (suggested by @reiabreu), and that way we can skip caching.

Contributor

Works for me

Contributor Author

Hey, after a brief discussion we've decided to go with a checksum instead of Murmur, since checksum computation is faster. I've pushed a commit with the changes.
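
The checksum approach the thread settles on can be sketched like this; a minimal illustration assuming java.util.zip.CRC32, where `ChecksumVersionSketch` and `checksumVersion` are hypothetical names, not Storm's actual code:

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.zip.CRC32;
import java.util.zip.CheckedInputStream;

public class ChecksumVersionSketch {
    // Streams the file through a CRC32 checksum. This is much cheaper than a
    // cryptographic hash, which matters when the localizer polls frequently,
    // and it is still deterministic for identical blob contents.
    static long checksumVersion(Path path) throws IOException {
        CRC32 crc = new CRC32();
        try (InputStream in = new CheckedInputStream(Files.newInputStream(path), crc)) {
            byte[] buf = new byte[8192];
            while (in.read(buf) != -1) {
                // reading the stream drives the checksum update
            }
        }
        return crc.getValue();
    }

    public static void main(String[] args) throws Exception {
        Path blob = Files.createTempFile("blob", ".jar");
        Files.write(blob, "same bytes".getBytes());
        // Two reads of the same content yield the same version.
        System.out.println(checksumVersion(blob) == checksumVersion(blob)); // prints true
        Files.delete(blob);
    }
}
```

A CRC32 is not collision-resistant the way SHA-256 is, which is the trade-off the thread weighs: version stability across leaders is preserved, while per-poll cost stays low.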

Contributor

Makes sense to use a checksum.
I've approved the changes.

Contributor Author

@paxadax paxadax left a comment

Don't merge it yet, as I'm still running some tests in which some flakiness has surfaced.

Contributor Author

@paxadax paxadax left a comment

Everything is tested; we can proceed with the merge.

@reiabreu
Contributor

reiabreu commented Oct 4, 2024

@rzo1 do you want to re-examine the PR?

@rzo1
Contributor

rzo1 commented Oct 4, 2024

lgtm. Thanks for the PR.

@reiabreu reiabreu merged commit 1e8eee6 into apache:master Oct 4, 2024
18 checks passed
5 participants