registry/storagedriver S3 Walk optimization #17

CollinShoop · 2021-06-28T16:27:53Z

Objective

blobstore enumeration with S3 storage driver (and possibly others with follow up effort) can be optimized by several orders of magnitude in most cases by offloading more work to the S3 API. In some cases this gives identical performance but in extreme cases, eg thousands of blobs in separate folders, this gives a huge performance boost.

Changes

Use ListObjectsV2PagesWithContext without Delimiter, giving all objects of subpaths in batches up to 1000
Infer directories (no longer listed without Delimiter & recursive implementation) by comparing subsequent object paths of different subdirectories
Keep track of skipped directory and manually skip over any objects under that directory. Note: I acknowledge this could bel less efficient in some extreme cases. The usage patterns in the context of registry should be such that we get an overall performance increase in all cases, at least that I'm aware of.
Added tests for S3 Walk implementation
Added tests for fallback Walk implementation

Bug Fix
While testing, I noticed that WalkFallback does not handle ErrSkipDir as documented for non-directory.

Expected: WalkFallback should stop when ErrSkipDir is returned for a non-directory, as documented WalkFallback
Actual: WalkFallback handles ErrSkipDir for non-directory by skipping the file and does not stop. This is tested with the added case TestWalkFallback/stop early

Run S3 Tests

#export REGION_ENDPOINT=sfo3.digitaloceanspaces.com
export AWS_REGION=us-east-1
export AWS_ACCESS_KEY=<key>
export AWS_SECRET_KEY=<secret>
export S3_BUCKET=<bucket>
go test -run TestWalk -v ./registry/storage/driver/s3-aws/...

Performance

On a few test registries, I performed a rough benchmark using BlobEnumerator::enumerate twice: Once before making these chances & again with the changes. I used a few local changes to keep track of the number of objects / folders enumerated and API calls made.

Test 1 (medium) ~300 blobs

Before
- Took 53.9 seconds
- 479 API calls
After
- Took 0.128 seconds
- 1 API call

Results

Reduce runtime by factor of 421
Reduce API calls by a factor of 479

Test 2 (large) ~50k blobs
Only the first 5 minutes of Walk are recorded and extrapolated, which I think is fair to get the point across

Before - partial results
- Took 5 minutes
- 2680 API calls
Before - extrapolated (x18)
- Took 89.74 minutes
- 48100 API calls
After
- Took 9.85 seconds
- 49 API call

Results

Reduce runtime by factor of 546
Reduce API calls by a factor of 981.

registry/storage/driver/s3-aws/s3.go

registry/storage/driver/walk.go

waynr

Looks good to me, other than the nit about explaining the principle of the new doWalk implementation.

registry/storage/driver/s3-aws/s3.go

CollinShoop · 2021-06-29T12:51:53Z

Finding through further testing, the current impl does not work for scan repositories so addressing that now.

waynr · 2021-06-29T15:55:23Z

registry/storage/driver/s3-aws/s3.go

+//   => [ "/path/to/folder/folder2", "/path/to/folder/folder2/folder1" ]
+// Eg 5 directoryDiff("/", "/path/to/folder/folder/file")
+//   => [ "/path", "/path/to", "/path/to/folder", "/path/to/folder/folder" ],
+func directoryDiff(prev, current string) []string {


Are you familiar with the filepath package? Specifically, filepath.Rel. Along with the the filepath.SplitList subcommand we should be able to simplify the loop/logic constructing parents and eliminate the sort.Sort by looping over the split directory names to construct the parents one-by-one. Not totally clear to me if that would make this function overall more efficient, but if it so it should be worth the effort since it looks like we run this for every filepath we see.

Will take a look at that 👍🏻

I'm not sure how filepath.Rel and SplitList would help here, though I can replace sort.Sort with something to reverse the list ordering, as the way this is done generates a list in reverse order compared to how we want them to be walked. If you have an idea of an alternate implementation, do you mind writing it up?

Using a simple reverse function now and all the unit tests still pass 👍🏻

I'm not sure how filepath.Rel and SplitList would help here

The main benefit in my view would be improved readability and remove the need for the sort (or reverse now i guess).

registry/storage/driver/s3-aws/s3.go

adamwg

This looks great, though it took me a couple of read-throughs (and some S3 doc reading) to understand the correctness. I've left a couple of comment suggestions that would help make it more obvious for the reader.

registry/storage/driver/s3-aws/s3.go

… linked/blobstore

…rking & added a Files Removed test for WalkFilesFallback.

… Walk method.

…l that was left in.

…ng ErrSkipDir from stopping gracefully

…rom stopping gracefully

…it all into S3 tests.

…es to walk between files. This is needed for manifest enumeration among others

…with reverse

CollinShoop mentioned this pull request Jun 28, 2021

registry/cshoop/storagedriver Added StorageDriver method WalkFiles, optimize blobstore enumeration with S3 #16

Closed

waynr reviewed Jun 28, 2021

View reviewed changes

registry/storage/driver/s3-aws/s3.go Show resolved Hide resolved

CollinShoop commented Jun 28, 2021

View reviewed changes

registry/storage/driver/walk.go Show resolved Hide resolved

CollinShoop requested review from adamwg and waynr June 28, 2021 21:01

waynr approved these changes Jun 28, 2021

View reviewed changes

registry/storage/driver/s3-aws/s3.go Show resolved Hide resolved

CollinShoop requested a review from waynr June 29, 2021 14:38

waynr reviewed Jun 29, 2021

View reviewed changes

adamwg approved these changes Jun 29, 2021

View reviewed changes

registry/storage/driver/s3-aws/s3.go Outdated Show resolved Hide resolved

registry/storage/driver/s3-aws/s3.go Outdated Show resolved Hide resolved

Collin Shoop added 20 commits June 30, 2021 08:56

storagedriver: Added a new WalkFiles method (later removed)

f31195b

Replaced usage of storagedriver.Walk with storagedriver.WalkFiles for…

c83e9ae

… linked/blobstore

storagedriver/s3: Simplified conditional in Walk impl

48e2373

Fixed the fallback implementation of WalkFilesFallback that wasn't wo…

cff975b

…rking & added a Files Removed test for WalkFilesFallback.

Added unit testing for storage driver WalkFallback and WalkFilesFallback

88420f2

storagedriver/s3: additional tests

a08cc38

storagedriver/s3: refining tests

c98c694

storagedriver/s3: continue refining tests

1c3ee66

storagedriver/s3: Reverting WalkFiles method. Instead, optimizing the…

e20be1e

… Walk method.

storagedriver/s3: Reverting a few changes from previous WalkFiles imp…

8b726cc

…l that was left in.

storagedriver/s3: test refactor. fixed a bug in WalkFallback preventi…

aea873c

…ng ErrSkipDir from stopping gracefully

storagedriver/s3: fixed a bug in s3 Walk impl preventing ErrSkipDir f…

847738f

…rom stopping gracefully

storagedriver: Adding an interface conformance test for Walk

fbb45bd

storagedriver: Adding an interface conformance test for Walk, cont.

70573f9

storagedriver/s3: Forgoing any interface conformance tests and moved …

b2bdea2

…it all into S3 tests.

storagedriver/s3: Cleaning up tests.

05a2d10

storagedriver/s3: Reverting unintentional change.

9a4da20

storagedriver/s3: Added Walk test case for dealing with errors

9618ba7

storagedriver/s3: Updating comments

8d38cde

storagedriver/s3: Major change to the S3 Walk impl to infer directori…

b9b0cac

…es to walk between files. This is needed for manifest enumeration among others

Collin Shoop added 2 commits June 30, 2021 08:56

storagedriver/s3: Updating the directoryDiff naming and replace sort …

d257d6c

…with reverse

storagedriver/s3: More comment wording

dec90d3

CollinShoop merged commit 26e4128 into digitalocean:master Jun 30, 2021

This was referenced Aug 12, 2021

Closed #21

Closed

Optimize storagedriver/s3 Walk (up to ~500x) + small bugfix distribution/distribution#3480

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

registry/storagedriver S3 Walk optimization #17

registry/storagedriver S3 Walk optimization #17

CollinShoop commented Jun 28, 2021 •

edited

Loading

waynr left a comment

CollinShoop commented Jun 29, 2021

waynr Jun 29, 2021

CollinShoop Jun 29, 2021

CollinShoop Jun 29, 2021

CollinShoop Jun 29, 2021

waynr Jun 30, 2021

adamwg left a comment

registry/storagedriver S3 Walk optimization #17

registry/storagedriver S3 Walk optimization #17

Conversation

CollinShoop commented Jun 28, 2021 • edited Loading

Objective

Changes

Run S3 Tests

Performance

waynr left a comment

Choose a reason for hiding this comment

CollinShoop commented Jun 29, 2021

waynr Jun 29, 2021

Choose a reason for hiding this comment

CollinShoop Jun 29, 2021

Choose a reason for hiding this comment

CollinShoop Jun 29, 2021

Choose a reason for hiding this comment

CollinShoop Jun 29, 2021

Choose a reason for hiding this comment

waynr Jun 30, 2021

Choose a reason for hiding this comment

adamwg left a comment

Choose a reason for hiding this comment

CollinShoop commented Jun 28, 2021 •

edited

Loading