Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Checksum at object stores #388

Merged
merged 21 commits into from
Jun 14, 2022
Merged

Checksum at object stores #388

merged 21 commits into from
Jun 14, 2022

Conversation

parasj
Copy link
Contributor

@parasj parasj commented Jun 8, 2022

Fixes #391
Fixes SKY-158
Fixes SKY-177

Implements checksums by generating a checksum object at object store download, storing it to the chunk then checking the MD5 hash at object store write-back within the destination object store.

  • Add checksum generation on object store download
  • Check MD5 on S3 upload
  • Add upload checksum check for Azure
  • Update chunk requests with checksums, then check when uploading

@linear
Copy link

linear bot commented Jun 8, 2022

SKY-158 Checksumming

Failure cases not covered:

  • Gateways failing
  • Client failing
  • Stalls or other socket issues (verification may catch this but we don't have timeouts)

@parasj parasj changed the title [Draft] Checksum at object stores Checksum at object stores Jun 10, 2022
@parasj parasj requested a review from sarahwooders June 10, 2022 03:12
@parasj parasj self-assigned this Jun 10, 2022
@parasj parasj mentioned this pull request Jun 11, 2022
@linear
Copy link

linear bot commented Jun 11, 2022

parasj and others added 8 commits June 11, 2022 20:59
* Refactored src dest querying code into helper method

* Sync design implemented

* Sync bug fixed + redesign implemented

* Style and typing fixes

* typing issue:

* Style formatting

* Added TransferObjectList dataclass

* Minor further refactor of TransferObjectList, add new replicate_helper_random function

* Further clean up

* Move TransferObjectList

* Consolidate replicate_helper_cp and replicate_helper_random

* Inline replicate helper

Co-authored-by: Paras Jain <[email protected]>
@parasj parasj merged commit bf415c5 into main Jun 14, 2022
@parasj parasj deleted the dev/checksum branch June 14, 2022 16:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Checksumming
3 participants