-
Notifications
You must be signed in to change notification settings - Fork 394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
docs: differentiate types of file hashes #1469
Comments
I would suggest to generalize it somehow instead of trying to mention all possible fields everywhere. E.g. outs can have hash (e.g. md5 or etag). |
Agree but we still need to update all those docs with the more general concept of "file hash" or "hash value" — which fortunately we've already been doing in the past 🙂 |
I think this issue is linked to #1448 more than it looks like. |
Which kind of file hash is used on each type of external dependency. Same as in add and run. Or maybe in the External Data guide and link from all those refs into there.
Yes, maybe. |
Also @skshetry this all applies to .dvc files as well, right? Not just dvc.lock |
Oh and what about external outputs? Or do |
@jorgeorpinel They might be |
@jorgeorpinel, this is about external dependencies and outputs. So, it applies to
OSS uses |
oss also doesn't support external deps/outs. |
This comment has been minimized.
This comment has been minimized.
On OSS, please clarify:
vs.
Thanks |
Wait no, sorry. Saugat said MD5 for GS, never mind. eTag is for HTTP(s), S3, and Azure. |
p.s. you can just review #1527 instead. |
hi all! I'm working on a tool to upload data to a dvc remote without actually using dvc and I noticed that the md5 of a file is not calculated correctly if the file is in windows format. Looks like the CRLF are replace by LF before calculation. iterative/dvc#775 This is a minor issue because will treat as equal files with different ending. The point is, I first though that for files added with |
Could it be Git doing that though (depending on the repo's config)? Cc @efiop on this Q anyway.
Not much @MetalBlueberry, which is why this issue and #68 exist. The one place where we've already put some of this info is in our DVC Metafiles guide: https://dvc.org/doc/user-guide/dvc-files-and-directories (please find the Feel free to ask any questions about this or anything else directly in our http://dvc.org/chat ! |
@MetalBlueberry if we talk about regular |
I've created an issue for the CRLF problem iterative/dvc#4658. |
I think this is probably addressed now ( https://dvc.org/doc/user-guide/project-structure/pipelines-files#dvclock-file ) but I'll double check the list in the issue description. |
Extracted from #1448 (comment) and #1494 (review):
("key" referring to the actual name with which the values are saved in dvc.yaml)
Related: #68
I believe this affects the following docs:
some of which may need more examples that feature
etag
andchecksum
fields in dvc.lock.The text was updated successfully, but these errors were encountered: