Commit
... to behave in a more straightforward way when dealing with bookmarks/favourites/etc. Specific IDs are now grouped by their owner, album-id, ... to allow for duplicates when they would be expected.
Showing 11 changed files with 59 additions and 27 deletions.
5008e10
For `tumblr.py`, this `archive_fmt = "f_{blog[name]}_{id}_{offset}"` is currently only defined for the class `TumblrLikesExtractor`. Is this intentional? Not sure because of `TumblrUserExtractor`, for example... or are the IDs always the same?
The other Tumblr extractors are currently using `archive_fmt = "{id}_{offset}"` (defined in the `TumblrExtractor` base class).

I thought it would be OK to group the results of `user`, `post` and `tag` extractors together, since they all download their media into the same directory anyway.

I could change the archive-format of user-extractors, for example, to `u_{blog[name]}_{id}_{offset}`, just to be on the safe side and to have a separate "namespace" for each blog, but that would be redundant, I think.
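To make the difference between the two formats concrete, here is a minimal sketch of how each one would expand for the same post. It uses plain `str.format` as a stand-in for the project's own formatter, and the `archive_key` helper, the `offset` value and the metadata dict are assumptions made for illustration only.

```python
# Sketch only: plain str.format stands in for the project's own formatter.
BASE_FMT = "{id}_{offset}"                   # format quoted for the TumblrExtractor base class
LIKES_FMT = "f_{blog[name]}_{id}_{offset}"   # format quoted for TumblrLikesExtractor


def archive_key(fmt, metadata):
    """Expand an archive format string with a post's metadata (hypothetical helper)."""
    return fmt.format(**metadata)


# Hypothetical metadata; the offset of 0 is an assumption.
post = {"id": 167623548569, "offset": 0, "blog": {"name": "mikf123"}}

base_key = archive_key(BASE_FMT, post)
likes_key = archive_key(LIKES_FMT, post)

print(base_key)   # key shared by user, post and tag extractors
print(likes_key)  # key in the separate likes "namespace"
```

Because the likes format carries a prefix and the blog name, the same post yields a different key there than it does under the base format, so an entry recorded under one format would not block a download under the other.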
I noticed, and the base class is using `directory_fmt = ["{category}", "{name}"]` for the output directory, so how would this work with reblogs, for example? If you download blog A, which has a specific post X, and then download blog B, which has a reblog of that post X, would the reblog already count as archived (and thus be missing from the directory for blog B)? Or are the IDs always unique across different blogs?
A reblog has a different ID than the original post.
For example:
original: https://mikf123.tumblr.com/post/167623548569
its reblog: https://mikf123.tumblr.com/post/169341068404
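As a rough illustration of why distinct IDs are enough here, the sketch below models the archive as a plain set of expanded `{id}_{offset}` keys. The set, the `should_download` helper and the `offset` value of 0 are assumptions for the example and not the project's actual archive implementation; only the two post IDs come from the links above.

```python
BASE_FMT = "{id}_{offset}"

# Stand-in for the real archive of already-downloaded entries.
archive = set()


def should_download(post):
    """Return True and record the key if this post has not been seen (hypothetical helper)."""
    key = BASE_FMT.format(**post)
    if key in archive:
        return False
    archive.add(key)
    return True


# IDs taken from the two example URLs; the offset is assumed.
original = {"id": 167623548569, "offset": 0}
reblog = {"id": 169341068404, "offset": 0}

results = [
    should_download(original),  # first time: downloaded
    should_download(reblog),    # different ID, so a different key: downloaded
    should_download(original),  # same key again: skipped
]
print(results)
```

The reblog is only skipped if its expanded key matches an existing entry, which cannot happen while the original and the reblog carry different IDs.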
The same holds for successive reblogs on another blog... okay, so this seems safe to use. The same goes for re-uploads.
Good, this is fine then. Although Tumblr's ID format still does not make any sense to me...