-
-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Text posts download archive (archive for postprocessors
)
#2421
Comments
The problem with implementing this feature is that files generated by post processors do not go through the same pipeline like regular, downloaded files do, and that
You could move that long format string to an external file and reference it here with |
'archive', 'archive-format', and 'archive-prefix'
The |
Based on the description for
Seems like this does apply to this example here, no? (Working example for such an postprocessor set-up is here): Now, I'll only have to activate this postprocessor config for the Twitter extractor... "twitter":
{
"<Rest of Twitter options here, yada yada>"
"postprocessors": ["modtime", "twitter-thread"]
},
|
That should work, except you will also need a custom |
For "archive": "~/gallery-dl-postprocessors.sqlite",
"archive-format": "twitter_p1_{retweet_id|tweet_id}", I get
UPD. Ok, anyway I can use UPD2. Checked the data base. I don't need to add Here |
Twitter text/metadata extraction postprocessor: "postprocessor":
{
"modtime":
{
"name": "mtime",
"event": "post"
},
"twitter-thread":
{
"name": "metadata",
"directory": "metadata",
"filename": "[{category}] {author[name]}—{date:%Y.%m.%d}—{retweet_id|tweet_id}.html",
"event": "post",
"mtime": true,
"mode": "custom",
"format": "<div id='{retweet_id|tweet_id}'><h4><a href='https://twitter.com/{author[name]}/status/{retweet_id|tweet_id}'>{retweet_id|tweet_id}</a> by <a href='https://twitter.com/{author[name]}'>{author[name]}</a></h4><div class='content'>{content}</div><hr><div>{date:%Y.%m.%d %H:%M:%S}</div><hr></div><br>",
"archive": "E:\\Archives\\Downloads\\gallery-dl\\gldl-archive-twitter-metadata.db",
"archive-format": "{tweet_id}_{retweet_id}"
}
} (This is the default archive-format from the Twitter extractor, minus |
@AlttiRi @Hrxn yeah that should work like that. The post processor order does matter; mtime needs to run before metadata if the generated metadata file should have its modification time set. |
Okay, so.. This does not work: yes? (I'm removing that part from my comment above...) |
I should really update |
I don't think this "spam" is due to
|
Yes, that's what I meant. Thanks! 😄 |
Can confirm, seems to work for me as well! |
I would like to have the archive for text posts.
Currently a metadata
postprocessor
, for example, for Twitter:will create text files every run.
It's very inconvenient if you use
--download-archive
option.I need to delete a lot of tiny duplicate metadata files every time.
I think it should look so (in the second postprocessor):
"archive"
is to specify there to store entries (a SQLite DB file)."archive-format"
is an entry format.The text was updated successfully, but these errors were encountered: