-
Notifications
You must be signed in to change notification settings - Fork 59
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature Request: Allow local text or TSV files instead of Google Spreadsheets #148
Comments
Hi kkarhan, thanks for opening the issue - this is something we may look at and would welcome pull requests to add a TSV feeder. Currently we do support using a command line feeder ( We are planning on working on the documentation of the auto-archiver so hopefully that will help with correctly configuring for different workflows. |
Hey @kkarhan thanks for the clear issue and suggestion. Adding to Galen's answer: for now we only implemented 2 main feeders: GoogleSheets and CommandLine. Internally, that covers all our needs, so this is not something we will not be worked on by us atm (adding Still, we'll leave this issue open for a while in case you or others find it a valuable addition and want to contribute it to the project. |
Thanks so far for the feedback and keeping the issue open. Is there any conclusive documentation re:
Cuz if similar to |
No good documentation on it unfortunately. If you look at the code auto-archiver/src/auto_archiver/feeders/cli_feeder.py Lines 17 to 24 in b166d57
it is essentially expecting a What I'd suggest is you either create a new, very similar feeder, that accepts a filename instead of a csv of hardcoded urls OR actually modify the cli_feeder to have another parameter just for filenames and force at least one of them to be present. This should not be hard to achieve assuming you've been able to run/test the auto-archiver locally on your development environment. |
*this would be preferable to piping giving the current sofware architecture of the library. |
Hi,
as I asked on the fediverse, there's like a not-so insignificant need to allow self-hosting, which admittedly it doesn't do as of now.
--local [filename]
flag (similar to the batch option ofyoutube-dlp
andcurl
) and just use that.I sincerely hope this will help your project going forward and if needed I'll gladly provide samples of sites that one may want to archive.
Yours faithfully,
Kevin Karhan
The text was updated successfully, but these errors were encountered: