Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is Archive.is working? And Docker image updates? #56

Open
ItsNoted opened this issue Aug 30, 2023 · 11 comments
Open

Is Archive.is working? And Docker image updates? #56

ItsNoted opened this issue Aug 30, 2023 · 11 comments
Labels
question Further information is requested

Comments

@ItsNoted
Copy link

It appears for me that when I try to send a page to be archived to archive.is it just gets stuck on the captcha page. I also noticed the Docker image hasn't been updated in over a year. Maybe that is the issue? Any plans to update the image?

image

@ItsNoted ItsNoted changed the title Is Achive.is working? And Docker image updates? Is Archive.is working? And Docker image updates? Aug 30, 2023
@daydiff
Copy link

daydiff commented Sep 3, 2023

Looks like you might be using cloud flare DNS. Archive.is will not work with cloud flare DNS.

@daydiff
Copy link

daydiff commented Sep 3, 2023

@ItsNoted
Copy link
Author

ItsNoted commented Sep 3, 2023

Here's an explanation for the reasons: https://blog.archive.today/post/634795612966125568/when-will-your-site-be-accessible-from-cloudflare

The irony is I cannot even access that blog article.

@daydiff
Copy link

daydiff commented Sep 3, 2023

Here's an explanation for the reasons: https://blog.archive.today/post/634795612966125568/when-will-your-site-be-accessible-from-cloudflare

The irony is I cannot even access that blog article.

Try switching your DNS to the one from your ISP or some other public DNS, e.g. Google public DNS.

@ItsNoted
Copy link
Author

ItsNoted commented Sep 3, 2023

I was able to get to it over a VPN. Weird. But that statement was written 3 years ago.

@jonschoning
Copy link
Owner

Are you hosting espial on a cloud server? I had gotten this quite a bit when hosting on the cloud. My assumption was that IP range was flagged by whomever to trigger captchas. My solution was to utilize the ARCHIVE_SOCKS_PROXY_HOST/ARCHIVE_SOCKS_PROXY_PORT environment variables to proxy the archive requests to one of my machines I run at home which avoids the captcha.

@jonschoning
Copy link
Owner

I will update the docker image; most of the updates previously have just been updates to base libraries, but i'll do another update. I've been a bit busy so haven't really added many features lately, but there are some I'm planning on getting around too based on what's in the github issues

@jonschoning jonschoning added the question Further information is requested label Sep 3, 2023
@daydiff
Copy link

daydiff commented Sep 3, 2023

I was able to get to it over a VPN. Weird. But that statement was written 3 years ago.

Well, the concerns are still valid. I'd say even more so, after kiwifarms drop.

@ItsNoted
Copy link
Author

ItsNoted commented Sep 5, 2023

I'm curious if this could work with Archive Box so we could self-host our own archives and not sorry about a 3rd party to do the job. Having something off-site is nice but there are caveats like this sometimes.

@jonschoning
Copy link
Owner

perhaps, but would have to figure out if it's an "official" integration or make some kind of configuration to support it.. definitely don't want to force users to jump through extra configuration system-setup hoops. maybe it's an additional docker variant.

@srd424
Copy link

srd424 commented Oct 13, 2024

Given the rise of AI slop and the recent attacks on the Internet Archive, having a local archive feels like it could be a good thing. As a really simple first-step hack could the archive service domain be configurable? Then it should be possible to hack something up that talks the same "protocol" as archive.li ..

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

4 participants