Archival Tools - How to archive anything.

I might need to update it for their CAPTCHA, but it worked pretty well for me.
So it doesn't solve their captcha? That's my biggest issue with archive.ph. I always get captchas due to using a VPN and I would like a way to automatically bypass it so batch archiving is possible for me.
 
I think you need an API key for the reCAPTCHA service to do so, but I wasn't going to pay for it.
 
I think you need an API key for the reCAPTCHA service to do so, but I wasn't going to pay for it.
Isn't the API key to implement reCAPTCHA on a site? A few Google searches don't seem to show how an API key can bypass it.
 
Last edited:
Does anyone know how you archive facebook posts and profiles now? When I use archive.ph, it says "Not found (yet?)" after spending an age in the queue. When using ghostarchive, I just get a login page?
 
  • Thunk-Provoking
Reactions: Suikafag
Yes, because you likely need to be logged in to view those profiles. Ghost and Archive aren't logged in since they're just loading the page from their bot browser. Perhaps there's a paid service that uses a proxy with login information?
 
Yes, because you likely need to be logged in to view those profiles. Ghost and Archive aren't logged in since they're just loading the page from their bot browser. Perhaps there's a paid service that uses a proxy with login information?
One of the posts doesn't require you to be logged in the view it but it still doesn't work.

Edit: archive.ph link is showing the content for that post now. Weird. The profile link is private though so I just saved that page as a PDF instead.
 
I know. I archived direct pages on my own due to this:
So many 4chan archives have gone down that I don't trust it. Archive any link from the 4chan archive you post on archive.is
I'm archiving the archives now using 4plebs for /tv/, DesuArchive for /a/, & b4k for /v/.

That still doesn't explain why the pages directly aren't archiving. It just goes on for a while & shows Not Found.
 
Has Archive Today been compromised?

From Tor, archive.today presently redirects to archive.ph (normal) and presents a CAPTCHA page (normal for Tor). Then, whether I start trying to solve the CAPTCHA challenge or not, in a few seconds, the page redirects to https://rurtnews.com/. (not normal). This same behavior is observed on archive.is and other archive.today domains. This same behavior persists across restarts of the Tor browser and changes of the Tor circuit, even when javascript is disabled.

I have not tested this from the clear net nor another VPN.
 
Last edited:
Has Archive Today been compromised?

From Tor, archive.today presently redirects to archive.ph (normal) and presents a CAPTCHA page (normal for Tor). Then, whether I start trying to salve the CAPTCHA challenge or not, in a few seconds, the page redirects to https://rurtnews.com/. (not normal). This same behavior is observed on archive.is and other archive.today domains. This same behavior persists across restarts of the Tor browser and changes of the Tor circuit, even when javascript is disabled.

I have not tested this from the clear net nor another VPN.
Using VPN on clearnet, I get CAPTCHA but no redirect to rurtnews.com. It's also normal if I use Tor with both clearnet and onion domain.

Is something wrong with your computer (malware etc)?
 
Last edited:
  • Like
Reactions: I'm a Silly
Using VPN on clearnet, I get CAPTCHA but no redirect to rurtnews.com. It's also normal if I use Tor with both clearnet and onion domain.

Is something wrong with your computer (malware etc)?
I haven't seen any signs of malware yet, but will be doing a scan.

On clearnet, I get straight to the main archive.ph page with no CAPTCHA, and can search and archive with no issues.

I wonder if something is wrong with the CAPTCHA redirect, then. In chat, @Gog & Magog said he could not get past the CAPTCHA while on Tor, if I understood him right.

EDIT: No scan results yet, but I get the same behavior on my phone. Clearnet: no CAPTCHA, no redirect to rurtnews.com. TOR: CAPTCHA, which redirects to rurtnews.com
 
Last edited:
Back