Archival Tools - How to archive anything.

Also, neither can archive really long videos (like hour-long ones).
Sometimes PreserveTube will work on an hour-long video, maybe up to 90 minutes. It probably has something to do with the video's quality/bitrate. Once you get the message about not having unlimited storage, you can abandon it.
 
I managed to compress 5 hours of khaantent into two files, both around 300MB. When I try to upload them the upload fucks itself (maybe it's my wifi). Could I just host the video somewhere and link it instead? If not, could I tether to my phone's data and upload it that way?
 
Last I checked the file size limit was 200 MB. No idea if that's increased since then, but maybe make them smaller?
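
For anyone re-encoding to fit under a size cap, the bitrate budget is just size divided by duration. A rough sketch, assuming the 200 MB figure above is still current (the post isn't sure), decimal megabytes, and 5 hours split evenly into two 2.5-hour files; the function name and the 2% container overhead are made up for illustration:

```python
def max_total_bitrate_kbps(size_limit_mb: float, duration_s: float,
                           overhead: float = 0.02) -> float:
    """Highest combined audio+video bitrate (kbit/s) that fits the size limit.

    overhead is an assumed fraction reserved for container metadata.
    """
    if duration_s <= 0:
        raise ValueError("duration_s must be positive")
    usable_bits = size_limit_mb * 1_000_000 * 8 * (1 - overhead)
    return usable_bits / duration_s / 1000


if __name__ == "__main__":
    half = 2.5 * 3600  # each file covers 2.5 hours, in seconds
    print(f"budget at 200 MB: {max_total_bitrate_kbps(200, half):.1f} kbit/s")
    print(f"current 300 MB files: {max_total_bitrate_kbps(300, half, 0):.1f} kbit/s")
```

With those assumptions the budget works out to roughly 174 kbit/s per file for audio and video combined, so splitting into more, shorter files may be easier than squeezing the quality further.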
 
Was trying to find an old archived baraag page on archive.today when I noticed this message appears when you look up any baraag link there.
rfBoMsQXAX.webp


All baraag links are blocked off from access, and going to https://www.jugendschutz.net/ brings me to this page:
kHAMo5TJQW.webp
Does anyone know what this German site is and why it has any authority to tell archive.today NOT to host baraag links? (I mean I can imagine why, but why this German site specifically?)
 

About jugendschutz.net​

jugendschutz.net is a major player when it comes to the protection of minors on the internet. The organization combines research and action taken in terms of violations of youth protection laws with raising awareness among providers, parents and young people and informing them about potential risks. The tasks of jugendschutz.net are defined in the Interstate Treaty on the Protection of Minors on the Media (JMStV) as well as the Youth Protection Act (JuSchG).

jugendschutz.net checks internet content for violations of youth protection laws. jugendschutz.net also operates a hotline to which internet users can report illegal and harmful content and regularly searches for potential risks on the internet. The focus is on topics and services specifically attractive to children and young people.

Internationally, jugendschutz.net works closely with the networks INHOPE and INACH. In case of questions on international cooperation, please contact us at the following email address: international(at)jugendschutz.net.

It's federally funded. It seems similar to the UK's Internet Watch Foundation, but more closely related to the German government.
 

Yeah, found out all Baraag links were taken down at the request of NCMEC last year. It seems jugendschutz.net was another proponent of getting baraag links blocked off.
 
PreserveTube seems to be down. Attempting to archive videos returns a 1006 error.
1756364287399.webp
Tried this on multiple videos, all lead to the same result.
 
While half asleep last night I was messing around with the DNS settings to better route that one school shooter video, and accidentally removed the api. subdomain. Fixed it the second I woke up.

Clear your DNS cache, it should work.
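
If you want to check whether a subdomain resolves again after flushing, something like this works as a quick test. It's only a sketch: the hostname in the usage line is a placeholder, not the real API host, and since the lookup goes through your OS resolver a stale cached answer can still show up until the cache is actually cleared.

```python
import socket


def resolves(hostname: str) -> bool:
    """Return True if the OS resolver can find an address for hostname."""
    try:
        socket.getaddrinfo(hostname, None)
        return True
    except socket.gaierror:
        # Name not found, or the resolver is unreachable.
        return False


if __name__ == "__main__":
    # "api.example.com" is a placeholder; substitute the host you care about.
    for host in ("localhost", "api.example.com"):
        print(host, "resolves" if resolves(host) else "does NOT resolve")
```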
 
Is Ghostarchive being fucky for anyone else? I've been trying to archive 2 pages of a site for a bit and nothing has gotten archived. I first tried the bookmarklet, but it just resulted in an infinite loop (which happens a lot with it). I then tried entering the links manually on Ghostarchive itself and went through its captcha. The captcha doesn't appear with the bookmarklet, which I thought might be the problem, but nope. I've been getting infinite loops once again, along with 502 Bad Gateway errors:
3290432883492348.webp
(the black bars are covering PII)
Refreshing on a bad gateway either drops me back on the "archiving in process" page or just keeps looping like usual. I want to know if anyone else is experiencing these issues as well.
 
I've had trouble rearchiving pages on Archive.today with the "/again?url=" trick. It fails and immediately returns to the archived page instead. In the past this might have happened if you tried to rearchive within around 10 minutes of the previous copy, but the last two times I've tried it was more like 5 hours. Maybe it doesn't work at all anymore.

There's an easy fix: Just put an empty fragment ("#"), or a fragment identifier that won't affect what the page displays, at the end of the URL (dynamic pages could detect the identifier with JavaScript and change their contents based on it). Unfortunately, the new copy doesn't share a history with the older copies lacking the #. Example: https://archive.ph/6rDtF
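
If you're feeding a batch of URLs through this trick, the "#" step is easy to script. A minimal sketch (the function name is made up, and how the archive treats the result is just what I've observed above, not documented behavior):

```python
from urllib.parse import urldefrag


def with_empty_fragment(url: str) -> str:
    """Append a bare '#' unless the URL already carries a fragment marker."""
    if "#" in url:
        return url
    return url + "#"


if __name__ == "__main__":
    original = "https://example.com/page?x=1"
    tweaked = with_empty_fragment(original)
    print(tweaked)
    # Stripping the fragment gives back the original URL, so the
    # page that gets requested is the same one.
    print(urldefrag(tweaked).url == original)
```

The fragment is never sent to the server, which is why the page itself shouldn't change unless its own JavaScript reads it.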

Is Ghostarchive being fucky for anyone else? I've been trying to archive 2 pages of a site for a bit and nothing has gotten archived.
...
Refreshing on a bad gateway either just loops on the archiving in process page and brings me back to it or just loops on it like usual. I want to know if anyone else is experiencing these issues as well.
Yes, this has been typical of my experience with it the last few days, and has happened frequently before that. Sometimes you eventually get it to go through and get the archival failure message or a working archive.

Ghostarchive's ability to handle scripting is generally unparalleled, although Wayback Machine can also work. And if you need an Archive.today alternative, you can try Megalodon.jp.
 
Ghost Archive keeps giving 552 whenever I try to archive. From memory and experience, those usually mean the archive's a lost cause and will be stuck in an endless loop of archiving, but lately they've happened on all but a very few occasions. Which is a shame because Ghost Archive is kinda good. Idk what it is, but in my area Megalodon does not even load the page unless I'm on a VPN or using Pale Moon, and when I'm on Pale Moon I can't archive pages, only look at them. If Ghost Archive gets worse (which it has), then it's just archive.today and Wayback Machine.
 
Which is a shame because Ghost Archive is kinda good. Idk what it is, but in my area Megalodon does not even load the page unless I'm on a VPN or using Pale Moon, and when I'm on Pale Moon I can't archive pages, only look at them.
That's too bad. One thing I've had great success with on Megalodon.jp is YouTube video pages (comments) and community posts. It works and is fast, whereas Archive.today might work a fraction of the time, but fail to capture any comments and take 15 minutes of looping to reach a conclusion.

Megalodon.jp is also handling PDFs very well, whereas Ghostarchive might work but is hell for anything now, and Archive.today only saves a screenshot of part of the first page.
 
I'm having problems with accessing an archived video on preservetube which worked last week:
Gamers Nexus documentary on GPU smuggling
Seems to be working for smaller videos, but this one comes up with
s290cp3pZ.webp
Download starts and fails after a few seconds
I've cleared my DNS cache too. No change.

Is anyone able to watch it?
 
If it's not beyond 30 minutes you could try Ghost Archive.

For anyone wanting to archive Twitch profiles, I've found a SafeTwitch instance (similar to Nitter). However, due to how the pages load or something like that, it only works with archive.today. Example
 
A question I have: how do archive.today and the Wayback Machine get past DDoS-protection captchas like Cloudflare or KiwiFlare, while Ghost Archive and Megalodon have a harder time? What's the trick Wayback and archive.today use to get around this? Does anyone know exactly?
 