Archival Tools - How to archive anything.

Nah, they were already talking about it beforehand. Wayback is well known for being gaping vaginal maws.
It would be interesting to know whether they did it:
  1. Because they're faggots
  2. Because of legal threats
  3. Because of false reports from people claiming to be the site owner and wanting it deleted
  4. Because of robots.txt poisoning by faggot Matthew Prince
  5. Because I made a joke about Jason Scott
I agree it's probably #1, but 2-4 are definite possibilities and have happened before due to IA policies.
 
No

Like what? Archive as HTML or as full-page screenshots and send the files as torrents?
Yeah pretty much we have to save it literally on our hard drive as a torrent file and seed it so that way it is archive. I am planning in archiving all of Keffals Vod and screenshoting all of Keffals tweets. Not also that, but Taylor Lorenze tweet. She got her uncle that work for Archive.org to nuke her archive tweets.
 
According to this article (archived), archive.org no longer respects robots.txt.
Maybe it somehow makes errors sometimes, I'd have no clue, but even if that were true I'm sure webmasters can just tell them to fuck off like the trannies did here to a site they don't even own, like any rando can ask for in the archive.org forums. This author even says that.
But I am pretty certain that it still respects robots.txt, even if that article from years ago said otherwise: I've seen sites with decent user bases that have none of their content archived, and I doubt they went out of their way to ask for that.
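If you want to check what a given site's robots.txt actually says about an archive crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The user-agent token `ia_archiver` and the sample rules are assumptions for illustration only. This tells you what the file asks for, not whether archive.org honors it, which is the open question above.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed here.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt: blocks one crawler entirely, everyone else partially.
SAMPLE_ROBOTS = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow: /private/
"""

if __name__ == "__main__":
    for agent in ("ia_archiver", "SomeOtherBot"):
        verdict = is_allowed(SAMPLE_ROBOTS, agent, "https://example.com/page")
        print(agent, "allowed" if verdict else "blocked")
```

To test a live site, fetch its `/robots.txt` yourself and pass the text in. A site that blocks the crawler here but still has snapshots would support the article's claim, and the reverse would support mine.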
 
Torrents are the most resilient content systems on earth. If you thought trannies were takedown-aggressive, they can't compare to the MPAA sending ISP letters and actively lobbying in Congress.

That being said, they still require seeding. Theoretically, blockchain-based storage is even better, but I'm not sure how mature or easy to use this technology is. I would imagine it's fine for text but would not support media as easily.
 
Anyone can request Wayback to exclude their domain/social media from being archived.
I heard that too and might have spread the idea that it's because of her uncle.

They're going after archive.ph now. They're burning the lolbrary of Alexandria because a failed troon porn star got caught with a shitty past.
 
Someone recommended GhostArchive, but when the clearnet site is up, GhostArchive archives the damn DDoS-protection page instead of the actual forum page.
 
Can confirm.
 
May have missed it in my 15 page skim, but does anyone know if there's a script around to rip Fandom wikis? There are quite a few games where the only comprehensive guide is a legacy wiki migrated to Fandom and I don't trust them to not nuke everything nonprofitable or unmaintained eventually.
 
You might have to go the dirty route: find the sitemap for the wiki and run wget on the URLs it lists. Then you have a local version of the site. There are some arguments to wget you might need to keep the stylesheets (I don't know them off the top of my head).
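A rough sketch of that sitemap-then-wget route in Python, in case it helps. The sample sitemap, the `example.org` URLs, and the helper names `sitemap_urls` and `wget_command` are made up for illustration, and I'm assuming the wiki exposes a standard XML sitemap. The wget options used (`--page-requisites`, `--convert-links`, `--adjust-extension`, `--wait`, `--directory-prefix`) are real ones: the first pulls in stylesheets and images, the second rewrites links for local browsing.

```python
import xml.etree.ElementTree as ET


def sitemap_urls(sitemap_xml: str) -> list[str]:
    """Extract every <loc> URL from sitemap XML, ignoring the XML namespace."""
    root = ET.fromstring(sitemap_xml)
    urls = []
    for el in root.iter():
        # Namespaced tags look like "{namespace}loc"; keep only the local name.
        if el.tag.rsplit("}", 1)[-1] == "loc" and el.text:
            urls.append(el.text.strip())
    return urls


def wget_command(url: str, dest: str = "mirror") -> list[str]:
    """Build (but do not run) a wget command that saves a page with its assets."""
    return [
        "wget",
        "--page-requisites",   # also fetch CSS, images, scripts the page needs
        "--convert-links",     # rewrite links so the copy works offline
        "--adjust-extension",  # save pages with an .html extension
        "--wait=1",            # pause between requests to be polite
        f"--directory-prefix={dest}",
        url,
    ]


# Hypothetical sitemap for illustration only.
SAMPLE_SITEMAP = """\
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.org/wiki/Page_One</loc></url>
  <url><loc>https://example.org/wiki/Page_Two</loc></url>
</urlset>
"""

if __name__ == "__main__":
    for page in sitemap_urls(SAMPLE_SITEMAP):
        print(" ".join(wget_command(page)))
```

This only prints the commands. To actually download, hand each list to `subprocess.run`. Large wikis often split the sitemap into an index plus several child sitemaps, so you may need to run `sitemap_urls` on the index first and then on each file it lists.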
 
Anyone having trouble with archive.ph right now? Downforeveryoneorjustme is reporting it as down and I'm getting an eternally spinning loading icon.

Looks like it's back up.


Spoke too soon. Seems to be endlessly loading again.
 
I've noticed that there's been a large backlog with archive.ph in recent days and I can't get archive.st to work either. How difficult would it be to set up another archive site, hosting and DMCA-wise?
 