Preservetube - A Youtube archival site.

can you make it so that when on .net domains when you save a video it doesn't redirect to the .com address
 
You can always tell when it's just one fag mass uploading all his autism kek
1767645108053.png
1767645119986.png

I wonder, do they genuinely think this stuff is worth archiving? Or are they just using the site as a front-end without ads?
I don't see why they'd even want to download this shit, and I've seen much worse garbage being uploaded all the time
 
You can always tell when it's just one fag mass uploading all his autism kek
View attachment 8378692View attachment 8378694
I wonder, do they genuinely think this stuff is worth archiving? Or are they just using the site as a front-end without ads?
I don't see why they'd even want to download this shit, and I've seen much worse garbage being uploaded all the time
don't yuck other's yuck xister or however it goes
 

Some videos are giving a message like:
Upload failed for 2At30OjZJNc: ultralight-s3 Module: Request failed with status 409: Unknown - Conflict, err body: File already exists
And then the file, of course, does not exist. It happened after reloading after several failures, I guess eventually something gets in and causes that "already exists" issue.
 
Last edited:
I don't know if this is new, but PreserveTube just hit me with this message while I was trying to archive a channel.

Screenshot - PreserveTube.jpg

Now I feel bad for all the slop I've archived.
 
While on this topic, what would be everyones thoughts on starting to remove low quality archives?

The current idea I have is having a specific page with videos that are planning to be deleted. Videos would stay on that page for around a week before actually being deleted to ensure if someone has complaints to voice, they can do it.

While this would go against "never delete anything", I don't know how else to handle this.

I'm looking at the biggest files stored, and I see ~60g of just looped videos, such as :
Code:
10 hours of shrek farting (gHWfNHZENGE): 7979.08 MB
[10 HOURS] SUPERSTAR - CG5 (FNAF Security Breach Song Animation) (oSiD2oyewmM): 3176.00 MB
Candyland, But it is low quality [10 HOURS] (XpHx1QNq0KI): 3232.00 MB

This storage could go to a lot better use.
 
How about a limit on the length of videos that can be submitted for archival? I'm thinking somewhere between 1-2 hours, if there are exceptions that need to be archived then someone can email you. I don't think anyone will care about losing 10 hours of shrek farting.
 
I'm surprised 10 hour videos made it through, I never succeeded in archiving something longer than 2 hours.
If you're serious about keeping PreserveTube as a capsule of "internet history", I would implement an age check on the videos. Videos younger than 10 years can't be archived, as an example. If you think 10 is too severe, make it 5 or 3.
 
How about a limit on the length of videos that can be submitted for archival? I'm thinking somewhere between 1-2 hours, if there are exceptions that need to be archived then someone can email you. I don't think anyone will care about losing 10 hours of shrek farting.
I'm pretty sure they already do this if my memory is correct.
 
While on this topic, what would be everyones thoughts on starting to remove low quality archives?

The current idea I have is having a specific page with videos that are planning to be deleted. Videos would stay on that page for around a week before actually being deleted to ensure if someone has complaints to voice, they can do it.

While this would go against "never delete anything", I don't know how else to handle this.

I'm looking at the biggest files stored, and I see ~60g of just looped videos, such as :
Code:
10 hours of shrek farting (gHWfNHZENGE): 7979.08 MB
[10 HOURS] SUPERSTAR - CG5 (FNAF Security Breach Song Animation) (oSiD2oyewmM): 3176.00 MB
Candyland, But it is low quality [10 HOURS] (XpHx1QNq0KI): 3232.00 MB

This storage could go to a lot better use.
one suggestion is to filter out the YouTube generated - Topic channels from being archived. Some people might be using it to bypass region blocks / download music, and it's unlikely that autogenerated - Topic pages (which are created automatically by YouTube's DRM platform) are prone to deletion. probably less than 10% of what's being archived but might be something to consider.

for junk data like the "10 hours of x" is it possible to use some type of scene detector and sample random points in time of the video? idk how to do it, as you'd need to check both unique audio and video, but ffmpeg has some stuff for checking scene change score in video atleast.
 
I'm surprised 10 hour videos made it through, I never succeeded in archiving something longer than 2 hours. Videos younger than 10 years can't be archived, as an example. If you think 10 is too severe, make it 5 or 3.
These videos are from ~2024, before this really became a problem. I don't think limiting the age would help. If anything, it goes against the purpose. The longer the video is on Youtube, the less likely it is for it to be taken down.
I'm pretty sure they already do this if my memory is correct.
Correct. It has been lowered to ~50M, looking to higher it back to 75M soon.
Some people might be using it to bypass region blocks / download music
Music is short, and the topic videos are fairly static from what I remember (usually the album cover), so they compress well.
for junk data like the "10 hours of x" is it possible to use some type of scene detector and sample random points in time of the video?
I don't think that's really worth all the compute.. Looking at the size and title should be enough to weed most of them out.
wait you've added a fucking hcaptcha?
Turnstile can run in the background without any user interaction. You would need to solve the Cloudflare WAF once, and then you can go through the whole limit with little to no interaction required.

Now that I think about it, Turnstile has non-invisable captchas as well, so those might be worth a try later on.

-----

I've added https://preservetube.com/deletion that shows what videos have, or will be, deleted.
 
Back
Top Bottom