Video Game Archival Autism / TCRF / Jul / Sonic Retro / And More - Harvest Troon: Friends of Byuu-Near-al town

Yeah, that's true - relevant to the discussions at hand, the fact that most assets in the domain of art, etc. are handled with contemporary filetypes does make it easier and more standard to interact with them since you don't really need to decrypt them or anything (unless there's something *really* fucky going on), so something like extracting asset files is generally gonna be easier than mapping out and rewriting code. I think I described that bit a little too naively.
I think it's a fair summary. It is a case-by-case thing, but yes: more people and companies using off-the-shelf engines means more standardization across the board. Additionally, there are cases when the spec isn't publicly available.
 
the fact that most assets in the domain of art, etc are handled with contemporary filetypes does make it easier and more standard to interact with them since you don't really need to decrypt them or anything (unless there's something *really* fucky going on), so something like extracting asset files is generally gonna be easier
If only it always worked like that. Oh, King's Bounty for PC98, how I lust for your delicious graphics.
 
Just as a quick recap, here are all the posts about scraping the site from both this thread and Workman's in Stinkditch:
Whoever’s working to make a TCRF clone will probably be very happy to hear that…

In fact, here’s a test export I made of everything in the “fan games” category. (This includes all the templates on the game pages as well as a few subcategory pages.)
We should probably work quick before X finds a way to disable the feature though.
Just finished dumping this entire thing. 1GB file baby!
Did every single fucking category that could contain a substantial amount of articles, like "games with unused characters". I'll also add that it's just an XML file; it didn't save any of the pics.

I was also interested to start this wiki because if anything it's probably fairly lucrative with how big and popular this thing is, but I know @kona2kona was also meaning to start the wiki and I didn't want to compete with him/her.

I don't understand troons sometimes, you had a huge fucking wiki that gets cross-referenced in youtube videos constantly and you're still willing to throw all of that money away over pride flags? Talk about wasted potential.

EDIT: Including an attachment in txt format so it's also available locally, you'll have to rename the file back to xml to use it though.
Nevermind kiwifarts won't cooperate.

EDIT2: Special thanks to @Toilet Paper for the tip. I shoved the file into a gzip and then into a regular zip (because kiwifarms hates most file formats or something) and it made the file only take 96MB, a huge 90% difference!
Also pic unrelated, for some reason kiwifarms won't keep my export.zip if I don't embed anything else alongside it.
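For anyone curious why a text-only XML dump shrinks that much, here is a minimal sketch of the gzip step using only the Python standard library. The function name `gzip_file` and both paths are hypothetical, and the ratio you get depends entirely on the input; this is not the exact procedure used for the attachment above.

```python
import gzip
import os
import shutil


def gzip_file(src_path: str, dst_path: str) -> float:
    """Gzip src_path into dst_path and return the fraction of space saved."""
    # Stream the copy so a ~1GB dump never has to fit in memory at once.
    with open(src_path, "rb") as src, gzip.open(dst_path, "wb", compresslevel=9) as dst:
        shutil.copyfileobj(src, dst)

    original = os.path.getsize(src_path)
    compressed = os.path.getsize(dst_path)
    if original == 0:
        return 0.0  # nothing to save on an empty file
    return 1 - compressed / original
```

Wikitext inside XML is very repetitive (tags, template names, category links), which is why savings in this range are plausible for a dump like this.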
Well that's nice and all, but there's no way that there would be some all-encompassing Category containing most, if not all of the games, right?

View attachment 7493401
Oh... Well, that's the text of 16000 articles
Exported most of consoles, two Nintendo/Gamefreak related leaks and some of popular publishers and developers.
Yeah I had noticed the same, but kept it on the down low specifically because I was prepping for an automated mass archive. Since it's now been posted here, I just went ahead and did most of it manually. This should be, although may not be, everything that is more than 9 pages within a category, or approximately 99.26% of the wiki. Alongside this is a TXT file that contains the categories I missed.
Let's just say I know a guy, who knows a guy, who's on ArchiveTeam, who would LOVE to scrape TCRF.
View attachment 7493963
View attachment 7494031
TCRF had a website backup on 2023-09-04, done by ArchiveTeam. This was completed successfully, but since then the site has been blocking known user agents associated with ArchiveBot. In response, ArchiveTeam is AWARE of tcrf.net, and is currently in the process of archiving TCRF using wikibot and wikiteam tools.

Hail Hydra
The text dump is great, but having a look through it earlier its image file references are to the File:<filename> page, which is an added middleman page. It doesn't refer to the actual file location on the server.
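To illustrate the middleman problem, here is a rough sketch that pulls `File:` names out of a chunk of wikitext and computes where stock MediaWiki would store them. It assumes the wiki uses MediaWiki's default hashed upload layout (`images/<h>/<hh>/<Name>`, keyed on the MD5 of the normalized filename); whether this particular site does is an assumption, not something the dump confirms. The function names are made up for the example, and the regex only catches bracketed `[[File:...]]` / `[[Image:...]]` links, not gallery entries or template parameters.

```python
import hashlib
import re

# Matches [[File:Name.png|...]] and the older [[Image:...]] alias.
FILE_LINK = re.compile(r"\[\[(?:File|Image):([^\]|#]+)", re.IGNORECASE)


def file_titles(wikitext: str) -> list[str]:
    """Return unique, normalized file names referenced in the wikitext."""
    seen: list[str] = []
    for match in FILE_LINK.finditer(wikitext):
        # MediaWiki stores titles with underscores and an uppercase first letter.
        name = match.group(1).strip().replace(" ", "_")
        if not name:
            continue
        name = name[0].upper() + name[1:]
        if name not in seen:
            seen.append(name)
    return seen


def hashed_upload_path(name: str) -> str:
    """Relative path under the ASSUMED default hashed upload directory."""
    digest = hashlib.md5(name.encode("utf-8")).hexdigest()
    return f"images/{digest[0]}/{digest[:2]}/{name}"
```

If the site turns out to use a custom upload layout or a separate file host, the second function is simply wrong for it and the real locations would have to come from somewhere else.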

I took the liberty of crawling through another way and got the links to all the images posted to TCRF from July 2023 (i.e. where the old dumps stop) to this afternoon. There's about 159,000 of them.

The beatings archiving efforts will continue until morale improves.
Xkeepcaca is going to seethe so hard the moment xhe realizes I dumped most of the files (~92'000) listed from the txt document thanks to https://chromewebstore.google.com/detail/tab-save/lkngoeaeclaebmpkgapchgjdbaekacki
Let me know if it works, it's my first time doing this.

And to Xkeeper:
Kiwifarms wins again. + You lost. + You are a failure. + Nobody likes you. + You'll die alone. + You suck at basic site maintenance. + L + ratio + you fell off + get a job + unfunny + you're trans + never liked you anyway + cope + ur allergic to gluten + don't care + cringe ur a kid + literally shut the fuck up + galileo did it better + your avi was made in MS Excel + ur bf is kinda ugly + i have more subscribers + owned + ur a toddler + reverse double take back + u sleep in a different bedroom from your wife + get rekt + i said it better + u smell + copy + who asked + dead game + seethe + ur a coward + stay mad + you main yuumi + aired + you drive a fiat 500 + the hood watches xqc now + yo mama + skibidi toilet ahh aura + you got the rizz of ohio + go get some bitches + go touch grass
View attachment 7507308
<[ANTI-TROON SPACE]
I'm an impatient little fuck, someone make a mirror wiki so I can start contributing to it without any troon hands manhandling it now NOW NOW!
 
Yeah, that's true - relevant to the discussions at hand, the fact that most assets in the domain of art, etc are handled with contemporary filetypes does make it easier and more standard to interact with them since you don't really need to decrypt them or anything (unless there's something *really* fucky going on), so something like extracting asset files is generally gonna be easier than mapping out and rewriting code. I think I described that bit a little too naively.
I wrote and deleted a too-long and spergy response to the guy thinking people who could "decrypt" any given N64 game are rare unicorns, but yes--and importantly while games vary, whatever they're doing has the hardware as an endpoint and those are known and documented, so there's always a way in.
For example, old universal DirectX game asset rippers which were game agnostic because they hooked into the render pipeline to grab meshes/textures on the way to the graphics card. Or the way the NES treats sprite tiles basically like a font, or whatever. Even if the internal format is wacky, you can always pull that thread... but do you need to? Because the game itself will almost always help you out by happily trying to process/display/play whatever data of a known asset/event you change a reference to, and in the end we care more about what comes out of the screen or speakers when it comes to preserving obscure stuff than we do the original bytes.
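To make the "sprite tiles like a font" point concrete, here is a small sketch of decoding one NES pattern-table tile. The standard layout is 16 bytes per 8x8 tile: the first 8 bytes hold the low bit of each pixel, the next 8 hold the high bit, giving a 2-bit palette index per pixel. The function names and the ASCII shading are invented for the example, and the actual colors depend on palette data that lives elsewhere.

```python
def decode_nes_tile(tile: bytes) -> list[list[int]]:
    """Decode one 16-byte NES tile into an 8x8 grid of palette indices (0-3)."""
    if len(tile) != 16:
        raise ValueError("an NES tile is exactly 16 bytes")
    rows = []
    for y in range(8):
        lo, hi = tile[y], tile[y + 8]  # bit plane 0 and bit plane 1 for this row
        row = []
        for x in range(8):
            shift = 7 - x  # leftmost pixel is the most significant bit
            row.append(((lo >> shift) & 1) | (((hi >> shift) & 1) << 1))
        rows.append(row)
    return rows


def render_ascii(pixels: list[list[int]]) -> str:
    """Crude preview: map palette indices 0-3 to characters."""
    shades = " .+#"
    return "\n".join("".join(shades[p] for p in row) for row in pixels)
```

Running this over every 16-byte chunk of a CHR bank dumps the whole tile sheet, the same way you would dump every glyph in a bitmap font.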

Time investment's really the thing, like the process of mapping out what assets are never referenced (or worse, referenced but unseen because they're in an inaccessible developer room or hidden behind an off-by-one mistake or something), so you need to be both invested and spergy enough to care. Any decent programmer could figure out how to patch the incomplete format of those proto Zelda levels/models for example, but there aren't many who give a shit about doing that for Barbie Horse Dungeon 64 and would also notice the jewel on this one proto saddle being green instead of blue. (Which is not actually a huge problem because nobody else gives a shit either, right? The returns drastically diminish beyond the work people merrily do for free anyway.)

An asset's format is in fact a language insofar as the spec (or whatever they hacked together ad-hoc) can be described as a formal grammar.
go to bed chomsky
 
Because the game itself will almost always help you out by happily trying to process/display/play whatever data of a known asset/event you change a reference to
Based, living off the land. On the prior point about rippers, many emulators also come with memory viewers, texture displays, all sorts of functions useful for asset ripping. (Obviously, that's there because someone already did the work of decoding the data formats used by the original hardware.)

I also think this shit isn't hard, but it requires autism to care.
 
One of XKeeper's xisters tried to start some shit on Hacker News and epic failed.
View attachment 7550811
Not only is the thread full of tumbleweeds, some normie multi-dunks on him by complete accident.
This is how you should respond to any tech queries from Xkeeper or his tranny jannies. Completely ignore the question and point out how ridiculous Workman is.
 
I'm in the discord, so I'll do a bit of poking. I think if we had a bunch of people just quietly pulling content from the channels it would help massively.
I could set up a bot to auto scrape messages and save them in a database, same as I did for bossman's discord server. If you want, I could send you the script.
 
Hello, I've created a wiki inspired by some of the ideas in this thread. It's not a gaming cut content wiki, but a prerelease/cut content wiki for any kind of media and it's called the Development History Wiki. I'm working on an article about The Thief and The Cobbler as my first project.
The url is https://www.devhistory.wiki.
If anyone has any questions you can reach me at my talk page at User:Admin since I'm probably not gonna use this account again.
Cheers. Also, feel free to import any stuff from TCRF.
 
Import.webp

Importing pages requires admin privileges. Can you test out the site's import functionality to see how well it works?
 