Shadow Libraries - Anna's Archive, Library Genesis, Sci-Hub, Z-Library, and more

Schizo take, I legit suspect AA to be a corpo operation now. It was created in 2022, after the initial AI hype and when most shit was already scraped. It has a really good sorting system (ISBN, book type, etc). It's just too convenient a source for ebooks, many of them already OCR'd or easy to feed into a scanner, and then you just start training the model. All this without dealing with a bureaucratic copyright licence with a single major publisher. 30 companies just happened to use it? No, I think they created it. The LLM section on AA is essentially a service offer.

If we see next-gen audio models within the next few months, it just has to be some type of corpo op. Such an odd thing to start archiving all of a sudden.
 
1766428390525.png
46% of the people who have read this article are retarded apparently.
 
TorrentFreak: Anna’s Archive Loses .Org Domain After Surprise Suspension (archive) (mega)
Popular shadow library Anna's Archive has lost control over its main domain name. Annas-archive.org was suspended and put on serverHold status, which is an action that's typically taken by the domain name registry. The site's operator doesn't believe that the actions are related to its recently announced Spotify backup and stresses that the site remains accessible through alternative domains.
https://www.reddit.com/r/Annas_Archive/comments/1q3zxlb/message_from_anna_were_fine/ (archive)
The .org domain apparently has been suspended. Our other domains work fine, and we've added some more. We recommend checking our Wikipedia page for the latest domains.

This unfortunately happens to shadow libraries on a regular basis. We don't believe this has to do with our Spotify backup.

To keep our operations running properly we're always looking for donations. We're doing a fundraiser this month where you get double downloads for the entire duration of your membership.

Thanks for your continued support.
Ars Technica: Anna’s Archive loses .org domain, says suspension likely unrelated to Spotify piracy (archive)

I did notice .ORG was down yesterday and confirmed the downtime with SLUM, but I figured it was temporary.

I'd consider .SE the new primary domain, but there's also .LI, plus .PM and .IN, which were newly added to the Wikipedia article, FAQ, and sidebar.

I've rearchived the whole blog on .SE. Ghostarchive to be filled in later because their captcha verification isn't working.
 
Schizo take, I legit suspect AA to be a corpo operation now. It was created in 2022, after the initial AI hype and when most shit was already scraped. It has a really good sorting system (ISBN, book type, etc). It's just too convenient a source for ebooks, many of them already OCR'd or easy to feed into a scanner, and then you just start training the model. All this without dealing with a bureaucratic copyright licence with a single major publisher. 30 companies just happened to use it? No, I think they created it. The LLM section on AA is essentially a service offer.

If we see next-gen audio models within the next few months, it just has to be some type of corpo op. Such an odd thing to start archiving all of a sudden.
Ars Technica: Anna’s Archive loses .org domain, says suspension likely unrelated to Spotify piracy (archive)

Your take is shared, in part, by some of the Ars commenters in the story above. Some of the anti-AI spergs are taking notice:
anna-intrusion.webp saanaito.webp all-piracy.webp
Piracy IS preservation, faggot.

The 30 companies are supposedly mostly Chinese. I believe the AA people have correctly identified that they can get some significant benefits for their mission by partnering with the people who care the least about intellectual property. And some of the fruits of that partnership are seen in the recent massive expansion of Chinese content. Data/metadata is just as valuable to AA as funding:

https://annas-archive.se/llm (archive) (mega)
This is enterprise-level access that we can provide for donations in the range of tens of thousands USD. We’re also willing to trade this for high-quality collections that we don’t have yet.

We can refund you if you’re able to provide us with enrichment of our data, such as:
  • OCR
  • Removing overlap (deduplication)
  • Text and metadata extraction

Rather than AA being a "corpo op", I think they are sincere about their mission, and their goals happen to align well with AI/LLM companies hoovering up as much data as possible:

https://annas-archive.se/faq (archive) (mega)
Anna’s Archive is a non-profit project with two goals:
  1. Preservation: Backing up all knowledge and culture of humanity.
  2. Access: Making this knowledge and culture available to anyone in the world.

They are openly stating as much on the blog:

Anna's Archive Blog (January 31, 2025): Copyright reform is necessary for national security (archive) (ghost) (mega)
My team and I are ideologues. We believe that preserving and hosting these files is morally right. Libraries around the world are seeing funding cuts, and we can’t trust humanity’s heritage to corporations either.

Then came AI. Virtually all major companies building LLMs contacted us to train on our data. Most (but not all!) US-based companies reconsidered once they realized the illegal nature of our work. By contrast, Chinese firms have enthusiastically embraced our collection, apparently untroubled by its legality. This is notable given China’s role as a signatory to nearly all major international copyright treaties.

Zuckbook has been caught with their hands in the cookie jar, but is just a leech, offering no benefits:

TorrentFreak: ‘Meta Torrented over 81 TB of Data Through Anna’s Archive, Despite Few Seeders’ (archive) (mega)

It should go without saying that whatever the true purpose of AA is, I support Total Copyright Death, so it doesn't matter to me much. Chinese LLM companies have also been providing some good AI models that can be run locally. The rising tide lifts all boats.

Here's a Hacker News discussion for the TorrentFreak .org suspension article:
https://news.ycombinator.com/item?id=46497164 (archive) (mega)
anna-foot-soldier.webp

Noticed this tard guide in search results: https://annasarchive.info/ (archive) (mega)
 
TorrentFreak: U.S. Court Order Against Anna’s Archive Spells More Trouble for the Site (archive) (ghost) (mega)
Anna’s Archive is having a rough month. Following mysterious .org and .se domain suspensions, the shadow library is now facing a permanent injunction from a federal court. After dropping a multi-million damages claim, OCLC won a default judgment and permanent injunction against Anna's Archive, which it plans to enforce against hosting companies.
A few days ago, the domain trouble continued when Anna’s Archive’s .SE domain suddenly became unresponsive after being operational for years. For this domain, the registrar took action, as the site was put on clientHold. While we tried to get additional information from the registrar, our requests remained unanswered.
anna-conclude.png.webp

This might be the weapon being used to bludgeon domain registrars.
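
For anyone wondering why the two suspensions look different: EPP status codes prefixed with "server" are set by the registry, while those prefixed with "client" are set by the registrar, which matches what TorrentFreak reports for .org and .se. A minimal Python sketch of that rule, assuming you already have the status string from a WHOIS lookup (`hold_actor` is a made-up helper name for illustration, not part of any WHOIS tool):

```python
def hold_actor(epp_status: str) -> str:
    """Guess who set an EPP status code, based only on its prefix.

    Assumption: input looks like "serverHold" or a WHOIS-style line such as
    "clientHold https://icann.org/epp#clientHold".
    """
    parts = epp_status.split()
    if not parts:
        return "unknown"
    status = parts[0]
    if status.startswith("server"):
        return "registry"   # e.g. the .org registry
    if status.startswith("client"):
        return "registrar"  # e.g. whoever the .se domain was registered through
    return "unknown"


# The two statuses reported in the articles quoted in this thread.
for domain, status in [("annas-archive.org", "serverHold"),
                       ("annas-archive.se", "clientHold")]:
    print(f"{domain}: {status} -> set by the {hold_actor(status)}")
```

This only tells you which party flipped the switch, not why they did it.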

Ars Technica: Judge orders Anna’s Archive to delete scraped data; no one thinks it will comply (archive) (mega)
 
Am I being retarded or does annas-archive still not have an .onion address? I'm sure every normie thing I want will still be findable even if they cark it but it's so convenient to have a unified, reliable source for novels.
No. I'm sure they will launch one if they feel the need to. They are barely playing the whack-a-mole game at this point. Two domains is nothing.


The current domains have always been quickly edited into the Wikipedia article sidebar.

https://old.reddit.com/r/Annas_Archive/comments/1qdrc1s/soo_who_is_anna/ (archive)
anarchivism.webp

https://old.reddit.com/r/Annas_Archive/comments/1qdhov1/petition_to_annas_archive_to_publish/ (archive)
anna-public-key.webp

https://old.reddit.com/r/Annas_Archive/comments/1qd5hx0/spotify_torrents_down/ (archive)
tdxxpxl4yedg1.png
 
@snowdunes
TorrentFreak: NVIDIA Contacted Anna’s Archive to Secure Access to Millions of Pirated Books (archive) (ghost) (mega)
NVIDIA executives allegedly authorized the use of millions of pirated books from Anna's Archive to fuel its AI training. In an expanded class-action lawsuit that cites internal NVIDIA documents, several book authors claim that the trillion-dollar company directly reached out to Anna's Archive, seeking high-speed access to the shadow library data.
competat.png.webp allegdata.png.webp green-lght.png.webp

You can also discuss this in A&N: https://kiwifarms.st/threads/nvidia...e-access-to-millions-of-pirated-books.237325/
 
This is very bad. While I am nearly certain that most uses of copyrighted material as training data would be fair use if the material was legally obtained and not subject to some restrictive contract, these books were clearly obtained illegally. The infringement itself would be subject to liability, and it is such massive, wholesale infringement that it could justify the full $150,000 statutory maximum per infringed work.

While obtaining material illegally does not completely foreclose a fair use defense, it is certainly a factor that would be taken into account.

I think using illegally obtained material for training is at best sketchy as fuck and probably well over the line separating fair use from unprotected infringement.

In short, this is a bad idea on the part of everyone involved, and extreme enough it may even cross the line into criminal commercial infringement should some creative federal prosecutor feel up to the task.
 
In short, this is a bad idea on the part of everyone involved, and extreme enough it may even cross the line into criminal commercial infringement should some creative federal prosecutor feel up to the task.
It's bad, but the giant pile of money involved in this AI bubble tells me everyone's going to just quietly pretend it didn't happen, Anna's will get a mysterious reprieve from legal crackdowns on piracy, the plaintiff here will get a pittance, and nothing of substance will actually change.
 
TorrentFreak: Unsealed: Spotify Lawsuit Triggered Anna’s Archive Domain Name Suspensions (archive) (ghost) (mega) (wayback) (A&N thread) (A&N Cloudflare thread)
Spotify and several major record labels, including UMG, Sony, and Warner, have taken legal action against the unknown operators of Anna's Archive. The action follows the shadow library's announcement that it would release hundreds of terabytes of scraped Spotify data. Unsealed documents reveal that the court already issued a broad preliminary injunction, ordering hosting companies, Cloudflare, and domain name services, to take action.

Mystery solved
 
TorrentFreak: Spotify’s Crackdown on Anna’s Archive Domains Hits a Jurisdiction Snag (archive) (mega)
The music industry’s legal offensive against Anna's Archive has hit a jurisdictional roadblock. While U.S. court orders have successfully suspended the shadow library's .ORG, .IN, and .SE domains, not all foreign intermediaries may automatically take action. This includes the domain name privacy company Njalla, as well as the Switzerland-based Switch Foundation.
At this stage, the music industry doesn’t appear to know who is behind the site. The RIAA previously discovered that “Cyberdyne S.A.” was the registrant for the .se domain. However, that trace doesn’t appear to lead anywhere either.

“‘Cyberdyne’ is also the name of the fictional technology company in the ‘Terminator’ movie series behind the ‘Skynet’ artificial intelligence network that achieved super intelligence and self-awareness, leading to nuclear devastation,” RIAA’s content protection chief informed the court, noting that this may be a fabricated name.
 
TorrentFreak: Danish Students Face Legal Action and Fines Over Textbook Piracy (archive) (mega)
After "awareness" campaigns that failed to move the needle for years, Denmark’s leading anti-piracy group is shifting to a more aggressive litigation strategy. The Rights Alliance confirmed it will begin filing civil lawsuits against individual students who are caught sharing even a single digital textbook. The anti-piracy group prefers not to mention the targeted platforms but says it uses undercover monitoring of private groups to gather evidence.
My guess is they're going after Telegram and/or Discord groups.
 
The whack-a-mole continues. What asshole tricked the content cartel into pissing into the wind with this kind of stupidity again after the fucking Pirate Bay failed to die after twenty fucking years and the imprisonment of two of its operators?

There are infinite domain names. Anna's Archive can literally keep doing that forever. Every fucking time the cartel wants one shut down, they have to piss away more money in court to get the order issued. How have they still not figured this out? It's been decades now for fuck's sake.
 
Every fucking time the cartel wants one shut down, they have to piss away more money in court to get the order issued.
The preliminary injunction has been weaponized to a point where they likely only need to send a demand letter in most cases, and they've apparently dodged AFNIC's .PM noncompliance by going to Openprovider instead.

But at some point they hit a brick wall, such as with .LI (so far). And of course, AA can keep it up indefinitely, and there's even a splinter site using the codebase: welib.org.

https://annas-archive.li/faq (archive)

What are your official mirrors?

Currently our official mirrors are:
- annas-archive.li
- annas-archive.gl

Not recommended mirrors (don’t contribute back)
- welib.org (NOT RECOMMENDED): They have forked our codebase and files. They haven’t released their new code as open source, nor have they shared any new collections.

Fraudulent
- annas-archive.su (DO NOT USE): Uses our name without permission. Steals your donations.
 