217
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 19 Jan 2026
217 points (100.0% liked)
Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ
66324 readers
488 users here now
⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.
Rules • Full Version
1. Posts must be related to the discussion of digital piracy
2. Don't request invites, trade, sell, or self-promote
3. Don't request or link to specific pirated titles, including DMs
4. Don't submit low-quality posts, be entitled, or harass others
Loot, Pillage, & Plunder
📜 c/Piracy Wiki (Community Edition):
🏴☠️ Other communities
FUCK ADOBE!
Torrenting/P2P:
- !seedboxes@lemmy.dbzer0.com
- !trackers@lemmy.dbzer0.com
- !qbittorrent@lemmy.dbzer0.com
- !libretorrent@lemmy.dbzer0.com
- !soulseek@lemmy.dbzer0.com
Gaming:
- !steamdeckpirates@lemmy.dbzer0.com
- !newyuzupiracy@lemmy.dbzer0.com
- !switchpirates@lemmy.dbzer0.com
- !3dspiracy@lemmy.dbzer0.com
- !retropirates@lemmy.dbzer0.com
💰 Please help cover server costs.
![]() |
![]() |
|---|---|
| Ko-fi | Liberapay |
founded 2 years ago
MODERATORS



Maybe I'm missing something, but I'm confused how they can promise "high speed access" to the data while also claiming:
Do they have the data or do they not have it?
They also claim to be able to do things like extract text and deduplicate the data... That seems to suggest a significant amount of storage and compute power for a non-profit that has only been around for ~3 years.
I find this entire thing fishy as fuck. Call me a conspiracy theorist, but I'm not convinced that the entire existence of this data theft operation isn't simply to be a illicit data broker for AI companies. And now their is direct evidence tying both Anthropic and NVidia to them.
i think they mean they'll provide direct access to data hosted by "third party"s (torrents?), without the captchas and throttling/rate limiting present when normally using the annas archive website
they're asking for text extraction and dedup in exchange for providing datasets. at least publicly they claim this whole project is aimed at data preservation and wide access.. they're mostly aggregating/collecting data from other shadow libraries and even if they have malicious(?) intent, i'd say they're a net positive since their code and datas are mostly(?) open sourced.
Nono, they need deduplication and text extracts in exchange for access.