420
submitted 3 days ago* (last edited 3 days ago) by geneva_convenience@lemmy.ml to c/fediverse@lemmy.ml

Dropsitenews published a list of websites Facebook uses to train its AI on. Multiple Lemmy instances are on the list as noticed by user BlueAEther

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

you are viewing a single comment's thread
view the rest of the comments
[-] halcyoncmdr@lemmy.world 13 points 3 days ago

They'd have to host it from somewhere not related to Meta in any way, otherwise someone on the fediverse would find that link and spread the word, and it would be blocked the exact same way. It only takes one person making that connection, Meta knows they're hated.

[-] Clent@lemmy.dbzer0.com 6 points 2 days ago

Mega corps do that all the time. They have shell corporations for the exact purpose of obfuscating their future intentions.

[-] kn33@lemmy.world 6 points 3 days ago

They could stick it in Azure or AWS or something.

[-] halcyoncmdr@lemmy.world 5 points 3 days ago

Or they could just use their existing scrapers and try to brute force it. Meta isn't exactly known for being sneaky.

this post was submitted on 08 Aug 2025
420 points (99.5% liked)

Fediverse

21127 readers
30 users here now

A community dedicated to fediverse news and discussion.

Fediverse is a portmanteau of "federation" and "universe".

Getting started on Fediverse;

founded 5 years ago
MODERATORS