submitted 2 days ago* (last edited 2 days ago) by geneva_convenience@lemmy.ml to c/privacy@lemmy.ml

Dropsitenews published a list of websites Facebook uses to train its AI. Multiple Lemmy instances are on the list, as noticed by user BlueAEther.

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

[-] qaz@lemmy.world 9 points 1 day ago

Does anyone have a link to the .txt file? I can't grep the PDF.

[-] bdonvr@thelemmy.club 22 points 1 day ago

By nature of federation it really trains on basically all Lemmy data

[-] ferric_carcinization@lemmy.ml 10 points 1 day ago

And multiple times, up to once per instance. Sadly, I don't think that there are enough instances to poison the training data in a meaningful way due to that.

[-] mugita_sokiovt@discuss.online 14 points 2 days ago* (last edited 2 days ago)

At least Discuss.Online has Anubis to prevent this nonsense.

[-] brucethemoose@lemmy.world 16 points 2 days ago

My impression was that Meta's backing out of Llama LLMs anyway, to focus on “products”

[-] WalnutLum@lemmy.ml 7 points 2 days ago

That's good and also somewhat disappointing as they were the first to release the weights and mechanism to run them as open weights.

A lot of fully open source (and "ethically trained", depending on your opinion of that entire idea) models still use major portions of the code they open sourced.

A lot of relatively "good" LLM models run on top of Llama.cpp

[-] brucethemoose@lemmy.world 5 points 2 days ago* (last edited 2 days ago)

Meta pays for PyTorch development as well!

Llama.cpp will be fine of course, it technically has nothing to do with Meta.

But yeah, it’s mostly disappointing IMO…

And kinda stupid. These are literally experimental models; they release one experiment with mixed results, and admittedly catastrophic marketing for it, and Zuck pulls the rug?

[-] burgerchurgarr@lemmus.org 11 points 2 days ago

Enjoy my dong zucc, fucking lizard

[-] GammaGames@beehaw.org 9 points 2 days ago

😮‍💨

[-] sun@slrpnk.net 4 points 2 days ago

Anything published on the fediverse is something anyone can get their hands on.

[-] tdawg@lemmy.world 2 points 2 days ago

literally why

this post was submitted on 08 Aug 2025
410 points (99.5% liked)
