submitted 5 days ago* (last edited 5 days ago) by geneva_convenience@lemmy.ml to c/fediverse@lemmy.ml

Dropsitenews published a list of websites Facebook uses to train its AI. Multiple Lemmy instances are on the list, as noticed by user BlueAEther.

Hexbear is on there too. Also Facebook is very interested in people uploading their massive dongs to lemmynsfw.

Full article here.

Link to the full leaked list download: Meta leaked list pdf

[-] flamingos@feddit.uk 9 points 5 days ago

There's like half a dozen feddits and somehow feddit.uk is the only one to make it onto this?

Here are the instances linked with feddit.uk that appear in the list:

List of instances

beehaw.org
furry.engineer
ibe.social
fediworld.de
framatube.org
trailers.ddigest.com
nrw.social
lemmynsfw.com
video.hardlimit.com
digitalcourage.social
xn--baw-joa.social
tube.kockatoo.org
equestria.social
wisskomm.social
social.anoxinon.de
freiburg.social
toobnix.org
toot.bike
mstdn.lalafell.org
peertube.linuxrocks.online
social.rebellion.global
mastodon.cipherbliss.com
social.sdf.org
corteximplant.com
typo.social
www.404media.co
mastodon.ml
video.liberta.vip
tilvids.com
todon.eu
hessen.social
digipres.club
shigusegubu.club
mastodon.me.uk
zdf.social
mastodon.sdf.org
spore.social
kolektiva.media
gruene.social
share.tube
nso.group
mastouille.fr
masto.es
vivaldi.com
literatur.social
mstdn.mx
kirche.social
mastodon.hams.social
federation.network
lile.cl
todon.nl
betweenthelions.link
ipv6.social
linuxrocks.online
peertube.otakufarms.com
pawb.social
mastodon-belgium.be
jasette.facil.services
machteburch.social
mastodont.cat
mastodon.eus
eupolicy.social
social.bau-ha.us
toot.berlin
amicale.net
hexbear.net
mastodon.bida.im
reddthat.com
shelter.moe
mastodon.nl
dju.social
bonn.social
mstdn.chrisalemany.ca
social.sciences.re
tldr.nettime.org
lemy.lol
climatejustice.social
rollenspiel.social
mastodon.org.uk
social.kyiv.dcomm.net.ua
pouet.chapril.org
ecoevo.social
social.politicaconciencia.org
darmstadt.social
peertube.tv
lemmus.org
libretooth.gr
hackers.town
tooter.social
anarchism.space
diode.zone
video.infosec.exchange
mastodon.thirring.org
aussie.zone
social.bund.de
apobangpo.space
shitpost.cloud
berlin.social
toot.aquilenet.fr
social.beachcom.org
lemmygrad.ml
mastodon.radio
nerdculture.de
programming.dev
decayable.ink
kafeneio.social
functional.cafe
things.uk
fuzzies.wtf
diaspodon.fr
dalek.zone
sunbeam.city
tooting.ch
fediscience.org
mastodon.tetaneutral.net
social.librem.one
im-in.space
lemmy.sdf.org
legal.social
post.lurk.org
mastodon.uy
noc.social
tube.pol.social
lemmy.ml
don.linxx.net
infosec.pub
kolektiva.social
masto.bike
furries.club
zhub.link
lemmy.world
openbiblio.social
mastodon.zaclys.com
mamot.fr
clacks.link
discuss.tchncs.de
cyberplace.social
graz.social
pl.kitsunemimi.club
mastodonczech.cz
masto.nobigtech.es
hostux.social
pawb.fun
mastodon.trueten.de
norden.social
systemli.social
mander.xyz
ciberlandia.pt
woem.men
sopuli.xyz
lemmy.ca

[-] addie@feddit.uk 3 points 4 days ago

Number one! Number one! Woo!

[-] poVoq@slrpnk.net 7 points 5 days ago

Given that we used to see lots of Meta scraping a while back on our instance and had to implement Anubis as a result, it is interesting to see that slrpnk.net doesn't seem to be on this list (anymore).

this post was submitted on 08 Aug 2025
429 points (99.5% liked)
