386
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
this post was submitted on 08 Aug 2025
386 points (99.7% liked)
Fediverse
21089 readers
636 users here now
A community dedicated to fediverse news and discussion.
Fediverse is a portmanteau of "federation" and "universe".
Getting started on Fediverse;
- What is the fediverse?
- Fediverse Platforms
- How to run your own community
founded 5 years ago
MODERATORS
Check out the robots.txt on any Lemmy instance....
If they have a brain, and they do have the experience from Threads, they don't need to scrape Lemmy. They can just set up a shell instance, subscribe to Lemmy communities, and then use federation to get their data for free. That doesn't use robots.txt at all regardless.
Linked article in the body suggests that likely wouldn't have made a difference anyway
Yeah ive seen the argument in blog posts that since they are not search engines they dont need to respect robots.txt. Its really stupid.
"No no guys you don't understand, robots.txt actually means just search engines, it totally doesn't imply all automated systems!!!"
Scrapers ignore it
Thieves can smash a window to get into my house but I still lock my doors.
This is more like being there when they come to steal and you ask them to ignore some rooms please.