662
Reddit blocking all major search engines, except Google
(readwrite.com)
This is a most excellent place for technology news and articles.
this is just going to cause indexers to ignore robots.txt
"We always obey the robots.txt"
Rate limiting could “fix” that unfortunately.
They're likely blocking user agents too, which I think also doesn't have legal enforcement (as in DuckDuckGo can just use "Google" unless they said otherwise.
LinkedIn tried blocking scraping that way but as long as the scraping isn't burdensome it's basically legal but you can still be bound by TOS and civil claims
https://natlawreview.com/article/hiq-and-linkedin-reach-proposed-settlement-landmark-scraping-case