589
submitted 2 months ago* (last edited 2 months ago) by Tea@programming.dev to c/technology@lemmy.world
you are viewing a single comment's thread
view the rest of the comments
[-] catloaf@lemm.ee 19 points 2 months ago

An HTTP request is a request. Servers are free to rate limit or deny access

[-] FaceDeer@fedia.io 18 points 2 months ago

And Wikimedia, in particular, is all about publishing data under open licenses. They want the data to be downloaded and used by others. That's what it's for.

[-] LostXOR@fedia.io 4 points 2 months ago

Even so I think it would be totally reasonable for them to block web scrapers, as they provide better ways to download all their data.

[-] FaceDeer@fedia.io 7 points 2 months ago

At the root of this comment chain is a proposal to have laws passed about this.

People can set up their web servers however they like. It's on them to do that, it's their web servers. I don't think there should be legislation about whether you're allowed to issue perfectly ordinary HTTP requests to a public server, let the server decide how to respond to them.

[-] taladar@sh.itjust.works 12 points 2 months ago

Rate limiting in itself requires resources that are not always available. For one thing you can only rate limit individuals you can identify so you need to keep data about past requests in memory and attach counters to them and even then that won't help if the requests come from IPs that are easily changed.

this post was submitted on 02 Apr 2025
589 points (99.3% liked)

Technology

71399 readers
2623 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS