CarbonScored@hexbear.net 12 points 5 months ago* (last edited 5 months ago)

The real beauty of this is that he's released it as code you can deploy on your own sites. It's not just a single website he owns that will quickly be blacklisted; it's a tarpit you can put anywhere.

I also liked these snippets from their site:

"Lastly, optional Markov-babble can be added to the pages, to give the crawlers something to scrape up and train their LLMs on, hopefully accelerating model collapse."

"Let's say you've got horsepower and bandwidth to burn, and just want to see these AI models burn. Nepenthes has what you need:

"Don't make any attempt to block crawlers with the IP stats. Put the delay times as low as you are comfortable with. Train a big Markov corpus and leave the Markov module enabled, set the maximum babble size to something big. In short, let them suck down as much bullshit as they have diskspace for and choke on it."
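
For anyone wondering what "Markov-babble" means in practice, here's a rough sketch in Python. To be clear, this is not Nepenthes' own code and doesn't use its config options; the names `train`, `babble`, `order` and `max_words` are made up for illustration, with `max_words` standing in for the "maximum babble size" idea from the quote.

```python
import random
from collections import defaultdict


def train(corpus: str, order: int = 2) -> dict:
    """Map each run of `order` consecutive words to the words seen right after it."""
    words = corpus.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        chain[tuple(words[i:i + order])].append(words[i + order])
    return dict(chain)


def babble(chain: dict, max_words: int, seed=None) -> str:
    """Random-walk the chain to emit plausible-looking nonsense, capped at max_words."""
    if not chain or max_words <= 0:
        return ""
    rng = random.Random(seed)
    states = list(chain)
    state = rng.choice(states)
    out = list(state)
    while len(out) < max_words:
        followers = chain.get(state)
        if not followers:
            # Dead end (this word run only appeared at the end of the corpus):
            # jump to a random state and keep going.
            state = rng.choice(states)
            out.extend(state)
            continue
        nxt = rng.choice(followers)
        out.append(nxt)
        state = (*state[1:], nxt)
    return " ".join(out[:max_words])


if __name__ == "__main__":
    sample = (
        "the crawler follows the link and the link leads to another page "
        "and the page leads to another link and the crawler never leaves"
    )
    print(babble(train(sample), max_words=40))
```

The bigger the training corpus, the less repetitive the output looks, which is presumably why the docs suggest training a big one.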

this post was submitted on 27 Jan 2025
128 points (98.5% liked)
