The creator of an open source project that scraped the internet to track the changing popularity of words in human language says they are sunsetting it because generative AI spam has poisoned the internet to the point where the project no longer has any utility.

Wordfreq is a program that tracked the ever-changing ways people used more than 40 different languages by analyzing millions of sources across Wikipedia, movie and TV subtitles, news articles, books, websites, Twitter, and Reddit. The system could be used to analyze changing language habits as slang and popular culture changed and language evolved, and was a resource for academics who study such things. In a note on the project’s GitHub, creator Robyn Speer wrote that the project “will not be updated anymore.”

SirQuackTheDuck@lemmy.world 1 point 2 months ago (last edited 2 months ago)

Imagine being an author whose sole income is writing books.

Here comes an AI that ~~stole~~ indexed your work and is asked by a customer of OpenAI to summarise your books. It does so perfectly, and the requester can use the results freely, since they think the text is AI-generated and doesn't require attribution.

You receive nothing in return.

Good luck making a living.

Edit: changed "stole" to "indexed", added edit note

Gorillazrule@lemmy.dbzer0.com 2 points 2 months ago

This is such a nothing argument. If all you're talking about is a summary of a book, people have been able to get that since long before AI. I can go to the Wikipedia entry for any book right now and look at a plot summary. The author does not get paid when I read the summary on Wikipedia. There are numerous other sites where you can find summaries of books. And if you're asking an AI for a summary of a specific book by a specific author, what attribution would you like to see? The user already knows the source because they're specifically asking for a summary of that source.

A bigger concern would be the AI reproducing your works and using them in responses.

this post was submitted on 19 Sep 2024
445 points (99.6% liked)

Technology

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below are allowed; to ask whether your bot can be added, please contact us.
  9. Check for duplicates before posting; duplicates may be removed.

Approved Bots


founded 1 year ago