459
        you are viewing a single comment's thread
view the rest of the comments
    
  
  
    view the rest of the comments
        this post was submitted on 09 Jul 2023
        
  
      
  
      459 points (96.7% liked)
      Technology
    76347 readers
  
      
      742 users here now
  
      This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related news or articles.
- Be excellent to each other!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
- Check for duplicates before posting, duplicates may be removed
- Accounts 7 days and younger will have their posts automatically removed.
Approved Bots
        founded 2 years ago
      
  
  
      MODERATORS
      
  
    
Scraping the web is legal and training AI on data is also legal.
Reusing the content you scraped, if copyright protected, is not.
Edit: unless you get the authorization of the original authors but OpenAI didn't even asked, that's why it's a crime.
Sounds like fair use to me.
That really will be the question at hand. Is the ai producing work that could be considered transformative, educational, or parody? The answer is of course yes, it is capable of doing all three of those things, but it's also capable of being coaxed into reproducing things exactly.
I don't know if current copyright laws are capable of dealing with the ai Renaissance.
Yeah it is. The only protection in copyright is called derivative works, and an AI is not a derivative of a book, No more than your brain is after you've read one.
The only exception would be if you manage to overtrain and encode the contents of the book inside of the model file. That's not what happened here because I'll chat GPT output was a summary.
The only valid claim here is the fact that the books were not supposed to be on the public internet and it's likely that the way open AI the books in the first place was through some piracy website through scraping the web.
At that point you just have to hold them liable for that act of piracy, not the fact that the model release was an act of copyright violation.