379
submitted 10 months ago by L4s@lemmy.world to c/technology@lemmy.world

GenAI tools ‘could not exist’ if firms are made to pay copyright::undefined

you are viewing a single comment's thread
view the rest of the comments
[-] Marcbmann@lemmy.world 38 points 10 months ago

Reproduction of copyrighted material would be breaking the law. Studying it and using it as reference when creating original content is not.

[-] 1Fuji2Taka3Nasubi@lemmy.zip 8 points 10 months ago

Reproduction of copyrighted material would be breaking the law. Studying it and using it as reference when creating original content is not.

I’m curious why we think otherwise when it is a student obtaining an unauthorized copy of a textbook to study, or researchers getting papers from sci-hub. Probably because it benefits corporations and they say so?

[-] Marcbmann@lemmy.world 7 points 10 months ago

While I would like to be in a world where knowledge is free, this is apples and oranges.

OpenAI can purchase a textbook and read it. If their AI uses the knowledge gained to explain maths to an individual, without reproducing the original material, then there's no issue.

The difference is the student in your example didn't buy their textbook. Someone else bought it and reproduced the original for others to study from.

If OpenAI was pirating textbooks, that would be a wholly separate issue.

[-] 1Fuji2Taka3Nasubi@lemmy.zip 2 points 10 months ago

I agree that the issues

  • whether AI output are derivative works of its input, and
  • whether input to AI is fair use and requires no compensation

are separate, but I think they are related, in that AI companies are trying to impose whatever interpretation of copyright that is convenient to them to the rest of the society.

And indeed Meta pirated books to feed its AI.

https://www.techspot.com/news/101507-meta-admits-using-pirated-books-train-ai-but.html

[-] sixCats@lemmy.dbzer0.com 2 points 10 months ago

I was under the impression they mentioned at some point torrenting things

[-] 1Fuji2Taka3Nasubi@lemmy.zip 1 points 10 months ago
[-] Blackmist@feddit.uk 2 points 10 months ago

The fact that the "AI" can spit out whole passages verbatim when given the right prompts, suggests that there is a big problem here and they haven't a clue how to fix it.

It's not "learning" anything other than the probable order of words.

[-] FatCrab@lemmy.one 4 points 10 months ago

I really hate this reduction of gpt models. Is the model probabilistic? Absolutely. But it isn't simply learning a comprehensible probability of words--it is generating a massively complex conditional probability sequence for words. Largely, humans might be said to do the same thing. We make a best guess at the sequence of words we decide to use based on conditional probabilities along a myriad number of conditions (including semantics of the thing we want to say).

[-] Marcbmann@lemmy.world 0 points 10 months ago

Completely agree. And that should be the focal point of the issue.

Sam Altman is correctly stating that AI is not possible without using copyrighted materials. And I don't think there's anything wrong with that.

His mistake is not redirecting the conversation. He should be talking about the efforts they're making to stop their machine from reproducing copyrighted works. Not whether or not they should be allowed to use it in the first place.

[-] homesweethomeMrL@lemmy.world 6 points 10 months ago

humans studying it, is fair use.

[-] hglman@lemmy.ml 11 points 10 months ago

So if a tool is involved, it's no longer ok? So, people with glasses cannot consume copyrighted material?

[-] Harbinger01173430@lemmy.world 5 points 10 months ago

No. A tool already makes it unnatural. /S

[-] hedgehog@ttrpg.network 7 points 10 months ago

Copyright can only be granted to works created by a human, but I don’t know of any such restriction for fair use. Care to share a source explaining why you think only humans are able to use fair use as a defense for copyright infringement?

[-] LainTrain@lemmy.dbzer0.com 3 points 10 months ago

What's the difference? Humans are just the intent suppliers, the rest of the art is mostly made possible by software, whether photoshop or stable diffusion.

[-] Marcbmann@lemmy.world 0 points 10 months ago

I don't agree. The publisher of the material does not get to dictate what it is used for. What are we protecting at the end of the day and why?

In the case of a textbook, someone worked hard to explain certain materials in a certain way to make the material easily digestible. They produced examples to explain concepts. Reproducing and disseminating that material would be unfair to the author who worked hard to produce it.

But the author does not have jurisdiction over the knowledge gained. They cannot tell the reader that they are forbidden from using the knowledge gained to tutor another person in calculus. That would be absurd.

IP law protects the works of the creator. The author of a calculus textbook did not invent calculus. As such, copyright law does not apply.

[-] wewbull@feddit.uk 1 points 10 months ago

The model itself is a derivative work. It's existence is what is under dispute. It's not about using the model to produce further works

[-] Marcbmann@lemmy.world 4 points 10 months ago

Then every single student graduating college produces derivative work.

Everything that required the underlying knowledge gained from the textbooks studied, or research papers read, is derivative work.

At the core of this, what are we saying? Your machine could only explain calculus because it was provided information from multiple calculus textbooks? Well, that applies to literally everyone.

[-] assassin_aragorn@lemmy.world 0 points 10 months ago

Charge AI recurring tuition then.

this post was submitted on 28 Jan 2024
379 points (95.2% liked)

Technology

59670 readers
1939 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS