109

OpenAI: 'The New York Times Paid Someone to Hack Us' * TorrentFreak (torrentfreak.com)

submitted 2 years ago by aPirate@lemmy.dbzer0.com to c/technology@lemmy.world

17 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] Catoblepas@lemmy.blahaj.zone 63 points 2 years ago

“Established copyright doctrine will dictate that the Times cannot prevent AI models from acquiring knowledge about facts, any more than another news organization can prevent the Times itself from re-reporting stories it had no role in investigating,” OpenAI writes.

Oh boy, their defense is that their advanced predictive text can acquire knowledge? Please, proceed.

[-] phoneymouse@lemmy.world 41 points 2 years ago

It would be a plausible defense if the AI model wasn’t regurgitating Times articles verbatim.

[-] Bye@lemmy.world 3 points 2 years ago

It still is defensible. I can quote a whole bunch of lines from “talladega nights” and “old school” verbatim. I can sing the entirety of “Amish paradise”, with close to 100% accuracy.

My recall ability does not mean that I’ve violated copyright.

[-] phoneymouse@lemmy.world 42 points 2 years ago* (last edited 2 years ago)

This doesn’t matter. You personally reciting movie quotes as a private individual is fair use. OpenAI’s ChatGPT has a commercial purpose, and you could say it does compete with The NY Times.

[-] abhibeckert@lemmy.world 0 points 2 years ago* (last edited 2 years ago)

you could say it does compete with The NY Times

Only indirectly - as in airplanes competing with cars. And the law generally encourages that type of competition as it leads to substantial innovation and economic growth.

[-] GiveMemes@jlai.lu 4 points 2 years ago* (last edited 2 years ago)

That's like saying amazon and mom and pop gift shops don't compete. Like yeah, a lot of people will still prefer the atmosphere and curation of the mom and pop shop but that doesn't fucking matter when the vast majority of people just use Amazon, driving the shop out of business. This despite the fact that Amazon is more general and only competes indirectly.

[-] Kbin_space_program@kbin.social 22 points 2 years ago

No, but you're not trying to sell your abilities to write things. The entire point of OpenAI as a company is to sell its LLM.

[-] abhibeckert@lemmy.world -1 points 2 years ago* (last edited 2 years ago)

What does that have to do with copyright infringement though? And how would it be illegal?

I could totally start a website, maybe call it "New York Stories", read every news article about New York (I'd get a lot of them from NYT) and then working off my own memory, not copy/pasting the text write/publish the same story. That would not be copyright infringement. In fact the NYT themselves do it all the time, publishing things that were originally reported elsewhere. You're allowed to do that as long as you don't produce exact copies.

LLMs generally don't do exact copies of anything - they're just not exact at all. If you ask the AI exactly the same question a thousand times, you won't get precisely the same exact response twice.

For example asking "What should I eat in New York?" gave me:

New York City offers a vast array of culinary experiences, reflecting its diverse culture. Here's a mix of iconic eats and modern must-tries:

Pizza: New York-style pizza is famous worldwide. Visit classic spots like Di Fara, Lombardi's, or newer favorites like Lucali for a slice of this iconic dish.

Bagels and Lox: New York bagels are [... several more paragraphs ...]

Then the same question again:

New York City is a melting pot of cultures, making it one of the best places in the world to explore a wide variety of cuisines. Here are some iconic foods and places to consider when deciding what to eat in New York:

Pizza: New York-style pizza is famous worldwide. Look for places with a long history and great reviews, such as Lombardi's (America's first pizzeria), Di Fara Pizza, or Joe's Pizza for a classic slice.

Bagels: Another iconic New York [...]

It's approximately the same response but not exactly the same and even recommends different restaurants.

Being exact matters when it comes to copyright infringement. Like OpenAI I'm genuinely curious how they got it to output a verbatim copy of anything. That's highly unusual behaviour and if they had reported it to the company I'm sure it would have been fixed. Just like if someone posted an exact copy of an NYT article in this community it would be removed and nobody would be taken to court.

[-] JoBo@feddit.uk 22 points 2 years ago

Are you charging for your performances?

[-] Maestro@kbin.social 16 points 2 years ago

If you write it down and sell it you 100% do violate copyright

[-] sphericth0r@kbin.social 2 points 2 years ago

Google be in trouble then

[-] mosiacmango@lemm.ee 10 points 2 years ago* (last edited 2 years ago)

Do you have paying customers that ask you for movie scripts and song lyrics like OpenAi does? If so, the above would be flat out copyright infringement.

[-] bitwaba@lemmy.world 2 points 2 years ago

Pffffft, this guy can't recite "Forgot about Dre" from memory...

this post was submitted on 28 Feb 2024

109 points (95.0% liked)

Technology

84878 readers

1072 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws