165
submitted 1 year ago by floofloof@lemmy.ca to c/technology@lemmy.ml
you are viewing a single comment's thread
view the rest of the comments
[-] Dr_Cog@mander.xyz 11 points 1 year ago* (last edited 1 year ago)

Neither citation nor compensation are necessary for fair use, which is what occurs when an original work is used for its concepts but not reproduced.

[-] SheeEttin@lemmy.world 5 points 1 year ago

Sure, but fair use is rather narrowly defined. You must consider the purpose, nature, amount, and effect. In the case of scraping entire bodies of work as training data, the purpose is commercial, the nature is not in the public interest, the amount is the work in its entirety, and the effect is to compete with the original author. It fails to meet any criteria for fair use.

[-] Dr_Cog@mander.xyz 1 points 1 year ago

The work is not reproduced in its entirety. Simply using the work in its entirety is not a violation of copyright law, just as reading a book or watching a movie (even if pirated) is not a violation. The reproduction of that work is the violation, and LLMs simply do not store the works in their entirety nor are they capable of reproducing them.

[-] SheeEttin@lemmy.world 2 points 1 year ago

It doesn't have to be reproduced to be a copyright violation, only used. For example, publishing your Harry Potter fanfic would be infringement. You're not reproducing the original material in any way, but you're still heavily depending on it.

[-] Electricblush@lemmy.world 0 points 1 year ago

Breach of trademark, not copyright, whole different barrel of fish.

this post was submitted on 02 Oct 2023
165 points (89.5% liked)

Technology

34909 readers
236 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago
MODERATORS