this post was submitted on 22 Aug 2023
677 points (95.6% liked)
Technology
People are acting like ChatGPT is storing the entire Harry Potter series in its neural net somewhere. It’s not storing or reproducing text in a 1:1 manner from the original material. Certain material, like very popular books, has likely been interpreted tens of thousands of times due to how many times it was reposted online (and therefore how many times it appeared in the training data).
Just because it can recite certain passages almost perfectly doesn’t mean it’s redistributing copyrighted books. How many quotes do you know perfectly from books you’ve read before? I would guess quite a few. LLMs are doing the same thing, but on mega steroids with a nearly limitless capacity for information retention.
Using copyrighted work, art for example, still influences the AI, which they make a profit from.
If they use my works, then they need to pay. That's it.
Still kinda blows my mind how like the most socialist people I know (fellow artists) turned super capitalist the second a tool showed like an inkling of potential to impact their bottom line.
Personally, I'm happy to have my work scraped and permutated by systems that are open to the public. My biggest enemy isn't the existence of software scraping an open internet, it's the huge companies who see it as a way to cut us out of the picture.
If we go all copyright crazy on the models for looking at stuff we've already posted openly on the internet, the only companies with access to the tools will be those who already control huge amounts of data.
I mean, for real, it's just mind-blowing seeing the entire artistic community pretty much go full-blown "Metallica with the RIAA" after decades of making the "you wouldn't download a car" joke.
I feel like a lot of internet people (not even just socialists) go from seeing copyright as at best a compromise that allows the arts to have value under capitalism to treating it like a holy doctrine when the subject of LLMs comes up.
Like, people who will say "piracy is always okay" will also say "ban AI, period" (and misrepresent organizations that want regulations on its use as wanting a full ban.)
Like, growing up with an internet full of technically illegal content (or grey area at best) like fangames and YouTube Poops made me a lifelong copyright skeptic. It's outright confusing to me when people take copyright as seriously as this.
I say piracy is always okay, but I'm also a big fan of AI. I had ChatGPT write my last cover letter and got the job.
Based