1011
Absolutely not training data no way no sir
(i.redd.it)
For preserving the least toxic and most culturally relevant Tumblr heritage posts.
Image descriptions and plain text captions of written content are expected of all screenshots. Here are some image text extractors (I looked these up quick and will gladly take FOSS recommendations):
-web
-iOS
Please begin copied raw text posts (lacking a screenshot that makes it apparent it is from Tumblr) with:
# This has been reposted here to Lemmy as part of the "Curated Tumblr Project."
I made the icon using multiple creative commons svg resources, the banner is this.
One is theft and an infringement of privacy for nefarious ends and the other is a painting. There's a world of difference between agreeing to let someone paint you and a corporation using your data to train AI. Spinning this basic reality into sinophobia is mind boggling. There are people in this thread shitting on Google for the same thing. Would you call it amerophobic to criticize Google for the same shit? Of course you wouldn't
It's not theft. Fuck copyrights.
Sure, for corporations and the wealthy. But it straight up yanks information from small authors, artists, etc. I couldn't give less of a shit if Disney is impacted from AI, but there is real potential for harm to average people. Submitting your shit to AI should be opt-in, scraping the web for content that company didn't create with no consent from the content creators for the means of profiting off their labor is wrong. Copyright is fucked, yes. It protects the wealthy more than it protects the non-wealthy, yes. These companies practices are still fucked too. Two things can be bad and there is plenty of room for nuance in this area
I never said it was sinophobic I said that they're utilizing peoples preexisting dispositions to consolidate power in the AI space. Which is objectively true, the large companies are currently doing everything they can to demonize open source models.