Greg Rutkowski Was Removed From Stable Diffusion, But AI Artists Brought Him Back

[+] raccoona_nongrata@beehaw.org 35 points 2 years ago

[deleted]

[-] selzero@syzito.xyz 10 points 2 years ago* (last edited 2 years ago)

@raccoona_nongrata @fwygon

Rutowski, Monet, and Rockwell could also not create without human art.

All creativity is a combination of past creativity.

Even Monet.

Even Shakespeare.

Even Beethoven.

[+] raccoona_nongrata@beehaw.org 21 points 2 years ago

[deleted]

[-] selzero@syzito.xyz 9 points 2 years ago

@raccoona_nongrata

Actually. It is necessary. The process of creativity is much much more a synergy of past consumption than we think.

It took 100,000 years to get from cave drawings to Leonard Da Vinci.

Yes we always find ways to draw, but the pinnacle of art comes from a shared culture of centuries.

[+] raccoona_nongrata@beehaw.org 12 points 2 years ago

[deleted]

[-] selzero@syzito.xyz 2 points 2 years ago

@raccoona_nongrata

A machine will not unilaterally develop an art form, and develop it for 100,000 years.

Yes I agree with this.

However, they are not developing an art form now.

Nor did Monet, Shakespeare, or Beethoven develop an art form. Or develop it for 100,000 years.

So machines cannot emulate that.

But they can create the end product based on past creations, much as Monet, Shakespeare, and Beethoven did.

[-] ParsnipWitch@feddit.de 3 points 2 years ago

No, humans create and develope styles in art from "mistakes" that AI would not continue pursuing. Because they personally like it or have a strange addiction to their own creative process. The current hand mistakes for example were perhaps one of the few interesting things AI has done...

Current AI models recreate what is most liked by the majority of people.

[-] I_Has_A_Hat@lemmy.ml 1 points 2 years ago

And what if the human running the AI likes one of these "mistakes" and tells the AI to run with it?

[-] ParsnipWitch@feddit.de 1 points 2 years ago

But that's still not how it works for an artist. I don't mean stumbling upon an accident and using it in your work but deliberately creating something that's not liked and perfect the way you do it. For someone who just instructs a tool and generates images in rapid speed they go a very different path.

[-] glenatron@dice.camp 13 points 2 years ago

@selzero @raccoona_nongrata @fwygon But human creativity is not ONLY a combination of past creativity. It is filtered through a lifetime of subjective experience and combined knowledge. Two human artists schooled on the same art history can still produce radically different art. Humans are capable of going beyond has been done before.

Before going too deep on AI creation spend some time learning about being human. After that, if you still find statistical averages interesting, go back to AI.

[-] selzero@syzito.xyz 5 points 2 years ago* (last edited 2 years ago)

@glenatron @raccoona_nongrata @fwygon

I mean, yes, you are right, but essentially, it is all external factors. They can be lived through external factors, or data fed external factors.

I don't think there is a disagreement here other than you are placing a lot of value on "the human experience" being an in real life thing rather than a read thing. Which is not even fully true of the great masters. It's a form of puritan fetishisation I guess.

[-] glenatron@dice.camp 8 points 2 years ago

@selzero @raccoona_nongrata @fwygon I don't think it's even contraversial. Will sentient machines ever have an equivalent experience? Very probably. Will they be capable of creating art? Absolutely.

Can our current statistical bulk reincorporation tools make any creative leap? Absolutely not. They are only capable of plagiarism. Will they become legitimate artistic tools? Perhaps, when the people around them start taking artists seriously instead of treating them with distain.

[-] selzero@syzito.xyz 6 points 2 years ago

@glenatron @raccoona_nongrata @fwygon

This angle is very similar to a debate going on in the cinema world, with Scorsese famously ranting that Marvel movies are "not movies"

The point being without a directors message being portrayed, these cookie cutter cinema experiences, with algorithmically developed story lines, should not be classified as proper movies.

But the fact remains, we consume them as movies.

We consume AI art as art.

[-] glenatron@dice.camp 3 points 2 years ago

@selzero @raccoona_nongrata @fwygon I try not to consume it as art. There is plenty of original art by real artists. The averages of that dataset are less interesting to me than the original data points.

[-] aredridel@kolektiva.social 1 points 2 years ago

@selzero @glenatron @raccoona_nongrata @fwygon And thousands of people's creativity is in the Marvel movie, but one person hammering out a prompt on the AI art. They're still vastly different. Even the most banally corporate movie is still a work of staggering human creativity and _working together_.

Stable diffusion image generators are not.

[-] selzero@syzito.xyz 1 points 2 years ago

@aredridel @glenatron @raccoona_nongrata @fwygon

Humans are also machines, biological machines, with a neurology based on neurons and synapse. As pointed out before, human "creativity" is also a result of past external consumption.

When AI is used to eventually make a movie, it will use more than one AI model. Does that make a difference? I guess your "one person" example is Scorsese's "auteur"?

It seems we are fetishizing biological machines over silicon machines?

[-] aredridel@kolektiva.social 1 points 2 years ago

@selzero @glenatron @raccoona_nongrata @fwygon no. Human relationships of cocreation over purely extractive ones. It’s not the biology (though humans have human relevant social drives simple algorithms don’t), it’s the relationships.

It’s obscuring that as if these clusters of Gpus care about creating and form relationships based on them that is so offensive.

[-] selzero@syzito.xyz 1 points 2 years ago

@aredridel @glenatron @raccoona_nongrata @fwygon

I don't understand, can you elaborate please. How is it not biological?

[-] aredridel@kolektiva.social 1 points 2 years ago

@selzero @glenatron @raccoona_nongrata @fwygon it’s biological the way zoology is physics. Technically true but so deeply ignorant of the orders of magnitude of history and emergent complexity for that also to not be relevant. It’s a profoundly reductive way to look at things to the point of missing their fundamental nature.

[-] selzero@syzito.xyz 1 points 2 years ago

@aredridel @glenatron @raccoona_nongrata @fwygon

So, a human being a link in the chain of this historical cultural development of creation, is "more valuable" than a machine doing that?

Who makes these rules?

There is some kind of value structure at play here that I have not been made privy to?

[-] housepanther@mstdn.goblackcat.com 2 points 2 years ago

@raccoona_nongrata @fwygon This is absolutely correct!

[-] Thevenin@beehaw.org 19 points 2 years ago

It doesn't change anything you said about copyright law, but current-gen AI is absolutely not "a virtual brain" that creates "art in the same rough and inexact way that we humans do it." What you are describing is called Artificial General Intelligence, and it simply does not exist yet.

Today's large language models (like ChatGPT) and diffusion models (like Stable Diffusion) are statistics machines. They copy down a huge amount of example material, process it, and use it to calculate the most statistically probable next word (or pixel), with a little noise thrown in so they don't make the same thing twice. This is why ChatGPT is so bad at math and Stable Diffusion is so bad at counting fingers -- they are not making any rational decisions about what they spit out. They're not striving to make the correct answer. They're just producing the most statistically average output given the input.

Current-gen AI isn't just viewing art, it's storing a digital copy of it on a hard drive. It doesn't create, it interpolates. In order to imitate a person't style, it must make a copy of that person's work; describing the style in words is insufficient. If human artists (and by extension, art teachers) lose their jobs, AI training sets stagnate, and everything they produce becomes repetitive and derivative.

None of this matters to copyright law, but it matters to how we as a society respond. We do not want art itself to become a lost art.

[-] Fauxreigner@beehaw.org 8 points 2 years ago

Current-gen AI isn’t just viewing art, it’s storing a digital copy of it on a hard drive.

This is factually untrue. For example, Stable Diffusion models are in the range of 2GB to 8GB, trained on a set of 5.85 billion images. If it was storing the images, that would allow approximately 1 byte for each image, and there are only 256 possibilities for a single byte. Images are downloaded as part of training the model, but they're eventually "destroyed"; the model doesn't contain them at all, and it doesn't need to refer back to them to generate new images.

It's absolutely true that the training process requires downloading and storing images, but the product of training is a model that doesn't contain any of the original images.

None of that is to say that there is absolutely no valid copyright claim, but it seems like either option is pretty bad, long term. AI generated content is going to put a lot of people out of work and result in a lot of money for a few rich people, based off of the work of others who aren't getting a cut. That's bad.

But the converse, where we say that copyright is maintained even if a work is only stored as weights in a neural network is also pretty bad; you're going to have a very hard time defining that in such a way that it doesn't cover the way humans store information and integrate it to create new art. That's also bad. I'm pretty sure that nobody who creates art wants to have to pay Disney a cut because one time you looked at some images they own.

The best you're likely to do in that situation is say it's ok if a human does it, but not a computer. But that still hits a lot of stumbling blocks around definitions, especially where computers are used to create art constantly. And if we ever hit the point where digital consciousness is possible, that adds a whole host of civil rights issues.

[-] Thevenin@beehaw.org 4 points 2 years ago

It’s absolutely true that the training process requires downloading and storing images

This is the process I was referring to when I said it makes copies. We're on the same page there.

I don't know what the solution to the problem is, and I doubt I'm the right person to propose one. I don't think copyright law applies here, but I'm certainly not arguing that copyright should be expanded to include the statistical matrices used in LLMs and DPMs. I suppose plagiarism law might apply for copying a specific style, but that's not the argument I'm trying to make, either.

The argument I'm trying to make is that while it might be true that artificial minds should have the same rights as human minds, the LLMs and DPMs of today absolutely aren't artificial minds. Allowing them to run amok as if they were is not just unfair to living artists... it could deal irreparable damage to our culture because those LLMs and DPMs of today cannot take up the mantle of the artists they hedge out or pass down their knowledge to the next generation.

[-] Fauxreigner@beehaw.org 2 points 2 years ago

Thanks for clarifying. There are a lot of misconceptions about how this technology works, and I think it's worth making sure that everyone in these thorny conversations has the right information.

I completely agree with your larger point about culture; to the best of my knowledge we haven't seen any real ability to innovate, because the current models are built to replicate the form and structure of what they've seen before. They're getting extremely good at combining those elements, but they can't really create anything new without a person involved. There's a risk of significant stagnation if we leave art to the machines, especially since we're already seeing issues with new models including the output of existing models in their training data. I don't know how likely that is; I think it's much more likely that we see these tools used to replace humans for more mundane, "boring" tasks, not really creative work.

And you're absolutely right that these are not artificial minds; the language models remind me of a quote from David Langford in his short story Answering Machine: "It's so very hard to realize something that talks is not intelligent." But we are getting to the point where the question of "how will we know" isn't purely theoretical anymore.

[-] Zyansheep@lemmy.ml 2 points 2 years ago

How do you know human brains don't work in roughly the same way chatbots and image generators work?
What is art? And what does it mean for it to become "lost"?

[-] gianni@lemmy.ca 14 points 2 years ago

He literally just explained why.

[-] Zyansheep@lemmy.ml 2 points 2 years ago

No, he just said AI isn't like human brains because its a "statistical machine". What I'm asking is how he knows that human brains aren't statistical machines?

Human brains aren't that good at direct math calculation either!

Also he definitely didn't explain what "lost art" is.

[-] ParsnipWitch@feddit.de 13 points 2 years ago

Current AI models do not learn the way human brains do. And the way current models learn how do "make art" is very different from how human artists do it. To repeatedly try and recreate the work of other artists is something beginners do. And posting these works online was always shunned in artist communities. You also don't learn to draw a hand by remembering where a thousand different artists put the lines so it looks like a hand.

[-] shiri@foggyminds.com 5 points 2 years ago* (last edited 2 years ago)

@fwygon all questions of how AI learns aside, it's not legally theft but philosophically the topic is debatable and very hot button.

I can however comment pretty well on your copyright comments which are halfway there, but have a lot of popular inaccuracies.

Fair use is a very vague topic, and they explicitly chose to not make explicit terms on what is allowed but rather the intents of what is to be allowed. We've got some firm ones not because of specific laws but from abundance of case evidence.

* Educational; so long as it is taught as a part of a recognized class and within curriculum.
* Informational; so long as it is being distributed to inform the public about valid, reasonable public interests. This is far broader than some would like; but it is legal.
* Narrative or Commentary purposes; so long as you're not copying a significant amount of the whole content and passing it off as your own. Short clips with narration and lots of commentary interwoven between them is typically protected. Copyright is not intended to be used to silence free speech. This also tends to include satire; as long as it doesn't tread into defamation territory.

These are basically all the same category and includes some misinformation about what it does and does not cover. It's permitted to make copies for purely informational, public interest (ie. journalistic) purposes. This would include things like showing a clip of a movie or a trailer to make commentary on it.

Education doesn't get any special treatment here, but research might (ie. making copies that are kept to a restricted environment, and only used for research purposes, this is largely the protection that AI models currently fall under because the training data uses copyrighted data but the resulting model does not).

* Transformative; so long as the content is being modified in a substantial enough manner that it is an entirely new work that is not easily confused for the original. This too, is far broader than some would like; but it still is legal.

"Easily confused" is a rule from Trademark Law, not copyright. Copyright doesn't care about consumer confusion, but does care about substitution. That is, if the content could be a substitute for the original (ie. copying someone else's specific painting is going to be a violation up until the point where it can only be described as "inspired by" the painting)

* Reasonable, 'Non-Profit Seeking or Motivated' Personal Use; People are generally allowed to share things amongst themselves and their friends and other acquaintances. Reasonable backup copies, loaning of copies, and even reproduction and presentation of things are generally considered fair use.

This is a very very common myth that gets a lot of people in trouble. Copyright doesn't care about whether you profit from it, more about potential lost profits.

Loaning is completely disconnected from copyright because no copies are being made ("digital loaning" is a nonsense attempt to claiming loaning, but is just "temporary" copying which is a violation).

Personal copies are permitted so long as you keep the original copy (or the original copy is explicitly irrecoverably lost or destroyed) as you already acquired it and multiple copies largely are just backups or conversions to different formats. The basic gist is that you are free to make copies so long as you don't give any of them to anyone else (if you copy a DVD and give either the original or copy to a friend, even as a loan, it's illegal).

It's not good to rely on it being "non-profit" as a copyright excuse, as that's more just an area of leniency than a hard line. People far too often thing that allows them to get away with copying things, it's really just for topics like making backups of your movies or copying your CDs to mp3s.

... All that said, fun fact: AI works are not covered by copyright law.

To be copyrighted a human being must actively create the work. You can copyright things made with AI art, but not the AI art itself (ie. a comic book made with AI art is copyrighted, but the AI art in the panels is not, functioning much like if you made a comic book out of public domain images). Prompts and set up are not considered enough to allow for copyright (example case was a monkey picking up a camera and taking pictures, those pictures were deemed unable to be copyrighted because despite the photographer placing the camera... it was the monkey taking the photos).

[-] Harrison@ttrpg.network 2 points 2 years ago

This is true in US law but it should probably be noted that a lot of the "misconceptions" you're outlining in OP's comment are things that are legal in other jurisdictions

[-] shiri@foggyminds.com 3 points 2 years ago

@Harrison ::face palm:: thank you for calling that out, I'm so used to correcting fellow americans on copyright

[-] joe_vinegar@slrpnk.net 3 points 2 years ago

This is a very nice and thorough comment! Can you provide a reputable source for these points? (no criticism intended: as you seem knowledgeable, I'd trust you could have such reputable sources already selected and at hand, that's why I'm asking).

[-] throwsbooks@lemmy.ca 4 points 2 years ago

Not the poster you're replying to, but I'm assuming you're looking for some sort of source that neural networks generate stuff, rather than plagiarize?

Google scholar is a good place to start. You'd need a general understanding of how NNs work, but it ends up leading to papers like this one, which I picked out because it has neat pictures as examples. https://arxiv.org/abs/1611.02200

What this one is doing is taking an input in the form of a face, and turning it into a cartoon. They call it an emoji, cause it's based on that style, but it's the same principle as how AI art is generated. Learn a style, then take a prompt (image or text) and do something with the prompt in the style.

Technology