submitted 1 year ago by ksynwa@lemmygrad.ml to c/technology@lemmy.ml

Pls explain

[-] Vlyn@lemmy.zip 14 points 1 year ago

Why are humans so bad with drawing hands?

They are tough. AI isn't building a logical model of a human when it draws one. It's more like taking a best guess at where the pixels should go. So it's not "thinking": Alright, drawing a human, a human has two hands, each hand has five fingers, the fingers are posed like this, ...

It's drawing a human, so it roughly throws a human shape on there. A human shape roughly has a head, where there is a torso two arms should come out (roughly), and at the end of those two arms there is something too, but what that something is is complicated and always looks different. It's all approximation, extremely well done, but in the end the AI is just guessing where to put things.

If you trained a model on just a single type of hand in a single finger position, it would replicate it perfectly. But every hand is different, and each hand has a nearly unlimited number of positions it can be in (including each finger). So it's usually a mess.
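To put a rough number on "nearly unlimited", here is a back-of-the-envelope sketch. The figures are assumptions picked for illustration, not anatomy: 5 fingers, 3 joints per finger, and each joint snapped to a handful of discrete angles. Real hands move continuously and also vary in shape, rotation, lighting and occlusion, so this undercounts by a lot.

```python
# Rough count of distinct hand poses under made-up assumptions:
# 5 fingers, 3 joints per finger, each joint limited to a few discrete angles.
FINGERS = 5
JOINTS_PER_FINGER = 3


def pose_count(angles_per_joint: int) -> int:
    """Number of joint-angle combinations for one hand."""
    return angles_per_joint ** (FINGERS * JOINTS_PER_FINGER)


for angles in (2, 4, 8):
    print(f"{angles} angles per joint -> {pose_count(angles):,} poses")
# 2 angles per joint -> 32,768 poses
# 4 angles per joint -> 1,073,741,824 poses
# 8 angles per joint -> 35,184,372,088,832 poses
```

Even with only four possible angles per joint, that is already over a billion configurations for a single hand, before camera angle or the second hand comes into it.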

I saw one way to get better results, but that's pretty much giving the AI a pose beforehand (like a stick figure) so it already knows where things should go. If you just freely generate "Human male, holding hands up", you'll probably get a mess with 6 fingers out and maybe a third arm going nowhere in the back.

[-] ksynwa@lemmygrad.ml 4 points 1 year ago

Why are humans so bad with drawing hands?

The rest of your answer makes sense but this rhetorical question is not helpful IMO. There are lots of things that humans are not good at but at which computers excel.

[-] Vlyn@lemmy.zip 5 points 1 year ago

That's mostly true, but not fully. Models learn from human-drawn images and photos. So if you put in millions of drawn images and the hands aren't perfect in all of them, you might mess up the model too. That's why negative prompts like "malformed", "bad quality", "misformed hands" and so on are popular when playing with image generation.
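For the curious, here is a minimal sketch of how a negative prompt steers the output, assuming the classifier-free guidance scheme that, as far as I know, common Stable Diffusion pipelines use: the negative prompt takes the place of the empty prompt, and each denoising step pushes the prediction away from it and toward the positive prompt. The three-element lists below are hypothetical stand-ins for the model's noise predictions, which are really large tensors, and `guided_prediction` is a name made up for this example.

```python
# Toy classifier-free guidance step with a negative prompt.
# The short lists stand in for a model's noise predictions (hypothetical values).

def guided_prediction(positive, negative, scale):
    """Start at the negative-prompt prediction and move toward the positive one."""
    return [n + scale * (p - n) for p, n in zip(positive, negative)]


positive = [0.75, 0.25, 0.5]  # stand-in prediction for "Human male, holding hands up"
negative = [0.25, 0.25, 1.0]  # stand-in prediction for "malformed", "bad quality"

# scale 1.0 just returns the positive prediction.
print(guided_prediction(positive, negative, scale=1.0))  # [0.75, 0.25, 0.5]

# A larger scale overshoots, moving further away from the negative prediction.
print(guided_prediction(positive, negative, scale=4.0))  # [2.25, 0.25, -1.0]
```

Where the two predictions agree (the middle value) nothing changes. Where they differ, the result is pushed away from whatever the negative prompt describes. It only helps as far as the training captions actually linked words like "malformed" to bad hands.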

this post was submitted on 02 Sep 2023
10 points (85.7% liked)
