Why is AI Pornifying Asian Women? (joysauce.com)

submitted 2 years ago by Gaywallet@beehaw.org to c/technology@beehaw.org


Even_Adder@lemmy.dbzer0.com 1 points 2 years ago

I wouldn't mind. I'm here for it.

jarfil@beehaw.org 2 points 2 years ago

Are you ready to run a 100B FP64 parameter model? Or even a 10B FP32 one?

Over time, I wouldn't be surprised if 500B INT8 models became commonplace with neuromorphic RAM, but there's still some time for that to happen.
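For a sense of scale, the sizes mentioned above can be worked out with simple arithmetic: parameter count times bytes per parameter. This is a rough sketch that counts only the raw weights (no activations, optimizer state, or file overhead) and uses decimal gigabytes; the function name `weights_size_gb` is made up for illustration.

```python
# Bytes needed to store one parameter at each numeric precision.
BYTES_PER_PARAM = {"FP64": 8, "FP32": 4, "FP16": 2, "INT8": 1}


def weights_size_gb(n_params: float, dtype: str) -> float:
    """Raw weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


# The three configurations named in the comment above.
for n_params, dtype in [(100e9, "FP64"), (10e9, "FP32"), (500e9, "INT8")]:
    size = weights_size_gb(n_params, dtype)
    print(f"{n_params / 1e9:.0f}B {dtype}: {size:.0f} GB")
```

Under those assumptions, that comes to roughly 800 GB, 40 GB, and 500 GB of weights respectively.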

Even_Adder@lemmy.dbzer0.com 1 points 2 years ago* (last edited 2 years ago)

You don't need that many parameters; 4 GB checkpoints work just fine.

jarfil@beehaw.org 2 points 2 years ago

For more inclusive models, or for current ones? In order to add something, either the size has to grow, or something would need to get pushed out (content, or quality). 4GB models are already at the limit of usefulness, both DALLE3 and SDXL run at about 12B parameters, so to make them "more inclusive" they'd have to grow.

Even_Adder@lemmy.dbzer0.com 3 points 2 years ago

I'm saying SD 1.5 and SDXL capture the concepts just fine, it's just during fine-tuning people train away some of the diversity.

jarfil@beehaw.org 1 points 2 years ago

Wait, by "fine-tuning"... do you mean LoRAs? Because those are more like brain surgery with a sledgehammer, rather the opposite of "fine". I don't think it's possible for LoRAs to avoid having undesirable side effects... and I don't think people even want that.

Actual "fine" tuning would be adding the LoRA's training data to the original set, then training the whole model from scratch... and that would require increasing the model's size to encode the increased amount of data for the same output quality.
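For context on how small a LoRA is next to the layer it modifies, here is a rough sketch assuming the usual low-rank formulation: the original weight matrix stays frozen and only two thin matrices (rank `r`) are trained and added on top. The layer dimensions and rank below are illustrative and not taken from any model mentioned in this thread; `lora_param_counts` is a made-up helper name.

```python
def lora_param_counts(d_out: int, d_in: int, rank: int) -> tuple[int, int]:
    """Return (full, lora) trainable-parameter counts for one weight matrix.

    full: every entry of the d_out x d_in matrix is trainable.
    lora: only B (d_out x rank) and A (rank x d_in) are trainable.
    """
    full = d_out * d_in
    lora = rank * (d_out + d_in)
    return full, lora


# Hypothetical 768 x 768 projection layer with a rank-8 adapter.
full, lora = lora_param_counts(768, 768, 8)
print(f"full: {full}, LoRA: {lora}, ratio: {lora / full:.2%}")
```

With so few trainable values per layer, the adapter can only push the existing weights in a handful of directions, which fits the point above about side effects being hard to avoid.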

Even_Adder@lemmy.dbzer0.com 2 points 2 years ago

I mean like this. This paper just dropped the other day.

jarfil@beehaw.org 1 points 2 years ago* (last edited 2 years ago)

Nice read, and an interesting approach... although it kind of tries to hide the elephant in the room:

This work has the potential to shift the way that image generators operate at achievable costs to ensure that several categories of harm from ‘AI’ generated models are mitigated, while the generated images become much more realistic and representative of the AI-generated images that populations want around the world.

They show that the approach optimizes for "less stereotypical" and "less offensive", which in most cultures moves "cultural representation" from worse to better... but notice the split in the "Indian" culture cohort, where an equal share found "more stereotypical, more offensive" to be just as good at "cultural representation".

They basically made the model more politically correct and "idealized", but in the process removed part of a cultural representation that wasn't wrong, because the "culture" itself is split to begin with.

Even_Adder@lemmy.dbzer0.com 1 points 2 years ago

"Indian" is a huge population of very diverse people.

jarfil@beehaw.org 1 points 2 years ago

That's my point. They claim to reduce misrepresentation, while at the same time they erase a bunch of correct representations.

Going back to what I was saying: fine-tuning doesn't increase diversity, it only shifts the biases. Encoding actual diversity would require increasing the model's size, then making sure it can output every correct representation.

Even_Adder@lemmy.dbzer0.com 3 points 2 years ago

It doesn't necessarily have to shift away from diversity biases. I think with care, you can preserve the biases that matter most. That was just their first shot at it, this seems like something you'd get better at over time.

jarfil@beehaw.org 2 points 2 years ago

I guess their main shortcoming was the cultural training set. I'm still unconvinced that level of fine-tuning is possible without increasing model size, but we'll see what happens if/when someone curates a much larger set with cultural labeling.

The labels might also need to be more granular, like "culture:subculture:period", or something... which is kind of a snake's nest by itself.

this post was submitted on 16 Jan 2024

85 points (100.0% liked)
