
submitted 2 years ago by The_Picard_Maneuver@startrek.website to c/funny@lemmy.world


[-] Turun@feddit.de 2 points 2 years ago

There would still need to be a corpus of text and some supervised training of a model on that text in order to “recognize” with some level of confidence what the text represents, right?

Correct. The CLIP encoder is trained on images paired with their corresponding descriptions, so it learns the names of the things that appear in images.

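And now it is obvious why this prompt fails: captions describe what is in an image, not what is missing, so there are no images of empty rooms tagged as "no elephants". The word "elephants" in the prompt just pulls the result towards elephants. This can be fixed by moving "elephants" into a negative prompt, which steers the image away from that concept at each denoising step.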
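For anyone curious what that "steering away" looks like, here is a rough sketch of the classifier-free guidance arithmetic as I understand it: the negative prompt takes the place of the empty prompt as the baseline the model is pushed away from. Everything here is a toy. The 2-D vectors, their values, and the `guided_noise` helper are made up for illustration and do not come from any real model or library.

```python
import numpy as np

def guided_noise(eps_positive, eps_negative, guidance_scale=7.5):
    """Combine two model predictions the way classifier-free guidance does.

    eps_positive: prediction conditioned on the prompt.
    eps_negative: prediction conditioned on the negative prompt
                  (or on the empty prompt when no negative prompt is given).
    The result starts at the negative prediction and moves past the
    positive one, i.e. away from whatever the negative prompt describes.
    """
    eps_positive = np.asarray(eps_positive, dtype=float)
    eps_negative = np.asarray(eps_negative, dtype=float)
    return eps_negative + guidance_scale * (eps_positive - eps_negative)

# Toy 2-D "concept space" (hypothetical): axis 0 = "room", axis 1 = "elephant".
eps_room = np.array([1.0, 0.3])      # prompt "empty room", with some elephant leaking in
eps_empty = np.array([0.0, 0.0])     # empty prompt, the default baseline
eps_elephant = np.array([0.0, 1.0])  # negative prompt "elephants"

without_negative = guided_noise(eps_room, eps_empty, guidance_scale=2.0)
with_negative = guided_noise(eps_room, eps_elephant, guidance_scale=2.0)

print("elephant component without negative prompt:", without_negative[1])
print("elephant component with negative prompt:   ", with_negative[1])
```

In this toy setup the "room" component is the same in both cases, while the "elephant" component drops once "elephants" is used as the baseline. In a real pipeline the same combination is applied to the full noise prediction at every sampling step.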

this post was submitted on 08 Feb 2024

992 points (98.9% liked)
