this post was submitted on 07 Oct 2023
995 points (97.7% liked)
Technology
I still don’t believe the avocado comic is one-shot AI-generated. Composited from multiple outputs, sure. But I have not once seen generative AI produce an image that includes properly rendered text like this.
Bing Image Creator uses the new DALL-E model, which does hands and text pretty well.
Generated this on the first try with the prompt: a cartoon avocado holding a sign that says 'help me'
People forget just how fast this tech is evolving
Absolutely. SDXL with LoRAs can already do a lot of what was thought impossible.
Image generation tech has gone crazy over the past year and a half or so. At the speed it's improving I wouldn't rule out the possibility.
Here's a paper from this year discussing text generation within images (it's very possible these methods aren't SOTA anymore -- that's how fast this field is moving): https://openaccess.thecvf.com/content/WACV2023/html/Rodriguez_OCR-VQGAN_Taming_Text-Within-Image_Generation_WACV_2023_paper.html
Yeah I'm sceptical too, what tool and prompt was used to produce this?
It's DALL-E 3. It's not that difficult to generate something like that with DALL-E 3. Here are some Shreks I generated as a showcase: Shrek 1 Image
Shrek 2 Image
Shrek 3 Image
All of these are just generated, nothing else.
Huh, interesting. It handles text relatively well.
I found the avocado comic the easiest to tell, since the missing eyebrow was so insanely out of place.
Prompt and tool links? I know there are tools that try to pick out label text in the prompt and composite it after the fact, but I don’t consider this one-shot AI generated, even if it’s a single tool from the user’s perspective.
It's DALL-E 3, like I said. As far as I'm aware, DALL-E 3 doesn't do that, since the text still isn't always perfect. I can't really provide prompts since it's been a while and the history on it isn't great, but it was mostly just "Shrek in x style" and saying "x". Mind you, DALL-E is very heavily censored now, so you're unlikely to be able to recreate that.
It's at https://bing.com/create