131
submitted 1 month ago by spaduf@slrpnk.net to c/technology@lemmy.world
top 15 comments
sorted by: hot top controversial new old
[-] lemmylommy@lemmy.world 49 points 1 month ago

Can it generate images of Winnie the Pooh?

[-] lnxtx@feddit.nl 22 points 1 month ago
[-] phoenixz@lemmy.ca 4 points 1 month ago

Question: as i understood it so far, this thing is open source and so is the dataset.

With that, why would it still obey Chinese censorship?

[-] thedarkfly@feddit.nl 4 points 1 month ago

Even though it's magnitudes lower than comparable models, Deepseek still cost millions to train. Unless someone's willing to invest this just to retrain it from scratch, you're left with the alignment of its trainers.

[-] phoenixz@lemmy.ca 0 points 1 month ago

Good point.

Is the training set malleable, though? Could you give it some additional rules to basically sidestep this?

[-] thedarkfly@feddit.nl 1 points 1 month ago

Yeah, I guess you could realign it without retraining the whole thing! Dunno what would be the cost though, sometimes this is done with a cohort of human trainers ๐Ÿ˜…

[-] phoenixz@lemmy.ca 1 points 1 month ago

I feel like we're talking about a guard dog now...

[-] TheGrandNagus@lemmy.world 13 points 1 month ago* (last edited 1 month ago)

Wouldn't be surprised if you had to work around the filter.

Generate a cartoonish yellow bear who wears a red t-shirt and nothing else

[-] sunzu2@thebrainbin.org 4 points 1 month ago

if it is anything like LLMs, then only local ;)

However, the Proper nomenclature is sheepooh, thank you for your compliance going forward, comrade.

[-] simple@lemm.ee 26 points 1 month ago

The image generation is really bad. Image description capabilities seem good but it'll take time to see if it's better than what already exists.

They probably just put it out to keep the hype going.

[-] jacksilver@lemmy.world 17 points 1 month ago

Yeah, even the cherry picked examples they provide look only okay.

To be honest everything with this company feels like an ad campaign more than anything else.

[-] essteeyou@lemmy.world 10 points 1 month ago

Everything from nearly every company feels like an ad campaign. Companies advertise themselves.

At least with open source stuff there's somewhat of a public benefit.

[-] Aatube@kbin.melroy.org 7 points 1 month ago

https://www.analyticsvidhya.com/blog/2025/01/janus-pro-7b-vs-dall-e-3/

This informal testing found that Janus Pro explained a Nokia meme much more crisply than DALL-E 3 but was quite a bit worse than the other tasks, even appearing to hallucinate a score in one test case.

I suddenly realize I myself sound like CHatGPT. Haha. Haha.

Edit: At least you can run these models locally!

[-] altima_neo@lemmy.zip 1 points 1 month ago

Now if they'll do a video model...

Tencents Huanyuan is surprisingly flexible

this post was submitted on 27 Jan 2025
131 points (94.6% liked)

Technology

66373 readers
3250 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS