200

Stability AI CEO resigns because you can't beat centralized AI with more centralized AI (techcrunch.com)

submitted 1 year ago by ylai@lemmy.ml to c/technology@lemmy.world

21 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[-] avidamoeba@lemmy.ca 107 points 1 year ago

Am I the only one who feels dotcom vibes around this field?

[-] daddy32@lemmy.world 77 points 1 year ago

I expect this to proceed similarly: many companies and funding dollars will burn in flames and still, the world will be a different place in a decade thanks to this technology.

[-] avidamoeba@lemmy.ca 20 points 1 year ago

Why am I feeling it isn't going to be a repeat of the standards-driven co-operative development supported by open source software infrastructure that occurred during the decade and a half after the dotcom bubble.. I have a feeling it would resemble the pre mass computing world of AT&T, GE and IBM.

[-] andyburke@fedia.io 25 points 1 year ago

There are a lot of open source LLMs being developed, ones you can run at home on your own data.

[-] umbrella@lemmy.ml 5 points 1 year ago

i hope these take off too

[-] LainTrain@lemmy.dbzer0.com 3 points 1 year ago

What would be the threshold for them to "take off"? It's all already out, so already there no?

[-] umbrella@lemmy.ml 1 points 1 year ago

its been a while, but last i tried it wasnt as good as the proprietary models.

[-] LainTrain@lemmy.dbzer0.com 1 points 1 year ago

Which ones did you try?

[-] umbrella@lemmy.ml 1 points 1 year ago* (last edited 1 year ago)

i tried the llama model for text, and another one meant for images i cant quite remember the name but it was one of the main ones.

are they any good now? running an llm actually sounds mildly useful.

[-] admin@lemmy.my-box.dev 1 points 1 year ago

The Mixtral models are pretty good, although they require a LOT of memory to run at a decent pace.

[-] LainTrain@lemmy.dbzer0.com 1 points 1 year ago

Honestly i think speed is something I don't care too much about with models, because even things like ChatGPT will be slower than Google for most things, and if something is more complex and a good use case for an LLM it's unlikely to be the primary bottleneck.

My ~~gf~~ private chat bot right now is a combination of Mistral 7B with a custom finetune and ~~she~~ it directs some queries to ChatGPT if I ask (I got free tokens way back might as well burn through them).

How much of an improvement is Mixtral over Mistral in practice?

[-] admin@lemmy.my-box.dev 1 points 1 year ago

Sillytavern by any chance?

And I'd say the difference between mistral and mixtral is pretty big for general usage, feels like it's a next generation.

load more comments (15 replies)

this post was submitted on 23 Mar 2024

200 points (96.3% liked)

Technology

72688 readers

868 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

L4s@hackingne.ws