deepseek
(lemmy.ml)
I do feel deeply suspicious about this supposedly miraculous AI, to be fair. It just seems too amazing to be true.
You can run it yourself, so that rules out it secretly being human workers in India behind the scenes, as Amazon's no-checkout stores turned out to be.
Other than that, yeah, be suspicious, but OpenAI's models have far more weirdness surrounding them than this company's.
I suspect that OpenAI and the rest just weren't researching ways to lower costs because it makes no financial sense for them. As in, it's not a better model, it's just cheaper to run, which makes it easier for others to catch up.
Mostly, I'm suspicious about how honest the company is being about the cost of training the model. That's one thing that is very difficult to verify.
Open source means it can be publicly audited to help soothe suspicion, right? I imagine that would take time, though, if it's incredibly complex.
Open source is a very loose term when it comes to GenAI. With Llama, for example, the weights are available with few restrictions, but importantly, how it was trained is still secret. Not being reproducible doesn't seem very open to me.
True, but in this case I believe they also open-sourced the training data and the training process.
It's open source, and people are literally self-hosting it for fun right now. The current consensus appears to be that it's not as good as ChatGPT for many things. I haven't personally tried it yet. But either way, there's little to be "suspicious" about, since it's self-hostable and you don't have to give it internet access at all, so it can't phone home.
https://www.reddit.com/r/selfhosted/comments/1ic8zil/yes_you_can_run_deepseekr1_locally_on_your_device/