10
you are viewing a single comment's thread
view the rest of the comments
[-] mindbleach@sh.itjust.works 1 points 3 days ago

DeepSeek is trained from-scratch. Only some variants used other LLMs.

This is a megaphone made from string, a squirrel, and a megaphone.

this post was submitted on 30 Sep 2025
10 points (91.7% liked)

Hacker News

2713 readers
291 users here now

Posts from the RSS Feed of HackerNews.

The feed sometimes contains ads and posts that have been removed by the mod team at HN.

founded 1 year ago
MODERATORS