13

Since I like having more than one local LLM to switch between when analysing tricky development issues I decided to try out this new MoE model today. It's a 30B A3B which means it's basically a drop-in replacement for Qwen 3.6 35B A3B with suitable llama.cpp parameters the same.

On their own published benchmark metrics it's supposed to be slightly worse than Qwen, but so far it's not something I've noticed. It's tuned to work well in Opencode which is how I'm running it as well.

Try it out, see how it works for you. I know that there are those who would rather use a Canadian than Chinese model in today's political climate and it does seem to perform better than Gemma 4 at least for me. Just don't forget to use the PR linked from unsloth's description until it has been merged into main.

30

Maybe it was just me, but in case others have done the same this post might help someone else too.

I have a workstation with plenty of CPU and system RAM, but I'm "GPU poor" in that I only have a 5060Ti with its 16GB of VRAM. Additionally, I need to use the GPU for regular system activities too which means I only have around ~14GB of VRAM available for the LLM.

I'm exclusively using this setup for development and system management tasks, and I've found Qwen 3.6 35B A3B to excel compared to other models. I don't have the VRAM to run the 27GB dense model, so I've spent time on getting the best usage out of the MoE.

Or so I thought. Since "everyone" says to use Unsloth UD-Q4_K_XL that's the quant I've been using, and I've gone a bit back'n'forth with MTP/no MTP, UB increase, mmproj since I've also started using a browser MCP etc.

Today I took another look at their quant chart and thought that since it's MoE maybe I could run Q5_K_S which would be a step up?

Well. Now I'm using Q6_K because it turns out I could run that with the exact same settings as I've optimized my Q4_K_XL setup for which means there are no drawbacks - just a better performing model. I've already noticed how it's able to get out of loops while before I had to interrupt it sometimes.

This is my setup. I get >1000 t/s prefill and >20 t/s inference. I'm not chasing faster inference since I actively read the thought process when working the LLM - but I've increased ub to get faster prefill since that's just waiting time otherwise.

./llama-server
    -hf unsloth/Qwen3.6-35B-A3B-GGUF:UD-Q6_K \
    -c 160000 \
    -n 32768 \
    -fa on \
    -ub 2048 \
    -ctk q8_0 \
    -ctv q8_0 \
    --no-mmap \
    --mlock \
    --no-warmup \
    --chat-template-kwargs '{"preserve_thinking": true}' \
    --temp 0.6 \
    --top-p 0.95 \
    --top-k 20 \
    --min-p 0.0 \
    --presence-penalty 0.0 \
    --repeat-penalty 1.0 \
    --host 0.0.0.0

I also use Opencode with the DCP and Superpowers plugins, which make a tremendous difference both to context handling as well as planning. I have no need for a larger context - I even compact early quite often since the tasks get done before reaching the limit.

2
submitted 2 months ago by troed@fedia.io to c/games@lemmy.world

Seeing as the game happily lists Mithral and Storm Leather as things that can be used for equipment and upgrades without them existing (currently) in the survival portion of the game, is it the same with the memories?

I'm at 196/200 for the last things to unlock and it's getting very difficult to find something new. The progress bar indicates there would be ~50 more things which seems ... incredible.

Anybody knows?

[-] troed@fedia.io 75 points 4 months ago

All RTO mandates are about firing people without saying it loud for the financial markets.

[-] troed@fedia.io 83 points 1 year ago

At least two, maybe all three, of those photos are from Norway.

But yeah, we have the same word. I do believe the fact that our trains have "slutstation" to be funnier.

[-] troed@fedia.io 79 points 1 year ago

We southern Swedes will never forgive you for forcing us to close down our perfectly working nuclear plant out of your irrational fears.

[-] troed@fedia.io 104 points 1 year ago

It's a list from 2021 and as a cybersec researcher and Jellyfin user I didn't see anything that would make me say "do not expose Jellyfin to the Internet".

That's not to say there might be something not listed, or some exploit chain using parts of this list, but at least it's not something that has been abused over the last four years if so.

195
submitted 1 year ago by troed@fedia.io to c/globalnews@lemmy.zip

74% of Ukrainians support fighting Russia even without U.S. assistance. A significant majority—59% of respondents—also believe that Ukraine can defeat Russia on the battlefield

only 6% of respondents said they were willing to make territorial concessions regarding areas occupied by Russia after the full-scale invasion in 2022

Additionally, 70% of respondents are against lowering the mobilization age,

Original article is paywalled, quotes from https://ukrainetoday.org/74-of-ukrainians-ready-to-resist-russia-without-u-s-aid-support-zelenskyys-actions/

[-] troed@fedia.io 81 points 1 year ago

Seems like a no, unfortunately.

Because, if the publishing history for von Braun’s book on Wikipedia is correct, then Errol couldn’t have heard about it when he was a child and used it as the basis for naming his son – as it wasn’t actually published until well after Elon Musk was born.

(It should be noted that the technical appendix to the book, which contained the specifications for the novel’s expedition to Mars, was published earlier: in Germany in 1952, and in English the following year – however, this appendix as it appears in Project Mars does not contain any mention of ‘Elon’.)

https://www.dailygrail.com/2024/12/did-elon-musks-father-confirm-that-he-was-named-after-the-martian-leader-in-a-science-fiction-novel-by-a-nazi-rocket-scientist/

[-] troed@fedia.io 116 points 1 year ago

... the billionaire proof version of Bluesky is ... Mastodon.

[-] troed@fedia.io 234 points 2 years ago

We're seeing a substantial increase on the Mastodon instance I help moderate too, but there's no aggregate marketing department at Mastodon so we don't get any headlines.

71
submitted 2 years ago by troed@fedia.io to c/privacy@lemmy.ml

Swedish author and famous pro-Ukraine blogger Lars Wilderäng (Cornucopia) reports today that the Swedish security expert Karl Emil Nikka has revealed that Kagi is using the Kremlin propaganda tool Yandex as a backend for searches.

Wilderäng speculates this might mean search terms are leaking to Russia, while others worry about how Kremlin thus can get their talking points into western search results.

Security expert Karl Emil Nikka tells us that the search engine Kagi, popular among tech geeks, uses Russian Yandex, which was introduced after the full-scale invasion. This, of course, gives Russia the opportunity to look at what is searched for via Kagi.

Link (in Swedish), see 11:22 update: https://cornucopia.se/2024/10/uppdateras-ryssland-medger-bruk-av-c-stridsmedel-mot-ukraina-rysk-pilot-som-mordade-68-ukrainare-ihjalslagen-med-hammare-bland-de-allra-storsta-ryska-forlusterna-under-kriget-igar/

[-] troed@fedia.io 84 points 2 years ago

Despite fixing the issue, Zendesk ultimately chose not to award a bounty for my report. Their reasoning? I had broken HackerOne's disclosure guidelines by sharing the vulnerability with affected companies

Regardless of everything else they should be kicked out from HackerOne since it's clearly Zendesk not being truthful here.

[-] troed@fedia.io 113 points 2 years ago

Well I mean murdering someone breaks the very definition of libertarianism so you can be very sure they're just using that moniker because it fits whatever they're really trying to accomplish.

the maximum freedom for each individual to follow his own ways, his own values, as long as he doesn't interfere with anybody else who's doing the same.

https://www.hoover.org/research/take-it-limits-milton-friedman-libertarianism

But you're absolutely right that a lot of people who are today clearly cheering for fascists used to call themselves libertarians.

[-] troed@fedia.io 93 points 2 years ago

As a dad I care about the subjects my kids care about, so I'm able to take a genuine part in their (almost completely online) lives.

[-] troed@fedia.io 163 points 2 years ago

Maybe it's time to move on from using SSNs for security? We have someting similar in Sweden - "person numbers". If I call the tax authority and ask for someone's "person number" they will tell me. They're not secret in any way, and thus not used as some form of authentication either.

[-] troed@fedia.io 147 points 2 years ago

No shit. My lease on the Model 3 I got in 2020 is up in a few months and the requirements we had for the replacement was "anything but Tesla".

(which turned out to be a VW ID.7)

view more: next ›

troed

joined 3 years ago