16

cross-posted from: https://lemmygrad.ml/post/8207089

Jan-nano is a model fine-tuned with DAPO on Qwen3-4B. Jan-nano comes with some unique capabilities:

  • It can perform deep research (with the right prompting)
  • It picks up relevant information effectively from search results
  • It uses tools efficiently

The model was evaluated using SimpleQA - a relatively straightforward benchmark to test whether the model can find and extract the right answers.

Jan-nano outperforms Deepseek-671B on this metric, using an agentic and tool-usage-based approach. A 4B model obviously has its limitations, but it's interesting to see how far these things can be pushed. Jan-nano can serve as your self-hosted Perplexity alternative on a budget.

You can find the model at: https://huggingface.co/Menlo/Jan-nano

And a gguf is available at: https://huggingface.co/Menlo/Jan-nano-gguf

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here
this post was submitted on 15 Jun 2025
16 points (94.4% liked)

technology

23859 readers
177 users here now

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

Rules:

founded 5 years ago
MODERATORS