Writing
Blog.
Notes on how MingLLM works, what we learned building it, and the decisions behind the product.
EngineeringApr 17, 20262 min
Making 4B models feel like 400B on your laptop
Three moves that matter: expert iteration, selective generation, and tight tool-use shaping. How we get to 94.9%.
Read
ProductApr 1, 20262 min
Why Jarvis does not have a chat window
A chat UI is the wrong default for a voice-first agent. We went looking for a better one.
Read
ResearchMar 19, 20261 min
The Orb, one month in
What we learned from the first four weeks of persistent memory in private beta.
Read
CompanyMar 4, 20262 min
Why we are local-first
Cloud-first is a choice disguised as gravity. Here is what we are choosing instead.
Read
EngineeringFeb 13, 20261 min
Tensor: from idea to extension in six weeks
A short postmortem of how we built the first Tensor alpha, what broke, and what we cut.
Read
All posts by Yiming Beckmann. Specific numeric claims reflect in-progress evaluations and may shift between draft and published research.