Frequently asked
Questions, answered honestly.
The questions that come up the most. If yours isn't here, email us — we read every one.
Product
What is MingLLM, in one sentence?
A personal-intelligence runtime — voice (Jarvis), browser (Tensor), and CLI (Tensor Code) — that runs locally on your Mac, leaves a receipt for every action, and does nothing without your permission.
Is this a chat app?
No. There is no chat window. MingLLM is an agent that lives behind the surfaces you already use — your inbox, your browser, your terminal. The voice ear is one shortcut away; the rest is silent until you ask.
What can it actually do today?
Read your inbox and triage it. Answer across open tabs with citations pinned to the source. Move calendar events. Run shortcuts. Open apps. Read your repo and propose diffs. Every action is shown to you before it fires.
Local-first
Does it really run on my machine?
Yes. The model is on your disk. Inference happens on your Apple Silicon. Your memory is stored in MingLLM's app sandbox. The default state has zero network access: turn off Wi-Fi and MingLLM still works for everything that doesn't reach out.
What about cloud calls?
Off by default. If you want a remote tool — search the web, send mail through your provider, hit a third-party API — you grant that one capability and MingLLM shows you each request before it fires. You can revoke any of those grants from one screen.
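For the technically curious, the grant-then-confirm flow described above can be sketched in a few lines of Python. This is an illustration only, not MingLLM's actual code: the `CapabilityGate` class, its method names, and the capability string `"web_search"` are all hypothetical, and the receipt list is just an assumption about how each request might be recorded.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Callable


@dataclass
class CapabilityGate:
    """Hypothetical permission gate: deny by default, confirm every request."""

    # Callback that shows the request to the user and returns their decision.
    confirm: Callable[[str, str], bool]
    # Capabilities the user has explicitly granted; empty by default.
    granted: set[str] = field(default_factory=set)
    # One (capability, description, outcome) entry per attempted request.
    receipts: list[tuple[str, str, str]] = field(default_factory=list)

    def grant(self, capability: str) -> None:
        self.granted.add(capability)

    def revoke(self, capability: str) -> None:
        self.granted.discard(capability)

    def request(self, capability: str, description: str) -> bool:
        """Return True only if the capability is granted AND the user approves."""
        if capability not in self.granted:
            outcome = "blocked"
        elif self.confirm(capability, description):
            outcome = "allowed"
        else:
            outcome = "declined"
        self.receipts.append((capability, description, outcome))
        return outcome == "allowed"


# Usage: auto-approve stands in for a real confirmation prompt.
gate = CapabilityGate(confirm=lambda cap, desc: True)
first = gate.request("web_search", "query: weather")   # nothing granted yet
gate.grant("web_search")
second = gate.request("web_search", "query: weather")  # granted and confirmed
gate.revoke("web_search")
third = gate.request("web_search", "query: weather")   # grant was revoked
print(first, second, third)
```

The point of the sketch is the ordering of the checks: a request that was never granted is blocked before any confirmation prompt appears, and revoking a grant returns that capability to the blocked state.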
Do you train on my data?
No. We have no pipeline that exfiltrates your local memory. The model on your machine is the model we shipped — your data stays on your machine.
Practical
What hardware do I need?
An Apple Silicon Mac (M1 or newer), with 16 GB of unified memory recommended. An 8 GB mode is on the roadmap for Q3 2026. Disk: about 8 GB for the base model and your local memory. Intel Macs work, but more slowly.
When are Windows + Linux coming?
After v1 stabilizes; the current target is 2027. We built for macOS first to keep scope tight. The underlying runtime is portable, but the system-integration layers are not yet.
How much does it cost?
MingLLM is in invite-only preview today. Pricing for v1 is not finalized — but the local-first design means the unit economics aren't dominated by inference cost, which gives us room to do something fairer than per-token billing.
About us
Who's building this?
A small team, currently funded by Yiming Beckmann and a small group of pre-seed investors. Hiring is open for research, client engineering, and model training — see the team page.
Why MingLLM and not a frontier lab?
Frontier scale is a liability when your product lives on the user's device. We chose the constraint, then built every layer — training, model, runtime, product — to compound on that decision. That's the moat.