v1.0.0 — Generally Available
Your local AI workstation, no cloud required.
Run llama.cpp, MLX, vLLM, and ONNX models on your hardware. Orchestrate agents, build RAG pipelines, and ship OpenAI-compatible APIs — all from one open-source platform.
macOS · Windows · Linux · Docker
Built for production
5
Inference engines
31
Database tables
100
Build batches
14
Packages
< 50 ms
API gateway p50
Pricing
Start free, scale on your terms.
Local
Free
Single user, local machine only.
Download- All inference engines
- OpenAI-compat API
- Chat + Prompt Studio
- Local RAG
- Community support
Pro
$12/mo
Power users and solo developers.
Get started- Everything in Local
- Remote model sync
- Agents + Workflows
- Priority support
- Marketplace access
Team
$49/mo
Up to 10 seats, shared workspace.
Get started- Everything in Pro
- Multi-user workspace
- Role-based access
- SSO (OIDC)
- Audit logs
Enterprise
Custom
Air-gapped, unlimited seats.
Contact us- Everything in Team
- Air-gap mode
- Custom SLA
- On-prem deployment
- Dedicated support
Own your AI stack.
Open-source, self-hosted, no telemetry. Your models, your data, your infrastructure.
Download — it's free