1. Choose a plan
Pick a monthly subscription or a one-shot credit pack. Same API key, same models — credits and subscription can stack.
Monthly subscription
$75 / mo
Starter · 15M tokens included · overage $5/M
$150 / mo
Pro · Most popular
50M tokens included · overage $4/M · smart routing across 6 models · FIDO2 required (BYO)
$200 / mo
Elite · 100M tokens included · overage $3/M · 128k context · FIDO2 required (BYO)
$499 / mo
Business · 300M tokens · overage $2/M · reserved capacity · priority DM support
$1,999 / mo
Scale
1.5B tokens · overage $1.50/M · 99.5% SLA · dedicated founder channel · annual prepay $19,990/yr
Entry-level — under $100 (BYO add-on $19/mo)
Low-volume single-use-case packages. BYO Keys add-on stacks at $19/mo. BYO included free on every $100+ plan below.
$5 one-shot
$5 Trial Pack · ~5M tokens · smart-route-fast · BTC minimum entry · credits never expire
$9 / mo
Hobbyist · 2M tokens · Llama 3.1 8B · personal projects + learning
$29 / mo
Solo Dev · 10M tokens · smart-route + smart-route-fast · for freelancers
$49 / mo
Mix Pack · best value 25M tokens · smart-route + smart-route-coder + smart-route-fast · for side projects
$59 / mo
Vision Pack · 15M tokens · multimodal Llama-4-Maverick + smart-route-fast · for designers / image apps
$19 / mo
BYO Keys add-on
Stacks on any sub-$100 base · adds Anthropic / OpenAI / Google / OpenRouter / Ollama routes via your own keys · INCLUDED FREE on $100+ plans
Specialty bundles — narrower model menu
Pick a specialty if you know exactly what you're routing. Pro covers the general case at $150/mo.
$69 / mo
Fast Pack · 50M tokens · Llama 3.1 8B + GPT-OSS-120B · real-time chat / agents / IDE autocomplete
$99 / mo
Coder Pack · 30M tokens · Qwen3-Coder-480B + Llama 3.3 70B · for devs / coding agents / IDE plugins
$129 / mo
Reasoner Pack · 25M tokens · Qwen3-235B + Nemotron-Super-120B · multi-step reasoning / research
$249 / mo
EU-Sovereign Pack · 60M tokens · 100% EU-resident · GDPR Article 28 DPA · 90-day audit log
$249 / mo
US-Sovereign Pack · 60M tokens · 100% US-resident · US legal jurisdiction · 90-day audit log · deliverable today
$100 / mo
Stack · BYO Ollama + BYO OpenRouter key + consortium free · unified routing layer
Add-on — stacks on any subscription
$89 / mo
Frontier Credits add-on
5M tokens / mo · Claude Sonnet 4.6 / Opus 4.7 / GPT-5.5 via Anthropic Agent SDK lane · stacks on any base plan · live 2026-06-15
$99 / mo
Consortium Pro
Unlocks consortium-routed open-weights at 25-50% discount (community-hosted GPU pool). Full Pro tier underneath. Automatic fall-back to LiteLLM if no consortium box is healthy. Annual $990/yr (2 mo free).
Production — small-team & up
$649 / mo
Fleet · small-team production · priority queue · 5 project keys · max_parallel=20 / rpm=120 / tpm=200k · all open-weights + BYO included · annual $6,490/yr
$1,199 / mo
Fleet Plus · Fleet + 60M frontier credits/mo (Sonnet/GPT-5.5/Gemini Pro) · max_parallel=40 / rpm=240 / tpm=400k · 10 project keys · cache-priority · 24h-priority support · annual $11,990/yr
$129 one-time
Burst Day · 24 hours at Fleet-tier limits · 8M tokens any model (frontier included) · stacks with any subscription · no auto-renew
Or pay as you go — credit packs
Credits never expire. Burn at per-1M rates: Starter $0.60/$1.20 · Pro $2.00/$5.00 · Elite $4.00/$9.00 (input/output).
$20
~22M tokens on Starter · ~5M on Pro
$50
~55M on Starter · ~14M on Pro · ~7M on Elite
$100
~111M on Starter · ~28M on Pro · ~15M on Elite
$250
~278M on Starter · ~71M on Pro · ~38M on Elite
$500
~556M on Starter · ~143M on Pro · ~77M on Elite
$1000
~1.1B on Starter · ~286M on Pro · ~154M on Elite
BTC is instant and self-serve. XMR / LTC trigger an operator DM — typical
turnaround 1–4 hours. Subscriptions renew monthly (cancel anytime). Credits never expire.
Read the FAQ .