llmdeal.me is an OpenAI-compatible LLM API gateway. Drop in your key, set model to
smart-route, and each request lands on the cheapest model that can handle it:
Qwen-Coder-32B for boilerplate, Groq Llama and DeepSeek-V3 for mid-complexity, Mistral
Codestral for code, Qwen3-235B for reasoning. GPU capacity in the US and the EU — EU-resident routing available as a privacy option. Crypto checkout. No KYC.
Preorder in BTC — +30% bonus credits through Wed 20 May 2026. $20 buys $26 in credits. $100 buys $130. Credits activate at launch (Mon 18 May).
$ curl https://api.llmdeal.me/v1/chat/completions \
  -H "Authorization: Bearer $LLMDEAL_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "smart-route",
    "messages": [{"role": "user", "content": "refactor this function"}]
  }'
# Routed to qwen-coder-32b — $0.0004 instead of $0.012 on Sonnet
Most teams send every query to Sonnet or GPT-4o. A third of those are formatting, syntax questions, regex explanations, or boilerplate generation — tasks a 32B coding model handles just as well at <5% of the cost.
Every request hits Claude Sonnet 4.6 at $3 input / $15 output per 1M tokens. You pay frontier rates for the long tail of work that doesn't need it.
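Back-of-the-envelope, the savings claim above works out like this. A sketch with normalized costs, assuming exactly one third of traffic routes cheap at 5% of the frontier price (both figures from the paragraph above; real traffic mixes will vary):

```python
# Illustrative blended-cost estimate using the figures above.
# Assumption: 1/3 of queries route to a cheap model at 5% of frontier
# price; the remaining 2/3 stay at frontier rates.
frontier = 1.0          # normalized frontier cost per query
cheap_share = 1 / 3
cheap_cost = 0.05 * frontier

blended = (1 - cheap_share) * frontier + cheap_share * cheap_cost
savings = 1 - blended
print(f"blended cost: {blended:.3f}x frontier, savings about {savings:.1%}")
```

Roughly a third off the bill without touching the two-thirds of queries that genuinely need a frontier model.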
Boilerplate → our Qwen-Coder-32B (~$0.80 in / $1.50 out per 1M tokens) on our own GPU. Fast queries → llama-3.3-70b-self-hosted, also on our own GPU. Reasoning → DeepSeek-V3 / Qwen3-235B on Cerebras. Code → Mistral Codestral. Choose EU-resident routing and all inference stays in the EEA — or use our US capacity; your call.
No magic, no VC hand-waving. Just boring infrastructure decisions that add up.
An open-source classifier scores each request by difficulty and routes it to the cheapest model that can handle it. Thresholds are tunable per workload.
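A hypothetical sketch of what threshold-based routing looks like. The model names match the gateway's public mix, but `score_difficulty` and the threshold values are invented placeholders, not llmdeal's actual classifier:

```python
# Hypothetical sketch of difficulty-threshold routing. The scorer and
# thresholds are placeholders; a real RouteLLM-style classifier is a
# trained model, not a keyword check.
def score_difficulty(prompt: str) -> float:
    """Stand-in scorer returning 0.0-1.0 (higher = harder)."""
    hard_markers = ("prove", "architecture", "optimize", "concurrency")
    return min(1.0, 0.2 + 0.2 * sum(m in prompt.lower() for m in hard_markers))

# (max difficulty, target model) pairs, tunable per workload.
ROUTES = [
    (0.3, "qwen-coder-32b"),
    (0.6, "llama-3.3-70b-self-hosted"),
    (0.8, "deepseek-chat"),
    (1.0, "qwen3-235b"),
]

def route(prompt: str) -> str:
    score = score_difficulty(prompt)
    return next(model for cap, model in ROUTES if score <= cap)

print(route("fix this regex"))  # low difficulty, lands on the cheap model
```

Tightening a threshold shifts more traffic to the cheaper tier; loosening it buys headroom for borderline queries.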
Qwen2.5-Coder-32B runs on our own GPU hardware. Flat hardware costs, no per-token margin to a cloud provider — so we undercut every commodity Llama-70B reseller.
llama-3.3-70b-self-hosted — LIVE on our GPU
Now live: llama-3.3-70b-self-hosted running on our own GPU — no upstream fees, same API key as the rest of the gateway. Choose EU-resident routing and it stays EEA end-to-end. In the Pro routing mix today.
We run GPU capacity in both the US and the EU. Open to developers worldwide. Want your inference in the EEA? Choose EU-resident routing and GDPR jurisdiction applies end-to-end. Prefer lower-latency US capacity? That's available too.
Pay in BTC, XMR, or LTC. No identity check, no card decline, no chargeback exposure. We don't know who you are and don't need to.
We keep only what billing requires: contact handle, order ID, dollar amount, token counts. We do not store prompt content, response content, or IPs past settlement. GDPR-compliant by default — same rules apply worldwide, not just for EU users.
Pay-as-you-go in crypto. No subscription. No credits that expire. Rates lock in at public launch Mon 18 May 2026 (GMT+2).
llama-3.3-70b-self-hosted + DeepSeek + Codestral + Qwen3-235B
Pay in crypto now. Lock in 30% extra credits, get a beta key the day the gateway opens. No KYC. No subscription. Credits don't expire.
After Mon 18 May 2026, these prices are gone as a standing offer — the same model mix will only be available through consortium-tier deals with added profit margin. The honest version: if preorder volume doesn't fund the GPU, the window closes and that's it.
$50 pack on the buy page. Need a custom amount? DM me.
Larger commit, larger perks. Each tier directly funds the next GPU node.
router_logic.py when it open-sources
25 slots remaining
10 slots remaining
Every dollar is earmarked against a specific threshold. Hit it, that infra deploys. Public counter, no spin.
Counter updates on each page load. No polling loop — we're not here to burn your battery.
Three channels. Matrix is preferred (E2EE, federated, no phone number required). Telegram works. Discord works now but the invite rotates before 18 May — preorder backers get the new link DM'd directly.
<owner-matrix-handle>
— DM'd to backers + published here before 18 May.
<owner-telegram-handle>
— same: DM'd to backers + published here before 18 May.
Drop your contact. I'll reach out once when the gateway opens — no newsletter, no drip sequence, one message.
One DM when the gateway opens. Nothing before that, nothing after.
We operate under GDPR (Norway / EEA) and apply the same rules to every customer, regardless of where you are. No legal boilerplate — just the facts.
Order record. Order ID, SKU, currency, amount, status, timestamps. Append-only ledger — needed to credit your account at launch.
Contact handle. The email or messaging handle you provide at checkout. Used only to deliver your API key and billing notifications.
Token counts. Once live, we log input + output token counts per request for billing. The prompt content is never stored.
Prompt content. Prompts and responses are discarded when the request completes. No training. No content audit log.
KYC / identity data. No name, address, government ID, or card details — ever. Crypto only.
IP addresses. Held transiently for fraud and rate-limit checks, deleted within 24 h of order settlement.
Want your data deleted? DM us from the handle you signed up with. We remove the order record within 48 hours and send you a deletion timestamp. GDPR Article 17 ("right to be forgotten") — extended to every customer, EU resident or not. Full policy: /privacy.html.
Straight answers. More added as beta progresses.
The gateway isn't fully live yet — GPU nodes are being provisioned. Preorders fund the hardware deposit and signal real demand. In return: +30% bonus credits baked into every preorder pack, plus priority onboarding when the gateway opens.
Pay $X in BTC today, get $X × 1.30 in llmdeal credits at launch. Credits never expire. No subscription follows.
Target: Monday 18 May 2026, GMT+2 (Central European Summer Time). GPU is being provisioned this week; smart routing and gateway go live on that date.
If launch slips past 2026-06-01, every preorder is refundable in BTC to the address you paid from. No questions — DM me on Telegram / Signal.
Status updates go to your DMs (whatever handle you give at checkout) — at least one before launch day.
Full refund. BTC value back to your payment address, on request, any time before launch.
After launch, credits stay refundable too — gated by recorded usage. See the money-back guarantee below.
The tier prices on /pricing.html are the preorder window. After public launch on 18 May, the same model mix will likely only be reachable through consortium-tier deals with added profit margin — not as standing retail pricing on this page.
The honest version: if significant preorder volume doesn't arrive, the GPU doesn't get funded, the gateway doesn't open at these prices, and the window closes. No preorders → no llmdeal at this price level, and you're welcome to keep paying frontier providers what they ask. That's the deal.
Full refund available on every order ever placed, gated only by the cumulative usage time recorded on your account. Under 3 hours of total recorded usage across all orders? Refundable on request — DM the founder. Past 3 hours, the service is considered consumed.
Refunds are per-second prorated against recorded usage, minus the on-chain crypto network fee. For BTC refunds you cover the network fee in fiat upfront — we don't deduct it from the refunded principal; the BTC value you paid comes back to your payment address. XMR + LTC refunds net the fee on-chain.
This isn't a 14-day trial gimmick — the clock runs on actual usage, not calendar time. Preorder, hold credits, never call the API? You can refund a year later.
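One plausible reading of the proration rule above, sketched with invented function names. The 3-hour gate and the full-principal BTC refund come from the page; the linear per-second schedule across that window is our assumption:

```python
# Illustrative refund sketch. Assumption: the refund decays linearly,
# per second, across the stated 3-hour usage window. The actual
# proration schedule is not specified on the page.
USAGE_CAP_S = 3 * 3600  # past this, the service counts as consumed

def refund_amount(paid: float, used_seconds: int) -> float:
    """Prorated refund in payment-currency value. For BTC the network
    fee is paid separately in fiat, so principal is not reduced here."""
    if used_seconds >= USAGE_CAP_S:
        return 0.0
    return paid * (1 - used_seconds / USAGE_CAP_S)

print(refund_amount(100.0, 0))      # credits held, API never called: full refund
print(refund_amount(100.0, 5400))   # 1.5 h of recorded usage: half back
```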
Yes — check before you preorder, not after. Every model in our Pro mix has a public per-token price and an independent benchmark score on third-party leaderboards with no financial stake in llmdeal.me:
If our Pro mix isn't on those leaderboards at the prices we quote, the refund guarantee applies — see above.
Within 7 days of the public counter crossing $1k. The marketing milestone flips the moment it's crossed; the A6000 spins up shortly after.
Honest caveat: we apply a small internal safety buffer — slightly above the public threshold plus a check that real customers, not a single whale, drove the number — before wiring up the A6000. That prevents one $500 Founder order followed by a refund from triggering a month of GPU rental we can't sustain. The public milestone still flips green the moment the public threshold is met.
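A hypothetical version of that deploy check. The 10% buffer and the "no single order over half the total" rule are invented numbers for illustration; the page only commits to "slightly above the threshold" and "not a single whale":

```python
# Hypothetical deploy gate. Buffer size (10%) and whale rule (no single
# order above half the total) are invented; the page does not publish
# the internal values.
def ready_to_deploy(orders: list[float], public_threshold: float) -> bool:
    total = sum(orders)
    buffered = total >= public_threshold * 1.10      # small safety margin
    no_whale = bool(orders) and max(orders) <= total / 2
    return buffered and no_whale

print(ready_to_deploy([400.0, 400.0, 400.0], 1000.0))  # broad demand: deploy
print(ready_to_deploy([1200.0], 1000.0))               # one whale: hold off
```

The point of the second condition: a single $1.2k order followed by a refund shouldn't commit a month of GPU rental.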
No. We run our own model (Qwen2.5-Coder-32B) on our own GPU. We smart-route to Groq, Together, Cerebras, and DeepSeek when a query needs more horsepower than our model can give.
The goal isn't to replace frontier models. It's to not pay frontier prices for queries that don't need them.
Every request hits a small open-source classifier (RouteLLM-style) that scores difficulty.
Easy queries (formatting, simple regex, syntax fixes) → our Qwen-Coder-32B on our own GPU.
Fast workhorse queries → llama-3.3-70b-self-hosted on our own GPU.
Reasoning queries → DeepSeek-V3.
Coding-heavy queries → Mistral Codestral.
Highest-difficulty queries → Qwen3-235B on Cerebras (frontier OSS-class).
Override per-request by passing any specific model name (e.g. model: "deepseek-chat"). The router only activates when you set model: "smart-route".
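Because the gateway is OpenAI-compatible, an override is just a different "model" string in the same request body. A minimal sketch of the two payloads (the helper function is ours, not part of the API):

```python
# Same OpenAI-style request body; only the "model" field changes.
# chat_payload is a local convenience helper, not a gateway API.
def chat_payload(model: str, prompt: str) -> dict:
    return {"model": model, "messages": [{"role": "user", "content": prompt}]}

routed = chat_payload("smart-route", "refactor this function")    # router decides
pinned = chat_payload("deepseek-chat", "refactor this function")  # bypasses router
print(pinned["model"])
```

Any OpenAI-compatible client can send either payload; nothing else in the request changes.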
Short version: contact handle and order record are stored. Prompts and responses are not retained. Full breakdown in the Privacy & data section above.
Our own models run on our GPU hardware — GDPR compliance applies. We don't log prompt content. We keep token counts for billing only.
When queries route to open-weight providers (Groq, Cerebras, Together, DeepSeek's own API), those requests go to them under their respective data policies — the same as if you called them directly. None of these providers train on API traffic by policy.
The Elite tier uses EU-resident routing by default — when that's active, requests stay within EEA jurisdiction end-to-end. US capacity is switchable on request.
Yes. We serve developers worldwide — no geographic restrictions on signup. We run GPU capacity in both the US and the EU. "EU-resident inference" is a privacy option (the default on Elite), not a limit on who can sign up or where compute runs.
We apply GDPR-level handling to every customer regardless of location. Varying it per-jurisdiction is operational overhead we don't want.
Three reasons. One: credit cards require KYC and we won't ask for it. Two: chargebacks on usage-based products are a nightmare. Three: devs paying for an API shouldn't need to identify themselves.
We accept BTC (auto-checkout), XMR and LTC (semi-manual — DM us, we send a one-time address, you pay, we credit your account within 1-4 hours).
The router classifier and gateway code will be open-sourced once the gateway is stable after launch (launch target: Mon 18 May 2026). The marketing site and inference stack are private.
The underlying model (Qwen2.5-Coder-32B) is Apache 2.0 — Alibaba's open release. We didn't train it; we serve it.
An EEA-based independent operator on owned bare-metal infrastructure. No VC, no team, no roadmap deck. Pricing is honest because the cost structure is honest.
Public launch Monday 18 May 2026 (GMT+2). Preorders are open now — every BTC backer locks in +30% bonus credits and a beta key delivered on launch day.