All EU. All open-source. All private.

One API. Every model. Guaranteed European.

40+ open-source models, every request processed in EU data centres, zero data retention. Drop-in OpenAI replacement.

Get Started View Docs
# Drop-in replacement. Change two lines.

from openai import OpenAI

client = OpenAI(
  base_url="https://router.vokt.ai/v1",
  api_key="sk-vokt-your-key"
)

response = client.chat.completions.create(
  model="qwen3-235b",
  messages=[{"role": "user", "content": "Hello"}]
)

# Works with any OpenAI SDK. Python, Node, Go, Rust.
# Every request stays in EU. Nothing stored.

Three steps. Five minutes.

Get your key

Top up credits and receive your API key instantly. No lengthy onboarding, no sales calls, no enterprise minimum.

Call the API

Same OpenAI format you already use. Change the base URL and key. Your existing code works without modification.

Pay for what you use

Pay per token. No monthly minimum, no per-seat charges, no surprises.

29+ language models ยท 40 models total

Every model runs in Europe.

Hosted across Finland, France, Sweden. No US routing, no China routing. Prices shown in USD per 1M tokens โ€” Router automatically picks the cheapest EU host for each call.

ModelRegionInput / 1MOutput / 1MBest for
mistral-nemo๐Ÿ‡ซ๐Ÿ‡ท$0.02$0.02Long-context Mistral
gpt-oss-20b๐Ÿ‡ซ๐Ÿ‡ฎ ๐Ÿ‡ซ๐Ÿ‡ท$0.05$0.17Cheapest. Background tasks.
nemotron-nano-30b๐Ÿ‡ซ๐Ÿ‡ฎ$0.07$0.28Fast routing, extraction
qwen3-coder-30b๐Ÿ‡ซ๐Ÿ‡ท$0.07$0.25Code generation
gpt-oss-120b๐Ÿ‡ซ๐Ÿ‡ฎ ๐Ÿ‡ซ๐Ÿ‡ท$0.09$0.46General purpose, tool use
qwen3-32b๐Ÿ‡ซ๐Ÿ‡ฎ ๐Ÿ‡ซ๐Ÿ‡ท$0.09$0.26General purpose
mistral-small-3.2๐Ÿ‡ซ๐Ÿ‡ท$0.10$0.32Vision + text
mistral-7b๐Ÿ‡ซ๐Ÿ‡ท$0.11$0.11Compact, fast
qwen3-30b๐Ÿ‡ซ๐Ÿ‡ฎ$0.11$0.34Chat, reasoning
llama-3.3-70b๐Ÿ‡ซ๐Ÿ‡ฎ ๐Ÿ‡ซ๐Ÿ‡ท ๐Ÿ‡ธ๐Ÿ‡ช$0.14$0.44Reliable all-rounder
hermes-4-70b๐Ÿ‡ซ๐Ÿ‡ฎ$0.15$0.46Memory, learning agents
qwen3-next-80b-thinking๐Ÿ‡ซ๐Ÿ‡ฎ$0.17$1.38Deep reasoning, thinking
intellect-3๐Ÿ‡ซ๐Ÿ‡ฎ$0.23$1.26Open reasoning
llama-3.1-8b๐Ÿ‡ซ๐Ÿ‡ท$0.23$0.23Edge, low-latency
mistral-small๐Ÿ‡ซ๐Ÿ‡ท$0.23$0.69Balanced reasoning
qwen3-235b๐Ÿ‡ซ๐Ÿ‡ฎ ๐Ÿ‡ซ๐Ÿ‡ท$0.23$0.69Complex reasoning, long context
gemma-3-27b๐Ÿ‡ซ๐Ÿ‡ท$0.29$0.57Compact Google open model
minimax-m2.5๐Ÿ‡ธ๐Ÿ‡ช$0.32$1.26MoE, 200k context
deepseek-v3.2๐Ÿ‡ซ๐Ÿ‡ฎ$0.34$0.52Efficient reasoning + code
holo2-30b๐Ÿ‡ซ๐Ÿ‡ท$0.34$0.80Conversational AI
devstral-2-123b๐Ÿ‡ซ๐Ÿ‡ท$0.46$2.30Large code model
kimi-k2๐Ÿ‡ซ๐Ÿ‡ฎ$0.57$2.76Agentic, tool orchestration
kimi-k2.5๐Ÿ‡ธ๐Ÿ‡ช$0.57$2.76Agentic, 256k context
glm-4.5๐Ÿ‡ซ๐Ÿ‡ฎ$0.69$2.53Bilingual Chinese-English
qwen3.5-397b๐Ÿ‡ซ๐Ÿ‡ท$0.69$4.14Frontier multimodal MoE
deepseek-r1-distill-70b๐Ÿ‡ซ๐Ÿ‡ท$1.03$1.03Reasoning distill
hermes-4-405b๐Ÿ‡ซ๐Ÿ‡ฎ$1.15$3.45Maximum capability
glm-5.1๐Ÿ‡ธ๐Ÿ‡ช$1.61$5.06Long-context multilingual
mistral-large๐Ÿ‡ซ๐Ÿ‡ท$2.30$6.90Frontier Mistral

Plus 5 embedding models, 2 vision models, 1 speech-to-text models, 2 image generators, and 1 safety classifier(s). Full list via the /v1/models endpoint.

Your data never crosses an ocean.

EU data residency

Every request processed in Finland, France, Sweden. No US routing. No China routing. Architecturally guaranteed.

Zero data retention

Inputs and outputs are never stored, logged for training, or routed to non-EU systems. Vokt holds the upstream contracts so you only sign one DPA โ€” with us.

GDPR native

Built for European regulation, not retrofitted. SOC 2, HIPAA, ISO 27001 certified infrastructure. Custom DPAs available.

Automatic failover

Provider down? Transparent switch to the next EU provider. Same model, different infrastructure. No downtime for your application.

Cost-based routing

When the same model is available on multiple providers, we route to the cheapest one. You get the best EU price automatically.

OpenAI compatible

Same API format. Change two lines in your code. Works with every OpenAI SDK, LangChain, LlamaIndex, and any tool that supports custom endpoints.

EU data residency Zero data retention SOC 2 Type II HIPAA ISO 27001 GDPR
Simple pricing

Top up. Start building.

No monthly minimum. No per-seat fees. No hidden costs. Buy credits, use any model, pay per token.

$10
$10 credits
Pay per token
$50
$50 credits
Pay per token
$100
$100 credits
Pay per token

European AI inference. No compromises.

One API key. 40+ models. Every request stays in Europe. Start in five minutes.

Get Started