The router classifies every prompt by task and dispatches to the best-fit model — so you stop paying flagship rates for chat. Every major provider, one bill. 0% token markup. $3 free to start.
# pip install openai
from openai import OpenAI

client = OpenAI(
    base_url="https://api.gammainfra.com/v1",
    api_key="sk-gammainfra-...",
)

response = client.chat.completions.create(
    model="gammainfra/auto",
    messages=[{"role": "user", "content": "Write a binary search in Python"}],
)

# Response headers tell you exactly what served you:
# X-GammaInfra-Endpoint: anthropic/claude-sonnet-4-6
# X-GammaInfra-Cost-USD: 0.000087
# X-GammaInfra-Router-Version: v2
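The headers listed in the quick-start can be turned into a per-session cost total. The following is a minimal sketch, not GammaInfra's own client code: `RoutingInfo`, `parse_routing_headers`, and `session_cost` are hypothetical helper names, and the sketch assumes the cost header carries a plain decimal string, as in the example values above.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Iterable, Mapping


@dataclass(frozen=True)
class RoutingInfo:
    """Routing metadata read from one response (hypothetical helper type)."""
    endpoint: str
    cost_usd: Decimal
    router_version: str


def parse_routing_headers(headers: Mapping[str, str]) -> RoutingInfo:
    # HTTP header names are case-insensitive, so normalise the keys first.
    lowered = {key.lower(): value for key, value in headers.items()}
    return RoutingInfo(
        endpoint=lowered.get("x-gammainfra-endpoint", ""),
        # Decimal avoids float rounding when summing many tiny amounts.
        cost_usd=Decimal(lowered.get("x-gammainfra-cost-usd", "0")),
        router_version=lowered.get("x-gammainfra-router-version", ""),
    )


def session_cost(infos: Iterable[RoutingInfo]) -> Decimal:
    # Sum the per-request totals across a session.
    return sum((info.cost_usd for info in infos), Decimal("0"))
```

With the OpenAI Python SDK, the raw headers are reachable through `client.chat.completions.with_raw_response.create(...)`: the returned object exposes `.headers`, and `.parse()` yields the usual completion object. Passing those headers to `parse_routing_headers` after each call gives the inputs for `session_cost`.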
We charge on top-ups, not tokens.
Every prompt is classified by task type: code generation, reasoning, summarisation, multimodal, tool use, or chat.
Providers are scored on live p50 latency (30-second refresh, 5-min window), token cost, and per-task quality.
The request routes to the best-fit model. Every decision is logged and transparent.
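To make the scoring step concrete, here is a small illustrative sketch of ranking candidates on latency, cost, and per-task quality. The page does not publish the actual formula, so everything here is an assumption: the `Candidate` type, the weights, the normalisation, and the provider names are hypothetical and only show the general shape of a weighted score.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Candidate:
    name: str                      # hypothetical provider/model label
    p50_latency_ms: float          # live median latency
    cost_per_1k_tokens_usd: float  # token cost
    quality: float                 # per-task quality in [0, 1]


def score(c: Candidate, max_latency_ms: float, max_cost: float,
          weights: Tuple[float, float, float] = (0.2, 0.3, 0.5)) -> float:
    # Assumed weights: latency, cost, quality. Higher score is better.
    w_latency, w_cost, w_quality = weights
    latency_term = 1.0 - c.p50_latency_ms / max_latency_ms
    cost_term = 1.0 - c.cost_per_1k_tokens_usd / max_cost
    return w_latency * latency_term + w_cost * cost_term + w_quality * c.quality


def pick_best(candidates: Sequence[Candidate]) -> Candidate:
    if not candidates:
        raise ValueError("no candidates to score")
    # Normalise against the slowest and most expensive candidate;
    # fall back to 1.0 to avoid dividing by zero.
    max_latency = max(c.p50_latency_ms for c in candidates) or 1.0
    max_cost = max(c.cost_per_1k_tokens_usd for c in candidates) or 1.0
    return max(candidates, key=lambda c: score(c, max_latency, max_cost))
```

A real router would presumably pick different weights per task type, for example weighting quality more heavily for code generation than for chat; the fixed tuple here is only a placeholder.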
Latency stats refresh every 30 seconds. Provider health propagates to every worker. The router's picture of the upstream stays current — without touching your prompt content.
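One way to keep a median latency over a 5-minute window is a timestamped queue that evicts old samples on read. This is a sketch under stated assumptions, not the router's implementation: `LatencyWindow` is a hypothetical class, and the 30-second refresh is assumed to be a caller that invokes `p50()` on a timer. Only timing numbers are stored, never prompt content.

```python
import time
from collections import deque
from statistics import median
from typing import Deque, Optional, Tuple


class LatencyWindow:
    """Rolling window of latency samples for one provider (hypothetical)."""

    def __init__(self, window_seconds: float = 300.0) -> None:
        self.window_seconds = window_seconds
        self._samples: Deque[Tuple[float, float]] = deque()  # (timestamp, latency_ms)

    def record(self, latency_ms: float, now: Optional[float] = None) -> None:
        ts = time.monotonic() if now is None else now
        self._samples.append((ts, latency_ms))

    def p50(self, now: Optional[float] = None) -> Optional[float]:
        ts = time.monotonic() if now is None else now
        cutoff = ts - self.window_seconds
        # Drop samples that have aged out of the window.
        while self._samples and self._samples[0][0] < cutoff:
            self._samples.popleft()
        if not self._samples:
            return None  # no recent data for this provider
        return median(latency for _, latency in self._samples)
```

The `now` parameter exists so the window can be exercised deterministically in tests; production code would leave it unset and rely on the monotonic clock.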
Select a task type to see how GammaInfra routes. Chains are prioritised by quality with automatic fallback.
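The fallback behaviour of a prioritised chain can be sketched as a loop that tries each entry in order and records what was attempted. This is an illustration only: `call_with_fallback` and `AllProvidersFailed` are hypothetical names, and each chain entry is assumed to be a `(name, callable)` pair rather than a real provider client.

```python
from typing import Callable, List, Sequence, Tuple


class AllProvidersFailed(Exception):
    """Raised only when every entry in the chain has failed."""


def call_with_fallback(
    chain: Sequence[Tuple[str, Callable[[str], str]]],
    prompt: str,
) -> Tuple[str, List[str]]:
    attempted: List[str] = []
    errors: List[str] = []
    for name, call in chain:  # the chain is already ordered by priority
        attempted.append(name)
        try:
            # First success wins; the caller never sees earlier failures.
            return call(prompt), attempted
        except Exception as exc:
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))
```

Returning the list of attempted entries alongside the result mirrors the idea of reporting which chain entries were tried, so the caller can still audit a request that succeeded on a fallback.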
Access every major LLM through a single API. Direct-route to any provider, or let the router decide.
Pay only for what you use. Credits never expire.
gammainfra/auto for smart routing, gammainfra/fast to optimize for latency, or gammainfra/cheap to optimize for cost.
/v1/chat/completions. Replace https://api.openai.com/v1 with https://api.gammainfra.com/v1 in your existing OpenAI SDK code; every other field (messages, temperature, stream, tools, tool_choice, response_format) works as-is.
402 byok_balance_empty: never a silent fallback to managed credits.
X-GammaInfra-Cost-USD (total) plus X-GammaInfra-Input-Cost-USD and X-GammaInfra-Output-Cost-USD for the per-direction split. The X-GammaInfra-Endpoint header tells you which provider/model actually served the request. Sum these across a session to know exactly what your workload cost.
X-GammaInfra-Fallback-Chain response header. The customer never sees the failure unless every chain entry fails.
No credit card. $3 free credit. Verify your email, get your key, start in 60 seconds.
Smart routing built-in · Prompts never logged · Credits never expire · No subscriptions
By signing up you agree to the terms of service. We'll send you a one-time verification link. Signup is rate-limited to prevent abuse.
We sent a verification link to you. Click it to activate your account and get your $3.00 starting credit.
Didn't get it? Check your spam folder.