Glossary

GammaInfra glossary — definitions of the concepts and terminology used throughout the documentation. Each entry is a standalone 400-800 word explainer with a direct-answer definition, body sections, and developer-question FAQ.

Terms

Common questions about this glossary

What does this glossary cover?
The vocabulary used throughout GammaInfra's docs — LLM gateway, LLM router, OpenAI-compatible API, task-aware routing, BYOK, fallback chain, hedged requests, cost-quality dial, generative engine optimization, and the Bedrock cross-region inference profile. Each entry is 400–800 words with a direct-answer definition, body sections, and a developer-question FAQ.
How do these terms relate to each other?
The LLM gateway is the system; the LLM router is the decision component inside it; task-aware routing is the most common router strategy; the cost-quality dial is a caller-side preference signal to the router; the fallback chain is the recovery cascade; hedged requests are a parallel-race optimization; BYOK is an alternative billing mode; OpenAI-compatible API is the wire format that gateways implement. The Bedrock cross-region inference profile is an AWS-specific quirk that matters for newer Claude and Nova models.

Last updated 2026-05-15.