HYPERNYM×Freezone
routerislandssanctumtundraLIVE
MODELS
One OpenAI-compatible endpoint. Pass
model in the request body.4
Hypernym-enhanced
1
Self-hosted
2
Passthrough
NOMINAL
Router state
enhancedfastest
hypernym/llama-3.1-8b-compressed
Llama 3.1 8B · Compressed
semantic-compressionp-span-shearcost-optimized
Llama-3.1-8B routed through Hypernym semantic compression. Pre-summarises long contexts at the shear boundary so you pay for fewer tokens with the same downstream answer.
enhancedflagship
hypernym/llama-3.1-70b-academy
Llama 3.1 70B · Academy
affinity-routingisland-cascadeauto-failover
Llama-3.1-70B served via the Tolarian Academy affinity router. Concurrent=80, automatic burst-island scaling under load.
hosted
hypernym/glm-4.7-cerebras
GLM 4.7 · Cerebras
cerebras-directsub-second-ttft
Wafer-scale inference on Cerebras with concurrent=1000. Tuned for TTFT under 200ms.
passthrough
openai/gpt-4o
GPT-4o
OpenAI passthrough. Same upstream price; Freezone adds usage and routing only.
passthrough
anthropic/claude-sonnet-4-6
Claude Sonnet 4.6
Anthropic passthrough. Useful when you want Sonnet but want one bill across providers.
enhanced
hypernym/bge-m3-sanctum
BGE-M3 · Sanctum
cache-hit-90plain-failover
Embeddings via Serra's Sanctum — 304K-row text-cache short-circuits before the GPU on hits. 1024 dimensions.
enhancedbeta
hypernym/moxruby-lift
MoxRuby · Lift
multi-axis-extractionmountain-routed
Hypernym-native "lift" primitive — extracts content along configurable axes via the splash-mountain backend.