Model Gateway — 30+ Voice Models, One API

Model gateway

One API for every voice model

One API. 30+ models. Keep your orchestrator, your keys, and your contracts. US$ 0.0033 per agent minute per component.

Runs with your stack

Workflow

Discover. Test. Deploy.

Browse, benchmark, and ship voice models from one API. Same request format. Same response format. Regardless of provider.

Discover

Browse 30+ models by use case, region, provider, or language. Open-source and proprietary. All normalized under one API.

Test

Send sample inputs, compare quality and latency, and review structured outputs with the same schema across every model.

Deploy

Call /stt, /tts, or your LLM endpoint directly. The Context Router handles region selection, provider routing, and compliance enforcement per request.

Deployment

Run models your way

Your orchestrator. Your keys. Three ways to run.

SLNG Hosted

We run the model on sovereign GPUs in your region. You call the endpoint. No provider contract needed.

SLNG Proxied

Keep your provider. We route to them with automatic failover, compliance enforcement, and observability added.

BYOK

Your API keys. Your provider contracts. Cost goes down. Reliability goes up. Nothing else changes.

Integration

Plug in the endpoints. Nothing else changes.

In-region compute

Select your region per request

11 sovereign hubs. No redeployment, no config changes.

-H "X-Region-Override: eu-west-1"

One header controls where your audio and model calls are processed.
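
As a rough illustration of the per-request region header, the sketch below builds (but does not send) a request to the /stt endpoint with the header shown above. The base URL, the `build_stt_request` helper, and the audio content type are assumptions made for the example, not documented SLNG values, and authentication is left out.

```python
import urllib.request

# Hypothetical base URL: replace with the gateway host for your account.
BASE_URL = "https://slng-gateway.example"


def build_stt_request(audio: bytes, region: str) -> urllib.request.Request:
    """Build a POST to /stt that asks for processing in one region.

    The request is only constructed here; nothing is sent over the network.
    """
    return urllib.request.Request(
        url=f"{BASE_URL}/stt",
        data=audio,
        method="POST",
        headers={
            # Header name taken from the snippet above.
            "X-Region-Override": region,
            # Assumed content type for the audio payload.
            "Content-Type": "audio/wav",
        },
    )


req = build_stt_request(b"\x00\x01", region="eu-west-1")

# urllib stores header names with normalized capitalization;
# HTTP header names are case-insensitive, so this is harmless.
print(req.get_method(), req.full_url)
print(dict(req.header_items()))
```

Changing the `region` argument is the only difference between two calls processed in different hubs; the URL and payload stay the same.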

Full-stack agent execution

30+ STT and TTS models. Add models without redeploying. Switch providers without changing code.

Deepgram Nova 3

STT

Low-latency transcription for live voice agents, captions, and interactive systems. 30+ languages.

Soniox STT AI

STT

Transcribes and translates 60+ languages from recorded files or live audio streams.

Reson8 Resonant 1

STT

Adapts to domain-specific vocabulary in real time. No fine-tuning required. Built for European languages.

Rime Arcana 3

TTS

Provides industry-leading latency, enabling natural back-and-forth interactions without awkward pauses.

Cartesia Sonic 3

TTS

Streaming speech with voice cloning. Full control over pronunciation and accent.

Murf AI Falcon

TTS

Built for real-time voice. Low-latency streaming with consistent pronunciation across 20+ languages.

Soniox TTS RT

TTS

Handles the edge cases that break most production speech systems. Built-in safeguards against hallucination across 60+ languages.

Deepgram Aura 2

TTS

Real-time text-to-speech model built for conversational AI. Generates natural, human-like speech with low latency.

What runs on every call

Every request goes through the execution layer before it hits a model.

PII redaction

Stripped before audio reaches your models or logs.

Automatic failover

The execution layer reroutes before your end user notices.

TTS smart caching

Repeated TTS responses served from cache. Provider never gets called.

Context routing

Only complex turns hit your LLM. Fewer model calls per conversation.

Analytics

Cost and latency tracked against baseline. Dashboard shows the delta.

Transcripts

Region, provider, model, cost, latency logged per step, per call.
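
To make the TTS smart caching idea above concrete, here is a minimal in-memory sketch: a repeated request with the same model, voice, and text is served from the cache, so the provider is called only once. This is not SLNG's implementation. The `TTSCache` class, the cache key, and the `fake_provider` stand-in are all assumptions for illustration.

```python
import hashlib
from typing import Callable, Dict


class TTSCache:
    """Illustrative cache keyed by model, voice, and text (assumed key)."""

    def __init__(self, synthesize: Callable[[str, str, str], bytes]) -> None:
        self._synthesize = synthesize
        self._store: Dict[str, bytes] = {}
        self.provider_calls = 0  # counts cache misses only

    @staticmethod
    def _key(model: str, voice: str, text: str) -> str:
        # Join with a separator that will not appear in normal text,
        # then hash so keys have a fixed size.
        raw = "\x1f".join((model, voice, text)).encode("utf-8")
        return hashlib.sha256(raw).hexdigest()

    def speak(self, model: str, voice: str, text: str) -> bytes:
        key = self._key(model, voice, text)
        if key not in self._store:
            # Cache miss: this is the only path that reaches the provider.
            self.provider_calls += 1
            self._store[key] = self._synthesize(model, voice, text)
        return self._store[key]


def fake_provider(model: str, voice: str, text: str) -> bytes:
    """Stand-in for a real TTS provider call (hypothetical)."""
    return f"{model}:{voice}:{text}".encode("utf-8")


cache = TTSCache(fake_provider)
first = cache.speak("example-model", "example-voice", "Thanks for calling.")
second = cache.speak("example-model", "example-voice", "Thanks for calling.")
print(cache.provider_calls)  # prints 1
```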

Compliance and security

HIPAA

ISO 27001:2022

GDPR

SOC 2 Type II (in audit)

Pricing

Under 1¢ per agent minute

US$ 0.0099 full stack. Or pick your components.

STT

US$ 0.0033 / agent minute · All models included · Or BYOK and pay your provider directly

Execution layer

US$ 0.0033 / agent minute · LLM routing · Smart caching · PII redaction · Analytics

TTS

US$ 0.0033 / agent minute · All models included · Or BYOK and pay your provider directly

Do I need to change my orchestrator?

No. The Model Gateway works with any orchestrator that can call an HTTP endpoint. LiveKit, Pipecat, Twilio, Telnyx, Daily, or custom. Replace three provider URLs with SLNG endpoints and add one header. Your orchestrator, your business logic, and your provider contracts stay exactly the same.
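
A hedged sketch of what that swap might look like in an orchestrator's configuration follows. The config keys, the base URL, and the `/llm` path are hypothetical. The `/stt` and `/tts` paths and the `X-Region` header name come from elsewhere on this page.

```python
from typing import Any, Dict

# Hypothetical base URL: replace with the gateway host for your account.
SLNG_BASE = "https://slng-gateway.example"


def point_at_gateway(config: Dict[str, Any], region: str) -> Dict[str, Any]:
    """Return a copy of the config with the three provider URLs swapped.

    Only the endpoint URLs and one header change; every other key
    (business logic, timeouts, and so on) is carried over untouched.
    """
    updated = dict(config)
    updated["stt_url"] = f"{SLNG_BASE}/stt"
    updated["llm_url"] = f"{SLNG_BASE}/llm"  # assumed path for the LLM endpoint
    updated["tts_url"] = f"{SLNG_BASE}/tts"
    updated["extra_headers"] = {"X-Region": region}
    return updated


# Hypothetical orchestrator config before the swap.
before = {
    "stt_url": "https://stt.provider-a.example/v1/listen",
    "llm_url": "https://llm.provider-b.example/v1/chat",
    "tts_url": "https://tts.provider-c.example/v1/speak",
    "greeting": "Hello, how can I help?",
}

after = point_at_gateway(before, region="eu-west-1")
print(after["stt_url"], after["extra_headers"])
```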

How much does the Model Gateway cost?

US$ 0.0033 per agent minute per component. Execution layer, STT, and TTS are each US$ 0.0033. Full stack is US$ 0.0099 per agent minute. Agent minute means the call length, not audio processed. No contracts, no minimums.
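
The arithmetic can be sketched in a few lines. The rate and component list come from the pricing above. The `call_cost` function name and the assumption that fractional minutes are billed without rounding are illustrative only.

```python
from decimal import Decimal
from typing import Iterable, Union

RATE_PER_COMPONENT = Decimal("0.0033")  # US$ per agent minute, per component
COMPONENTS = ("stt", "execution", "tts")


def call_cost(
    call_minutes: Union[int, float, str],
    components: Iterable[str] = COMPONENTS,
) -> Decimal:
    """Cost in US$ for one call, billed on call length (agent minutes).

    Assumes no rounding of partial minutes; check your invoice terms.
    """
    chosen = set(components)
    unknown = chosen - set(COMPONENTS)
    if unknown:
        raise ValueError(f"unknown components: {sorted(unknown)}")
    # Decimal avoids binary floating-point noise in currency math.
    return Decimal(str(call_minutes)) * RATE_PER_COMPONENT * len(chosen)


print(call_cost(1))            # 0.0099  (full stack, one agent minute)
print(call_cost(3))            # 0.0297  (full stack, 3-minute call)
print(call_cost(3, ["stt"]))   # 0.0099  (STT only, 3-minute call)
```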

Does my voice AI data stay in-region?

Yes. Audio processed in a region stays in that region. Data residency is enforced at the routing layer, not promised in documentation. Every call is logged with region, provider, model, and timestamp. SLNG is ISO 27001 certified, HIPAA compliant, and GDPR compliant.

Is SLNG compliant with GDPR?

Yes. SLNG is GDPR compliant, ISO 27001 certified, and HIPAA compliant. SOC 2 Type II is in audit. Details at trust.slng.ai.

How do I select a region?

One header: X-Region: ap-south-1. Add it to any API call. Same endpoint, same models, different physical location. No DNS changes, no separate deployments.

What does "per agent minute" mean?

Agent minute is the length of the call, not the amount of audio processed. A 3-minute call is 3 agent minutes regardless of how much silence or overlap it contains. Model providers typically bill per audio minute. SLNG bills per agent minute, so the charge depends only on call duration.

How do I integrate SLNG with my voice AI stack?

Swap your STT, LLM, and TTS endpoints to SLNG. Add one header (X-Region) to select your region. No changes to your orchestrator, your models, or your agent logic. Works with LiveKit, Pipecat, Cognigy, or any custom stack.

Which models are supported?

30+ models across STT, TTS, and LLM. Supported models and providers include Deepgram Nova 3, ElevenLabs, Cartesia Sonic, Soniox, Rime Arcana, Sarvam, and Whisper. Models are available as SLNG Hosted, SLNG Proxied, or BYOK. Full catalog at docs.slng.ai/models.
