Deepgram · Audio model

Deepgram Nova-3 / Aura-2

Production speech APIs, billed by the second.

One wallet. Every model.

Commercial use included · Verified May 26, 2026 · Outputs are yours · No training on your data
Variants

Deepgram Nova-3 / Aura-2, every version.

Speech to text and text to speech. One token balance.

Deepgram Nova-3 (STT)

Speech to text

One wallet. One bill.

$0.01 / minute

Deepgram Aura-2 (TTS)

Audio generation

One wallet. One bill.

$0.05 / 1k chars

Capabilities

What it does best.

Deepgram capabilities

Nova-3. Speech-to-Text

  • Handles 45+ languages across monolingual and multilingual variants[^1]
  • Multilingual variant detects language automatically within a single audio stream[^1]
  • Streaming and pre-recorded transcription share the same per-minute rate. No real-time premium[^2]
  • True per-second billing: a 14-second file is billed for exactly 14 seconds, not rounded up to 15 or 60[^2]
  • Speaker Diarization add-on identifies and labels individual speakers in multi-speaker recordings[^2]
  • Smart Formatting included at no extra charge: adds punctuation, correct casing, dates, and currency[^2]
  • Keyterm Prompting add-on boosts recognition accuracy for domain-specific vocabulary[^2]
  • PII Redaction add-on removes SSNs, credit card numbers, and phone numbers from transcripts automatically[^2]
  • Optimized for noisy environments, crosstalk, and far-field audio[^1]
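
To make the per-second billing point concrete, here is a minimal sketch that compares the exact charge for a 14-second file with two hypothetical rounding schemes (15-second and 60-second increments). It assumes the $0.01 per minute rate listed above and simple linear proration. The function names are illustrative and not part of any Deepgram or Coinis API.

```python
import math
from decimal import Decimal

def per_second_cost(duration_seconds: int, rate_per_minute: Decimal) -> Decimal:
    """Charge when each second costs 1/60 of the per-minute rate."""
    return rate_per_minute * Decimal(duration_seconds) / Decimal(60)

def rounded_up_cost(duration_seconds: int, rate_per_minute: Decimal,
                    increment_seconds: int) -> Decimal:
    """Charge under a hypothetical scheme that rounds the duration up
    to the next multiple of increment_seconds before billing."""
    billed = math.ceil(duration_seconds / increment_seconds) * increment_seconds
    return rate_per_minute * Decimal(billed) / Decimal(60)

RATE = Decimal("0.01")  # assumed rate in USD per minute

exact = per_second_cost(14, RATE)        # billed for 14 seconds
to_15 = rounded_up_cost(14, RATE, 15)    # billed for 15 seconds
to_60 = rounded_up_cost(14, RATE, 60)    # billed for a full minute

print(f"per-second: ${exact:.6f}")
print(f"rounded to 15 s: ${to_15:.6f}")
print(f"rounded to 60 s: ${to_60:.6f}")
```

On a single file the gap is tiny. It adds up across many short clips, which is where per-second billing matters.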

Aura-2. Text-to-Speech

  • Supports 7 languages: English, Spanish, German, French, Dutch, Italian, and Japanese[^3]
  • 40+ named English voices spanning accents, age profiles, and use-case profiles including IVR, customer service, storytelling, and advertising[^3]
  • Select Spanish voices support codeswitching, alternating between English and Spanish mid-conversation[^3]
  • Low-latency synthesis designed for interactive voice response systems and conversational AI[^3]

Platform limits

  • Pre-recorded audio files: up to 2 GB per request[^4]
  • Processing timeout: 10 minutes per request[^4]
  • Concurrency: up to 100 concurrent REST requests and 150 concurrent WebSocket streams[^4]
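
If you call the API from your own code, a client-side guard for these limits could look like the sketch below. It is an illustration under assumptions: 2 GB is read as decimal gigabytes, and the helper names are hypothetical. Nothing here sends a request.

```python
import threading

# Limits taken from the list above; decimal gigabytes is an assumption.
MAX_FILE_BYTES = 2 * 1000**3
MAX_CONCURRENT_REST = 100
MAX_CONCURRENT_WEBSOCKET = 150

def fits_size_limit(size_bytes: int) -> bool:
    """True if a pre-recorded file is non-empty and within the 2 GB cap."""
    return 0 < size_bytes <= MAX_FILE_BYTES

# One slot per allowed concurrent REST request.
rest_slots = threading.BoundedSemaphore(MAX_CONCURRENT_REST)

def run_with_rest_slot(task):
    """Run task() while holding a REST slot; blocks when all slots are busy."""
    with rest_slots:
        return task()

print(fits_size_limit(1_500_000_000))   # within the cap
print(fits_size_limit(2_000_000_001))   # one byte over the cap
```

The 10-minute processing timeout is enforced server-side, so a client can only react to it, for example by splitting very long recordings before upload.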
How it works

Generate Deepgram Nova-3 / Aura-2 in three steps.

No code. No API. Describe what you want. Get ad-ready output.

01

Describe your ad

Paste a product link or a short brief. Coinis pulls your brand.

02

Pick Deepgram Nova-3 / Aura-2

Choose the model for the job: Nova-3 for transcription, Aura-2 for speech. Switch anytime.

03

Get creatives

Generate. Launch to Meta. Track results in one place.

One wallet. Every model. No accounts to juggle.

Pricing

Token pricing. No surprises.

One wallet across every model. No API accounts to juggle.

Deepgram Nova-3 / Aura-2 · Deepgram Nova-3 (STT)
0.1 tokens
per minute · $0.01
Speech to text
$0.01 / minute
One wallet. Every model. One invoice. 1 token = $0.10
1 minute ≈ 0.1 tokens ($0.01)
Start free. 25 tokens a week.

No credit card.

Why pay through Coinis
  • One wallet for every model. No API keys. No separate bills.
  • Generate ads. Launch to Meta. Track in one place.
  • On-brand output from your Brand Profile.

1 token = $0.10 pay-as-you-go. Less on a plan.
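
The token math above can be checked with a few lines. This sketch assumes the pay-as-you-go value of $0.10 per token and the $0.01 per minute Nova-3 rate shown in the pricing card; plan discounts are not modeled, and the function names are illustrative.

```python
from decimal import Decimal

TOKEN_USD = Decimal("0.10")  # pay-as-you-go value of one token

def usd_to_tokens(usd: Decimal) -> Decimal:
    """Convert a dollar price into tokens."""
    return usd / TOKEN_USD

def minutes_for_tokens(tokens: Decimal, usd_per_minute: Decimal) -> Decimal:
    """How many minutes of transcription a token balance covers."""
    return tokens * TOKEN_USD / usd_per_minute

tokens_per_minute = usd_to_tokens(Decimal("0.01"))
free_minutes = minutes_for_tokens(Decimal("25"), Decimal("0.01"))

print(f"1 minute = {tokens_per_minute} tokens")
print(f"25 free tokens cover {free_minutes} minutes")
```

At these assumed rates, the weekly free allowance of 25 tokens works out to 250 minutes of speech to text.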

Nova-3 vs Aura-2

Pick the model for the job.

Deepgram Nova-3 (STT)

Speech to text
Price $0.01 / minute

Deepgram Aura-2 (TTS)

Text to speech
Price $0.05 / 1k chars

Use cases

Two buyers. One model.

For teams

Every model. One wallet. One bill.

Video, image, audio, and LLM in one place.

Variants in bulk.

Same brief. Many cuts. No accounts to juggle.

Own every output.

Download it. Run it in any campaign.

For creatives

Ship a Reel before lunch.

Prompt to platform-native clip in minutes.

Same product. Ten formats.

One generation, every aspect ratio.

Commercial UGC without a creator.

Authentic selfie-style ads, on brand.

Real-time voice agents and conversational AI

Nova-3 streaming powers live transcription for voice bots and virtual assistants.[^1] Aura-2 handles synthesis in the same pipeline, so you can build both transcription and speech generation into a single voice agent flow.[^3]

Call center transcription with diarization

Nova-3's Speaker Diarization add-on labels each speaker in a recorded customer call.[^2] Pair it with Smart Formatting to get clean, readable transcripts ready for QA review or CRM logging.

Compliance and PII-sensitive transcription

Nova-3's PII Redaction add-on strips SSNs, credit card numbers, and phone numbers from transcripts automatically.[^2] Regulated industries get a clean output without building a separate scrubbing layer.

Multilingual voice products

Nova-3 Multilingual detects languages automatically in a single audio stream.[^1] Aura-2 adds TTS output across 7 languages for end-to-end voice interfaces in globally deployed applications.[^3]

IVR and voice-assistant front-ends

Aura-2's 40+ English voices are tuned for interactive voice response systems and conversational AI front-ends.[^3] Pick a voice profile that matches your brand tone and deploy it at scale with per-character billing.
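
For readers who want to see how the call-center and compliance options might combine in one request, here is a hedged sketch that only builds a request URL. The endpoint and the parameter names (`model`, `smart_format`, `diarize`, `redact`) reflect Deepgram's public pre-recorded API as best understood here. They are assumptions, not something this page confirms, and requests routed through Coinis may look different. Check the current Deepgram docs before relying on them.

```python
from urllib.parse import urlencode

# Assumed Deepgram pre-recorded endpoint; verify against the official docs.
BASE_URL = "https://api.deepgram.com/v1/listen"

def build_transcription_url(diarize: bool, redact: list[str]) -> str:
    """Build a query string for a Nova-3 request with optional add-ons.

    No network call is made; this only assembles the URL.
    """
    params = [("model", "nova-3"), ("smart_format", "true")]
    if diarize:
        params.append(("diarize", "true"))
    # One repeated `redact` parameter per category (assumed convention).
    params.extend(("redact", category) for category in redact)
    return f"{BASE_URL}?{urlencode(params)}"

url = build_transcription_url(diarize=True, redact=["pci", "ssn"])
print(url)
```

Keeping the add-ons as explicit arguments makes it easy to see which billable features a given job turns on.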

Outputs are yours. Sell them.

Safe for paid ads.

Your prompts are never used for training.

FAQ

Deepgram Nova-3 / Aura-2 FAQs

How much does Deepgram Nova-3 cost per minute on Coinis?

Deepgram Nova-3 (STT) is priced at $0.013 per minute on Coinis. Deepgram's direct Pay-As-You-Go rate for Nova-3 pre-recorded transcription is $0.0077 per minute. The Coinis retail price includes platform access, unified billing, and a single API key across all catalog models.

What is the difference between Deepgram Nova-3 and Aura-2?

Nova-3 is a speech-to-text model. It transcribes audio into text, supporting 45+ languages, noisy audio, and multi-speaker recordings. Aura-2 is a text-to-speech model. It converts written text into spoken audio across 7 languages with 40+ named English voices.

Does Deepgram support real-time streaming transcription, and is it priced differently?

Yes, Nova-3 supports both real-time WebSocket streaming and pre-recorded REST transcription. On Coinis, both modes are billed at the same $0.013 per minute rate. There is no separate premium for the low-latency streaming path.

Which languages do Nova-3 and Aura-2 support?

Nova-3 supports 45+ languages across its monolingual variants. The Multilingual variant detects multiple languages automatically within a single audio stream. Aura-2 TTS supports 7 languages: English (with American, British, Australian, Irish, and Filipino accents), Spanish (with multiple regional accents), German, French, Dutch, Italian, and Japanese.

How much does Deepgram Nova-3 / Aura-2 cost on Coinis?

On Coinis, Deepgram Nova-3 / Aura-2 is pay-as-you-go from one shared token wallet. Buy tokens once. Spend them on any model. No separate accounts. No monthly commit.

What are the file size and concurrency limits for Nova-3?

Pre-recorded audio files are capped at 2 GB per request, and each request times out after 10 minutes of processing. Pay-As-You-Go accounts support up to 100 concurrent REST requests and 150 concurrent WebSocket streams. For multichannel audio, billing is based on total processed duration: a 10-minute stereo file is billed as 20 minutes.
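
The multichannel rule is easy to sketch: billed duration is wall-clock duration times the number of channels. The example below reproduces the stereo case from this answer and prices it at the $0.013 per minute rate quoted in this FAQ. The function names are illustrative only.

```python
from decimal import Decimal

def billed_seconds(duration_seconds: int, channels: int) -> int:
    """Total processed duration: every channel is billed separately."""
    return duration_seconds * channels

def transcription_cost(duration_seconds: int, channels: int,
                       rate_per_minute: Decimal) -> Decimal:
    """Cost in USD for a file, prorated per second of processed audio."""
    seconds = billed_seconds(duration_seconds, channels)
    return rate_per_minute * Decimal(seconds) / Decimal(60)

stereo_seconds = billed_seconds(10 * 60, channels=2)
stereo_cost = transcription_cost(10 * 60, 2, Decimal("0.013"))

print(f"billed minutes: {stereo_seconds // 60}")
print(f"cost: ${stereo_cost}")
```

Downmixing to mono before upload halves the billed duration, at the price of losing per-channel separation.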

How does Deepgram Aura-2 compare to ElevenLabs for text-to-speech?

Aura-2 on Coinis is billed at $0.051 per 1,000 characters. It covers 7 languages and 40+ named English voices, with select Spanish voices supporting codeswitching. ElevenLabs is available as a separate model on Coinis. Compare both on the Coinis audio catalog to find the voice and price that fits your workflow.
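
To estimate an Aura-2 bill before you generate, a rough sketch like the one below can help. It assumes the $0.051 per 1,000 characters rate quoted above, linear per-character proration, and that every character (spaces and punctuation included) counts. How characters are actually counted is an assumption here, so treat the result as an estimate.

```python
from decimal import Decimal

RATE_PER_1K_CHARS = Decimal("0.051")  # USD, as quoted in this answer

def tts_cost(text: str, rate_per_1k: Decimal = RATE_PER_1K_CHARS) -> Decimal:
    """Estimated cost in USD, prorated per character."""
    return rate_per_1k * Decimal(len(text)) / Decimal(1000)

script = "Thanks for calling. How can I help you today? " * 50
estimate = tts_cost(script)

print(f"{len(script)} characters, estimated ${estimate}")
```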

Can I use Deepgram for HIPAA-compliant or PII-redacted transcription?

Nova-3's PII Redaction add-on automatically removes sensitive identifiers such as SSNs, credit card numbers, and phone numbers from transcripts. HIPAA compliance with a signed BAA is available only on Deepgram's Enterprise plan. If HIPAA is a hard requirement, contact the Coinis team to discuss Enterprise routing options.

Start free

Your wallet. Every model. One place.

Start free. 25 tokens a week. No card.

Pricing and capabilities verified 2026-05-26. Read the docs.