Text-to-Speech Pricing Comparison

Compare pricing and features across different text-to-speech providers. Find the best service for your needs based on price, free tier allowances, and capabilities.

Provider        Model                     Price per 1M chars   Free Allowance   Basic Plan
OpenAI          GPT-4o-mini-tts           $15.00               None             N/A
Azure           Standard                  $15.00               500K chars       1M chars, $15
Google          WaveNet                   $16.00               1M chars         1M chars, $16
Google          Studio                    $160.00              1M chars         1M chars, $160
Amazon          Polly                     $4.00                1M chars         1M chars, $4
ElevenLabs      V2 Flash/Turbo English    $100.00              10K chars        30K chars, $5
PlayAI          3.0 mini                  $150.00              30 mins          50 mins, $9
smallest.ai     Waves                     $40.00               30 mins          3 hrs, $5
VoiceGen.org    Standard                  $40.00               20K chars        80K chars, $10
Hume.ai         Standard                  $100.00              10K chars        30K chars, $3
Murf.ai         Standard                  $100.00              100K chars       10K chars, $1
Descript        Standard                  $80.00               5 mins           30 mins, $24
Lovo.ai         Standard                  $50.00               None             2 hrs, $29
Resemble.ai     Standard                  $100.00              None             4K seconds, $5
Play.ht         Standard                  $100.00              None             25K chars, $5
Cartesia        Sonic                     $46.00               10K credits      100K chars, $5

Understanding TTS Pricing Models

Text-to-speech (TTS) services typically use one of several pricing models. Most providers charge per million characters processed, while others may offer plans based on minutes of audio generated or a credit system.
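Because plans are quoted in different units, it can help to reduce a character-based plan to an effective rate per million characters before comparing it with a pay-as-you-go price. The following is a minimal Python sketch of that arithmetic. The function name `effective_rate_per_million` is hypothetical, and the example figures (30,000 characters for $5) are illustrative, similar to the ElevenLabs entry in the comparison above.

```python
def effective_rate_per_million(plan_price_usd: float, included_chars: int) -> float:
    """Return a plan's effective price in USD per 1,000,000 characters."""
    if included_chars <= 0:
        raise ValueError("included_chars must be positive")
    return plan_price_usd * 1_000_000 / included_chars


# Illustrative figures: a plan that bundles 30,000 characters for $5.
rate = effective_rate_per_million(5.00, 30_000)
print(f"${rate:.2f} per 1M characters")  # prints "$166.67 per 1M characters"
```

A small bundled plan can therefore work out to a higher effective rate than the per-million price quoted for the same provider, so both numbers are worth checking.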

When comparing providers, consider not just the base rate, but also factors like voice quality, language support, and available features such as SSML or voice customization.

Key Factors to Consider

  • Base pricing: Cost per million characters processed
  • Free tier: Whether a provider offers free usage up to a certain limit
  • Quality levels: Many providers offer different quality tiers at different price points
  • Voice selection: Number and variety of available voices
  • Language support: Which languages are supported and at what price
  • Additional features: SSML support, emotion control, pausing, etc.
  • Usage restrictions: Any limitations on how generated audio can be used

Frequently Asked Questions

What does "price per million characters" mean?

This is the standard pricing unit for most TTS services. The price listed is what you'll pay to process 1,000,000 characters through the service to generate speech.
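As a rough sketch of how a per-million rate translates into a bill, the snippet below scales the rate by the number of characters processed. The function name `cost_for_characters` is hypothetical, the $15.00 rate is taken from the comparison above, and the 250,000-character volume is an assumed example. Real invoices may differ because of free tiers, rounding rules, or minimum charges.

```python
def cost_for_characters(num_chars: int, price_per_million_usd: float) -> float:
    """Estimate the USD cost of synthesizing num_chars at a per-million rate."""
    if num_chars < 0:
        raise ValueError("num_chars must not be negative")
    return num_chars * price_per_million_usd / 1_000_000


# Assumed example: 250,000 characters at $15.00 per 1M characters.
cost = cost_for_characters(250_000, 15.00)
print(f"${cost:.2f}")  # prints "$3.75"
```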

How many characters are in a minute of speech?

On average, about 1,000 characters (roughly 150-160 words) convert to one minute of speech, though this can vary based on speaking rate, language, and voice model.
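This rule of thumb makes it possible to move between character-based and minute-based figures. The sketch below assumes the 1,000 characters per minute average stated above; the function names are hypothetical, and the results are only estimates since the real ratio depends on speaking rate, language, and voice model.

```python
CHARS_PER_MINUTE = 1_000  # rule-of-thumb average; actual values vary


def estimate_minutes(num_chars: int, chars_per_minute: int = CHARS_PER_MINUTE) -> float:
    """Estimate minutes of audio produced from num_chars of text."""
    return num_chars / chars_per_minute


def minutes_to_characters(minutes: float, chars_per_minute: int = CHARS_PER_MINUTE) -> int:
    """Estimate how many characters a minute-based allowance covers."""
    return int(round(minutes * chars_per_minute))


def cost_per_minute(price_per_million_usd: float,
                    chars_per_minute: int = CHARS_PER_MINUTE) -> float:
    """Convert a per-million-character rate into an approximate per-minute rate."""
    return price_per_million_usd * chars_per_minute / 1_000_000


print(estimate_minutes(45_000))          # prints 45.0
print(minutes_to_characters(30))         # prints 30000
print(f"${cost_per_minute(15.00):.3f}")  # prints "$0.015"
```

Under this assumption, a 30-minute allowance corresponds to roughly 30,000 characters, which gives a common basis for comparing minute-based and character-based plans.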

Are there additional costs besides the per-character rate?

Some providers charge for additional features like voice customization, storage of generated audio, or premium voices. Check the provider's full pricing page for details.

Do these prices include taxes?

Prices listed are generally before any applicable taxes. Depending on your location and the provider's policies, additional taxes or fees may apply.

Pricing data last updated: April 2025. Actual pricing may have changed. Please verify with providers before making decisions.