Text-to-Speech (TTS)

POST /v1/audio/speech
Converts text to natural-sounding speech. The endpoint is compatible with the OpenAI TTS API format.

Request Parameters

Parameter        Type    Required  Description
model            string  Yes       Model name: gpt-4o-mini-tts, tts-1, tts-1-hd
input            string  Yes       Text to convert, max 4096 characters
voice            string  Yes       Voice character
response_format  string  No        Output format: mp3 (default), opus, aac, flac, wav, pcm
speed            number  No        Speed, 0.25-4.0, default 1.0
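
The constraints in the table can be checked on the client before a request is sent. The following is a minimal Python sketch, not part of the Crazyrouter API: build_speech_payload and split_text are hypothetical helper names, and the sketch assumes the 4096-character limit is counted the way Python's len() counts characters, which this page does not specify. The voice is passed through unchecked.

```python
# Constraints copied from the parameter table; the helper names are hypothetical.
MODELS = {"gpt-4o-mini-tts", "tts-1", "tts-1-hd"}
FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}
MAX_INPUT_CHARS = 4096  # assumption: counted like len() in Python


def build_speech_payload(text, voice, model="gpt-4o-mini-tts",
                         response_format="mp3", speed=1.0):
    """Return a JSON-ready dict, raising ValueError on out-of-range values."""
    if model not in MODELS:
        raise ValueError(f"model not listed in the table: {model!r}")
    if not text or len(text) > MAX_INPUT_CHARS:
        raise ValueError(f"input must be 1 to {MAX_INPUT_CHARS} characters")
    if response_format not in FORMATS:
        raise ValueError(f"unsupported response_format: {response_format!r}")
    if not 0.25 <= speed <= 4.0:
        raise ValueError("speed must be between 0.25 and 4.0")
    return {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": response_format,
        "speed": speed,
    }


def split_text(text, limit=MAX_INPUT_CHARS):
    """Split text into pieces of at most `limit` characters, breaking at spaces when possible."""
    pieces = []
    rest = text.strip()
    while len(rest) > limit:
        cut = rest.rfind(" ", 0, limit + 1)  # last space at index <= limit
        if cut <= 0:
            cut = limit  # no space available: fall back to a hard cut
        pieces.append(rest[:cut].rstrip())
        rest = rest[cut:].lstrip()
    if rest:
        pieces.append(rest)
    return pieces
```

For input longer than the limit, each piece returned by split_text can be sent as its own request. Joining the resulting audio files is left out of this sketch.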

Available Voices

Voice    Characteristics
alloy    Neutral, balanced
echo     Male, steady
fable    Male, warm
onyx     Male, deep
nova     Female, lively
shimmer  Female, soft

Request Examples

curl -X POST https://crazyrouter.com/v1/audio/speech \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{
    "model": "gpt-4o-mini-tts",
    "input": "Hello, welcome to the Crazyrouter API. What a beautiful day!",
    "voice": "nova",
    "response_format": "mp3",
    "speed": 1.0
  }' \
  --output speech.mp3

The response is a binary audio stream. Save it directly to a file.
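
The same request can be made from Python using only the standard library. This is a sketch under stated assumptions rather than an official client: the helper names (make_request, save_audio, synthesize) are illustrative, and the URL and headers are copied from the curl example above.

```python
import json
import shutil
import urllib.request

# Endpoint taken from the curl example above.
API_URL = "https://crazyrouter.com/v1/audio/speech"


def make_request(api_key, payload):
    """Build the POST request; nothing is sent until urlopen() is called."""
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


def save_audio(stream, path, chunk_size=64 * 1024):
    """Copy a binary file-like object to `path` in chunks, without parsing it."""
    with open(path, "wb") as out:
        shutil.copyfileobj(stream, out, chunk_size)


def synthesize(api_key, payload, path):
    """Send the request and stream the audio bytes of the response to `path`."""
    with urllib.request.urlopen(make_request(api_key, payload)) as resp:
        save_audio(resp, path)
```

A call such as synthesize("YOUR_API_KEY", {"model": "gpt-4o-mini-tts", "input": "Hello", "voice": "nova"}, "speech.mp3") would write the audio to speech.mp3. urlopen raises urllib.error.HTTPError for non-2xx responses, so a failed request does not leave a truncated audio file that looks valid.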

Model Comparison

Model            Quality   Latency  Description
tts-1            Standard  Low      Suitable for real-time scenarios
tts-1-hd         HD        Medium   More natural speech
gpt-4o-mini-tts  Highest   Medium   Latest model; supports more languages and emotions

gpt-4o-mini-tts supports multiple languages. It detects the language of the input text automatically and uses the corresponding pronunciation.

Note: the response body is a binary audio stream, not JSON. Use curl's --output option, or stream the bytes to a file, instead of parsing the response.