Voice Design (MiniMax)
Endpoints
Voice Design (MiniMax)
Generate a synthesized voice profile from a natural-language description. No reference recording needed. Returns a voice_id usable in /v1/audio/speech.
POST
Voice Design (MiniMax)
Synchronous endpoint. Describe a voice in plain English, get back a
voice_id you can immediately use in /v1/audio/speech on any MiniMax voice model.
Use this when you don’t have voice talent, you’re prototyping a fictional character, or you want a brand-safe persona voice from scratch.
Request
application/json body.
Natural-language voice description. Max 1000 characters. Also accepts the alias
text.Voice design SKU. Currently only
minimax-voice-design is supported.Optional human-readable label, max 64 chars.
Optional sample text MiniMax will render in the new voice for an internal preview. Max 500 characters. Doesn’t appear in the response — the audio bytes are not returned (you’d call
/v1/audio/speech afterward to render).Optional hint:
male or female.Optional hint:
child, young, middle-aged, or elderly.Response
200 OK JSON. Same shape as /v1/audio/voice-clone.
Pricing
Flat $4.20 per designed voice. One-time charge — once designed, thevoice_id is reusable in unlimited TTS calls.
Voice design costs ~2× voice clone because synthesizing timbre from text is strictly more compute-intensive than reproducing a captured voice.
Ownership
Same gating as voice clone — designed voice IDs are owned by the requesting user. Sharing thevoice_id with another account is rejected with 403 voice_not_owned.
Errors
| Status | error.code | When |
|---|---|---|
400 | not_a_voice_design_model | model is not a design SKU |
400 | description_too_long | description > 1000 chars |
400 | invalid_request | missing description |
402 | insufficient_credits | balance below $4.20 |
500 | ownership_write_failed | design succeeded but ownership row insert failed |
502 | provider_error | upstream MiniMax failure |
See also
POST /v1/audio/voice-clone— clone from reference audio insteadPOST /v1/audio/speech— use the designed voice- Voice Design (model) — service overview