Audio Voices
Endpoints
Audio Voices
Browse the available voice library across both providers. Pick a voice_id for /v1/audio/speech.
GET
Audio Voices
Read-only passthrough of TTS voice catalogs across both supported providers:
The full ElevenLabs catalog is large (~100 KB JSON). For a picker UI, fetch once on mount and cache client-side. The MiniMax catalog is smaller (~300 voices) and trivial to ship inline.
- ElevenLabs — ~3,000 voices (premade + community)
- MiniMax — ~300 system voices spanning Chinese, English, Japanese, and other languages
voice_id to POST /v1/audio/speech.
Voice ↔ model compatibility: ElevenLabsvoice_ids only work witheleven-*SKUs. MiniMaxvoice_ids only work withminimax-speech-*SKUs. The two are not cross-compatible.
Query parameters
| Param | Values | Default | Notes |
|---|---|---|---|
provider | elevenlabs | minimax | all | elevenlabs | Which catalog to return |
Response
200 OK. Each voice record carries a provider field so you can disambiguate when fetching ?provider=all.
Common fields
| Field | Type | Notes |
|---|---|---|
provider | "elevenlabs" | "minimax" | Which provider serves this voice |
voice_id | string | Opaque id — pass to /v1/audio/speech |
name | string | Human-readable name |
ElevenLabs-only fields
| Field | Type | Notes |
|---|---|---|
category | string | premade, cloned, etc. |
labels | object | Filter facets — gender, age, accent, use case |
preview_url | string | Short audio sample of the voice |
description | string | Free-text description |
MiniMax-only fields
| Field | Type | Notes |
|---|---|---|
description | string[] | Tags like ["Energetic", "English", "Female"] |
Cache headers
| Header | What |
|---|---|
X-Kyma-Cache: may-hit | Catalog likely served from Redis (5-min TTL per provider) |
X-Kyma-Cache: partial | ?provider=all — at least one provider returned, possibly cached, possibly fresh |
X-Kyma-Cache: miss | No Redis available; fetched fresh from upstream |
Errors
| Status | When |
|---|---|
400 | provider query param value is not one of the allowed options |
401 | Missing or invalid API key |
502 | Upstream catalog endpoint unavailable. With ?provider=all, only returned when both providers fail — partial responses succeed with the available data. |
Picking the right voice for your SKU
| Speech SKU | Eligible voices |
|---|---|
eleven-v3 | Any provider: "elevenlabs" voice |
eleven-multilingual-v2 | Any provider: "elevenlabs" voice |
eleven-flash-v2-5 | Any provider: "elevenlabs" voice |
eleven-turbo-v2-5 | Any provider: "elevenlabs" voice |
minimax-speech-hd | Any provider: "minimax" voice + your own cloned or designed voices |
minimax-speech-turbo | Any provider: "minimax" voice + your own cloned/designed voices |
See also
POST /v1/audio/speech— use avoice_idfrom this listPOST /v1/audio/voice-clone— create a custom voice from a reference recording (10s–5min)POST /v1/audio/voice-design— create a synthesized voice from a text description