Audio Sound Effects
Endpoints
Audio Sound Effects
Generate non-speech audio from a text prompt. Synchronous, flat per-generation pricing.
POST
Audio Sound Effects
Synchronous endpoint. Describe a sound — door slam, rain, sword unsheathing, ambient room — get back a clip. Pairs naturally with
With
/v1/audio/music for video and game soundtracking.
Request
application/json body.
SFX model. Currently only
elevenlabs-sfx is shipped.Description of the sound. Max 500 characters. Also accepts
text as an alias.Optional duration
0.5 to 22 seconds. Omit to let the model auto-pick a sensible length.How literal the prompt should be (0 = creative, 1 = strict). Range
0–1.Audio format. Same options as audio/speech.
Optional query param. Set to
1 to have Kyma upload the resulting MP3 to Vercel Blob, write a multimodal_jobs row, and return JSON { job_id, kind: "audio", url, duration_sec, cost_usd, balance_usd } instead of streaming bytes. Used by Canvas and Muse audio kinds — most direct API callers don’t need this.Response
Default (streaming)
200 OK with audio bytes. Flat pricing — duration doesn’t change the cost.
| Header | What |
|---|---|
X-Kyma-Model | elevenlabs-sfx |
X-Kyma-Cost-USD | flat $0.027 per generation |
X-Kyma-Balance-USD | remaining balance |
With ?save_to_blob=1
200 OK with JSON:
Errors
| Status | error.code | When |
|---|---|---|
400 | not_an_sfx_model | model is not an SFX SKU |
400 | prompt_too_long | prompt > 500 chars |
400 | invalid_duration | duration_seconds outside [0.5, 22] |
401 | auth_error | missing or invalid API key |
402 | billing_error | balance too low |
502 | provider_error | upstream provider failure |
See also
POST /v1/audio/music— full music tracksPOST /v1/audio/speech— voice narration