Tier Matrix
Endpoints
Tier Matrix
Public tier matrix. Returns the full TIERS array — deposit threshold, RPM, TPM, image/video/audio concurrency caps per tier. Cached 5 min.
GET
Tier Matrix
Public, no authentication required. Returns Kyma’s complete rate-limit tier ladder so docs, dashboard signup flows, and external tools can show “what you unlock at each spend threshold” without hardcoding the matrix in two places.
The response is
Cache-Control: public, max-age=300 (5 minutes).
Response
Always
"tier.matrix".Array of 5 tier definitions, ordered Tier 0 → Tier 4.
Tier fields
| Field | Type | Meaning |
|---|---|---|
tier | number | Tier number (0–4) |
name | string | Display name ("Tier 0" … "Tier 4") |
min_deposit | number | Minimum cumulative Stripe deposit in USD to qualify for this tier |
rpm | number | Requests per minute (text/code endpoints, global per-user) |
per_model_rpm | number | Requests per minute for any single model (text/code) |
tpm | number | Tokens per minute (text/code, global per-user) |
max_single_request | number | Upper bound for tokens in a single request (input + estimated output) |
image_concurrent | number | Max concurrent image generation jobs |
image_queue_depth_cap | number | Max queued image jobs ahead of the concurrency gate. 0 on Tier 4 (uses combined pool) |
video_concurrent | number | Max concurrent video generation jobs. 0 on Tier 0 (video blocked entirely) |
video_queue_depth_cap | number | Max queued video jobs. 0 on Tier 0 (blocked) and Tier 4 (combined pool) |
audio_concurrent | number | Legacy aggregate cap — max() across audio sub-pools |
audio_concurrent_per_provider | object | Per-provider audio sub-pool caps. Keys: groq, vertex, elevenlabs, minimax, openai. See Audio sub-pools. |
combined_media_concurrent | number? | Tier 4 only. Image + video share this combined concurrent pool (38 total). |
combined_queue_depth_cap | number? | Tier 4 only. Queue depth cap for the combined pool (114). |
Use cases
- Dashboard signup flow: show “deposit $5 to unlock 60 RPM + 10 concurrent image” inline.
- Pricing page: render tier comparison table without hardcoding numbers.
- External capacity planner: developers building products on Kyma can fetch tier caps and budget concurrency client-side.
- CI test fixture: lock test expectations against the live matrix.
Caching
Cache-Control: public, max-age=300 (5 minutes). Tier matrix changes infrequently (typically only when Kyma raises caps, e.g. after upstream key rotation or quota upgrades).
Errors
Public read-only — no auth errors. Only failure mode is500 on internal error.
Related
GET /v1/pricing— companion catalog of all SKU pricing- Rate Limits guide — full human-readable explanation with audio sub-pool reasoning, queue + backpressure semantics, 429 response shape, and the “Need higher limits?” CTA
GET /v1/auth/limits— your account’s CURRENT tier (requires auth, returns your specifictier_overrideif set)