Overview
gpt-image-2 is OpenAI’s flagship image model (GA April 22, 2026). It’s the strongest text-in-image renderer in production today — accurate typography across Latin, Japanese, Korean, Hindi, Bengali, and 25+ other scripts — with reasoning-augmented composition and photoreal output. Sync API, blob-hosted output, runtime quality dropdown.
One SKU exposes all three of OpenAI’s quality tiers (low / medium / high) — the picker default is medium and you opt into higher fidelity per request. No -pro or -mini derivatives; the model ID matches OpenAI’s exact naming.
Specs
| Field | Value |
|---|---|
| Model ID | gpt-image-2 |
| Creator | OpenAI |
| Best for | Text-in-image, photoreal, multilingual typography, logos |
| Sizes | 1024x1024, 1024x1536, 1536x1024, 2048x2048 |
| Quality tiers | low, medium (default), high |
| Pricing mode | Per image, per quality tier |
| Default latency | ~30s medium 1024² (low ~19s, high ~60s, n=3 high ~3min) |
| Output | Blob-hosted URL (Vercel CDN, no expiring OpenAI URL) |
Pricing
Per image at 1024². List = OpenAI provider cost × 1.35 markup, rounded up to a sensible cent boundary so margin floors at ~35% across every tier.| Quality | Provider cost | Kyma list | Margin |
|---|---|---|---|
low | $0.010 | $0.014 | 40% |
medium (default) | $0.060 | $0.081 | 35% |
high | $0.220 | $0.297 | 35% |
n: 3) scale linearly: high × 3 = 0.297 up front, no refund-and-rebill drift.
Compared to other image models on Kyma
| Strength | gpt-image-2 | flux-2-pro | ideogram-v3 | recraft-v4-pro |
|---|---|---|---|---|
| Text in image (English) | ★★★★★ | ★★★ | ★★★★ | ★★★ |
| Multilingual text | ★★★★★ | ★★ | ★★★ | ★★ |
| Photoreal humans | ★★★★ | ★★★★★ | ★★★★ | ★★★★ |
| Composition reasoning | ★★★★★ | ★★★★ | ★★★ | ★★★★ |
| Print quality (4MP) | ★★★ | ★★★★ | ★★★ | ★★★★★ |
| Multi-reference blend | — | 10 sources | — | — |
| Native SVG output | — | — | — | recraft-v4-vector |
Use this when
- You need text inside the image to be legible and accurate (logos, posters, packaging, UI mockups, screenshots, ads).
- The text is non-English (Japanese, Korean, Chinese, Hindi, Bengali, Arabic, etc.).
- The composition needs reasoning (“a chart showing X”, “a diagram of Y”, “a recipe card with Z ingredients”).
- You’d otherwise pay for a designer to set type properly.
Pick something else when
- You need photoreal hero shots with multi-reference blending →
flux-2-protakes up to 10 source images. - You need editable vector files (SVG with paths and layers) →
recraft-v4-vector. - Volume matters more than fidelity — sub-cent budget tier →
minimax-image-01at $0.005/image. - You need print-ready 4MP without paying gpt-image-2 high tier prices →
recraft-v4-proat $0.338/image.
Example
202 with a job_id; poll GET /v1/jobs/{id} until status is succeeded. The completed job’s output.url is a Vercel blob URL hosted on Kyma’s CDN, not an expiring OpenAI URL.
Quality tier rule of thumb
Tier classification
tier: "quality" (top tier in the unified picker taxonomy alongside flux-2-pro, recraft-v4-pro, ideogram-v3). The default medium quality lands at $0.081 — comparable to flux-kontext-pro / recraft-v4 in the Fast tier — but the SKU’s positioning is the high-end option, with quality: "high" available per request when needed.
See also
- Image Generation overview — full image family
POST /v1/images/generations— endpoint reference- Pricing — cross-catalog comparison