GPT-5.4 Mini
OpenAI
Updated June 2026. GPT-5.4 Mini by OpenAI costs $0.75 per 1M cache-miss input tokens, $0.075 per 1M cached input tokens, and $4.50 per 1M output tokens. It has a 400K-token context window and a 128K-token maximum output, and supports vision and function calling.
- Input Price: $0.75 per 1M tokens (cache miss)
- Cached Input: $0.075 per 1M tokens
- Output Price: $4.50 per 1M tokens
- Context Window: 400K tokens
Specifications
| Provider | OpenAI |
| Model ID | gpt-5.4-mini |
| Input Price | $0.75 / 1M cache-miss tokens |
| Cached Input Price | $0.075 / 1M tokens |
| Output Price | $4.50 / 1M tokens |
| Context Window | 400K tokens |
| Max Output | 128K tokens |
| Capabilities | text, vision, function_calling, structured_output |
| Release Date | 2026-05 |
| Pricing Source | Official OpenAI pricing |
| Price Verified | 2026-06-14 · Pricing guides were refreshed against OpenAI official pricing/model docs. |
| Notes | Current lower-cost GPT-5.4 production model. |
Monthly Cost Estimates
Estimated monthly costs at different daily usage levels, assuming a 50% input / 50% output token split. The figures correspond to a 30-day month, and the annual column is 12 times the monthly cost. Input is priced at the cache-miss rate, so cache-heavy workloads can cost less.
| Daily Tokens | Monthly Cost | Annual Cost |
|---|---|---|
| 10K | $0.79 | $9.45 |
| 50K | $3.94 | $47.25 |
| 100K | $7.88 | $94.50 |
| 500K | $39.38 | $472.50 |
| 1.0M | $78.75 | $945.00 |
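
The table values can be reproduced with a short calculation. The sketch below is illustrative only: the function name and constants are made up for this example, and the 30-day month and 12-month year are assumptions inferred from the figures in the table.

```python
# Prices in USD per 1M tokens, taken from the specifications above.
INPUT_PRICE_PER_M = 0.75   # cache-miss input
OUTPUT_PRICE_PER_M = 4.50  # output

def monthly_cost(daily_tokens, input_share=0.5, days_per_month=30):
    """Estimate the monthly cost in USD for a given daily token volume.

    input_share is the fraction of daily tokens that are input;
    the remainder is treated as output.
    """
    input_tokens = daily_tokens * input_share
    output_tokens = daily_tokens * (1 - input_share)
    token_dollars = (input_tokens * INPUT_PRICE_PER_M
                     + output_tokens * OUTPUT_PRICE_PER_M)
    # Scale to a month first, then convert from per-1M pricing to dollars.
    return token_dollars * days_per_month / 1_000_000

for daily in (10_000, 50_000, 100_000, 500_000, 1_000_000):
    month = monthly_cost(daily)
    print(f"{daily:>9,} tokens/day: ${month:,.2f}/month, ${month * 12:,.2f}/year")
```

Changing `input_share` shows how sensitive the estimate is to the input/output mix, since output tokens cost six times as much as cache-miss input tokens.
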
About GPT-5.4 Mini
GPT-5.4 Mini is a large language model by OpenAI. It has a 400K-token context window and can produce up to 128K tokens of output per request. The model supports four capabilities: text, vision, function_calling, and structured_output.
At $0.75 per million cache-miss input tokens and $4.50 per million output tokens, GPT-5.4 Mini is positioned as a cost-effective option in the OpenAI lineup. Input that repeats a cached prompt prefix can be charged at $0.075 per million tokens, one tenth of the cache-miss rate. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.
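
To see how much caching can lower the input price, the two input rates can be blended by the share of input tokens served from cache. This is a minimal sketch: the `cache_hit_ratio` parameter and the function name are hypothetical, and the actual share of cached tokens depends on how your prompts are structured and how the provider applies caching.

```python
# Prices in USD per 1M input tokens, taken from the page above.
CACHE_MISS_PRICE_PER_M = 0.75
CACHED_PRICE_PER_M = 0.075

def blended_input_price(cache_hit_ratio):
    """Return the effective input price per 1M tokens.

    cache_hit_ratio is the assumed fraction (0.0 to 1.0) of input
    tokens billed at the cached rate.
    """
    if not 0.0 <= cache_hit_ratio <= 1.0:
        raise ValueError("cache_hit_ratio must be between 0.0 and 1.0")
    return (cache_hit_ratio * CACHED_PRICE_PER_M
            + (1 - cache_hit_ratio) * CACHE_MISS_PRICE_PER_M)

for ratio in (0.0, 0.5, 0.9):
    print(f"{ratio:.0%} cached: ${blended_input_price(ratio):.4f} per 1M input tokens")
```

Output tokens are not affected by caching, so workloads dominated by output see a smaller overall saving than the blended input price alone suggests.
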
GPT-5.4 Mini Key Details
- Pricing: $0.75/M cache-miss input tokens, $0.075/M cached input tokens, $4.50/M output tokens
- Context window: 400K tokens, suitable for large documents and codebases
- Max output: 128K tokens per response
- Capabilities: text, vision, function_calling, structured_output
- Highlights: Current lower-cost GPT-5.4 production model.
- Released: 2026-05
FAQ
How much does GPT-5.4 Mini cost?
GPT-5.4 Mini costs $0.75 per million cache-miss input tokens and $4.50 per million output tokens. Cached input costs $0.075 per million tokens. For a workload of 100K input tokens/day and 50K output tokens/day, that is $0.075 for input plus $0.225 for output, or $0.30/day, which comes to approximately $9.00 over a 30-day month before cache-hit savings.
What is GPT-5.4 Mini's context window?
GPT-5.4 Mini supports a context window of 400K tokens. The input prompt and the output response share this window, so their combined length can be up to 400K tokens. A single response is capped at 128K tokens.
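
The relationship between the two limits can be sketched as a simple budget check. This is an illustration under stated assumptions: it treats 400K and 128K as exactly 400,000 and 128,000 tokens, and the function name is hypothetical rather than part of any OpenAI SDK.

```python
# Limits from the specifications above, assumed to be exact token counts.
CONTEXT_WINDOW = 400_000
MAX_OUTPUT = 128_000

def max_output_budget(prompt_tokens):
    """Return how many output tokens remain available for a given prompt size.

    The result is limited both by the per-response output cap and by
    whatever space is left in the shared context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must not be negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))

for prompt in (100_000, 300_000, 400_000):
    print(f"{prompt:,} prompt tokens leave room for {max_output_budget(prompt):,} output tokens")
```

Under these assumptions, prompts of up to 272,000 tokens still leave room for a full-length response; beyond that, the available output shrinks with the prompt.
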
Is GPT-5.4 Mini good for my use case?
GPT-5.4 Mini supports text, vision, function_calling, and structured_output. As a budget-friendly model, it works well for high-volume tasks such as classification, summarization, and simple generation. Use our Pricing Calculator to compare it with alternatives.