o3
OpenAIUpdated May 2026. o3 by OpenAI: $2.00/M cache-miss input, $8.00/M output tokens. 200K context, 100K max output. Function Calling & JSON Mode. Free calculator + compare 40+ models.
Input Price
$2.00
cache miss / 1M tokens
Output Price
$8.00
per 1M tokens
Context Window
200K
tokens
Specifications
| Provider | OpenAI |
| Model ID | o3 |
| Input Price | $2 / 1M cache-miss tokens |
| Output Price | $8 / 1M tokens |
| Context Window | 200K tokens |
| Max Output | 100K tokens |
| Capabilities | textfunction_callingstructured_output |
| Release Date | 2025-06 |
| Pricing Source | Official OpenAI pricing |
| Price Verified | 2026-05-06 · Pricing guides were refreshed against OpenAI official pricing/model docs. |
| Notes | Standard reasoning model. |
Monthly Cost Estimates
Estimated monthly costs based on different daily usage levels (assuming 50% input / 50% output split). Input estimates use cache-miss pricing, so cache-heavy workloads can be lower.
| Daily Tokens | Monthly Cost | Annual Cost |
|---|---|---|
| 10K | $1.50 | $18.00 |
| 50K | $7.50 | $90.00 |
| 100K | $15.00 | $180.00 |
| 500K | $75.00 | $900.00 |
| 1.0M | $150.00 | $1800.00 |
About o3
o3 is a large language model by OpenAI. It features a 200K token context window with up to 100K tokens of output per request. The model supports 3 capabilities: text, function_calling, structured_output.
At $2 per million cache-miss input tokens and $8 per million output tokens, o3 is positioned as a mid-range option in the OpenAI lineup. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.
o3 Key Details
- Pricing: $2/M cache-miss input tokens, $8/M output tokens
- Context window: 200K tokens — suitable for large documents and codebases
- Max output: 100K tokens per response
- Capabilities: text, function_calling, structured_output
- Highlights: Standard reasoning model.
- Released: 2025-06
Other OpenAI Models
Similar Price Range
Related Tools
FAQ
How much does o3 cost?
o3 costs $2 per million cache-miss input tokens and $8 per million output tokens. For a typical workload of 100K input tokens/day and 50K output tokens/day, expect approximately $18.00/month before cache-hit savings.
What is o3's context window?
o3 supports a context window of 200K tokens. This means your combined input prompt and output response can be up to 200K tokens. The maximum output per response is 100K tokens.
Is o3 good for my use case?
o3 supports text, function_calling, structured_output. As a mid-range model, it balances capability and cost for most production use cases. Use our Pricing Calculator to compare with alternatives.