What capabilities does o3 have?

o3 supports: text, vision, function_calling, structured_output. Standard reasoning model.

o3

Q: How much does o3 cost?

o3 costs $2.00 per million cache-miss input tokens and $8.00 per million output tokens. Cached input is $0.5 per million tokens.

Q: What is o3's context window?

o3 supports a context window of 200K tokens. Its maximum output is 100K tokens per response.

OpenAI

Updated July 2026. o3 by OpenAI: $2.00/M cache-miss input, $8.00/M output tokens. Cached input: $0.5/M. 200K context, 100K max output. Vision & Function Calling. Free calculator + compare 40+ models.

Input Price

$2.00

cache miss / 1M tokens

Cached Input

$0.5

per 1M tokens

Output Price

$8.00

per 1M tokens

Context Window

200K

tokens

Specifications

Provider	OpenAI
Model ID	o3
Input Price	$2 / 1M cache-miss tokens
Cached Input Price	$0.5 / 1M tokens
Output Price	$8 / 1M tokens
Context Window	200K tokens
Max Output	100K tokens
Capabilities	textvisionfunction_callingstructured_output
Release Date	2025-06
Pricing Source	Official OpenAI pricing
Price Verified	2026-07-14 · GPT-5.6 Sol, Terra, and Luna pricing was refreshed against OpenAI official pricing and model docs.
Notes	Standard reasoning model.

Monthly Cost Estimates

Estimated monthly costs based on different daily usage levels (assuming 50% input / 50% output split). Input estimates use cache-miss pricing, so cache-heavy workloads can be lower.

Daily Tokens	Monthly Cost	Annual Cost
10K	$1.50	$18.00
50K	$7.50	$90.00
100K	$15.00	$180.00
500K	$75.00	$900.00
1.0M	$150.00	$1800.00

About o3

o3 is a large language model by OpenAI. It features a 200K token context window with up to 100K tokens of output per request. The model supports 4 capabilities: text, vision, function_calling, structured_output.

At $2 per million cache-miss input tokens and $8 per million output tokens, o3 is positioned as a mid-range option in the OpenAI lineup. Repeated prefix input can be charged at $0.5 per million cached tokens. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.

o3 Key Details

Pricing: $2/M cache-miss input tokens, $0.5/M cached input tokens, $8/M output tokens
Context window: 200K tokens — suitable for large documents and codebases
Max output: 100K tokens per response
Capabilities: text, vision, function_calling, structured_output
Highlights: Standard reasoning model.
Released: 2025-06

Other OpenAI Models

GPT-5.6 Sol

$5 / $30 per 1M · cached input $0.5

GPT-5.6 Terra

$2.5 / $15 per 1M · cached input $0.25

GPT-5.6 Luna

$1 / $6 per 1M · cached input $0.1

Similar Price Range

Gemini 3.1 Pro Preview

Google

$2 / $12 per 1M · cached input $0.2

Grok 4.5

xAI

$2 / $6 per 1M · cached input $0.5

Jamba 1.5 Large

AI21 Labs

$2 / $8 per 1M

Related Tools

AI Token Counter

Count tokens for o3

Pricing Calculator

Compare all model prices

Throughput Planner

Plan RPM, TPM, and monthly cost for o3

FAQ

How much does o3 cost?

o3 costs $2 per million cache-miss input tokens and $8 per million output tokens. Cached input costs $0.5 per million tokens. For a typical workload of 100K input tokens/day and 50K output tokens/day, expect approximately $18.00/month before cache-hit savings.

What is o3's context window?

o3 supports a context window of 200K tokens. This means your combined input prompt and output response can be up to 200K tokens. The maximum output per response is 100K tokens.

Is o3 good for my use case?

o3 supports text, vision, function_calling, structured_output. As a mid-range model, it balances capability and cost for most production use cases. Use our Pricing Calculator to compare with alternatives.