AI Model Pricing Directory
Browse pricing for 40+ AI models including GPT-5, Claude, Gemini, DeepSeek, Llama & more. Compare input/output costs, context windows, and capabilities.
Last updated: 2026-03-31 · 42 models from 12 providers
42 models · 12 providers · Cheapest input: $0.04/M · Max context: 2.0M tokens
Need to estimate costs?
Use our interactive Pricing Calculator to compare models side-by-side and estimate monthly costs.
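If you'd rather do the arithmetic directly, the formula is simple: cost = (input tokens / 1M) x input price + (output tokens / 1M) x output price. A minimal sketch using GPT-5's rates from the table below; the monthly token volumes are hypothetical:

```python
# Estimate monthly spend from per-million-token prices.
# Rates come from the pricing tables; token volumes are made up.

def monthly_cost(input_tokens: int, output_tokens: int,
                 price_in: float, price_out: float) -> float:
    """Cost in USD given token counts and $/M-token prices."""
    return input_tokens / 1e6 * price_in + output_tokens / 1e6 * price_out

# Example: 50M input + 10M output tokens/month on GPT-5 ($1.25 in, $10.00 out)
cost = monthly_cost(50_000_000, 10_000_000, 1.25, 10.00)
print(f"${cost:.2f}")  # $162.50
```

Note that output tokens dominate the bill for generation-heavy workloads, since output rates run 4-8x the input rate for most models listed here.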
OpenAI
11 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| GPT-5.4 · Latest flagship iteration with improved reasoning and computer use. | $2.50 | $15.00 | 1.1M |
| GPT-5 | $1.25 | $10.00 | 400K |
| GPT-5 Mini | $0.25 | $2.00 | 400K |
| GPT-5 Nano | $0.05 | $0.40 | 128K |
| GPT-4o | $2.50 | $10.00 | 128K |
| GPT-4o mini | $0.15 | $0.60 | 128K |
| o3-pro · Highest reasoning capability for the most demanding tasks. | $20.00 | $80.00 | 200K |
| o3 · Standard reasoning model. | $2.00 | $8.00 | 200K |
| o3-mini · Fast reasoning model for coding. | $1.10 | $4.40 | 200K |
| o1 · Legacy reasoning model. | $15.00 | $60.00 | 200K |
| GPT-5.3-Codex · Agentic coding model optimized for code generation and multi-step tasks. 25% faster than GPT-5.2, with a 200K context window and 32K max output. Best for autonomous coding workflows. | $2.00 | $10.00 | 200K |
Anthropic
3 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Claude Opus 4.6 · Most capable Claude model. The 1M context window is currently in beta for eligible orgs. 80.8% SWE-bench. | $5.00 | $25.00 | 1.0M |
| Claude Sonnet 4.6 · Best-value flagship. The 1M context window is currently in beta for eligible orgs. 79.6% SWE-bench. | $3.00 | $15.00 | 1.0M |
| Claude Haiku 4.5 · Fastest Claude model. 200K context, 64K max output. | $1.00 | $5.00 | 200K |
Google
6 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Gemini 3.1 Pro · Google's flagship reasoning model. 2M-token context. Pricing for prompts >200K tokens: $4.00 input, $18.00 output per 1M. | $2.00 | $12.00 | 2.0M |
| Gemini 3.1 Flash-Lite · Fastest Gemini model, optimized for high-throughput multimodal tasks. | $0.25 | $1.50 | 1.0M |
| Gemini 2.5 Pro · Long-context pricing for prompts >200K tokens: $2.50 input, $15.00 output per 1M. | $1.25 | $10.00 | 2.0M |
| Gemini 2.5 Flash | $0.30 | $2.50 | 1.0M |
| Gemini 2.5 Flash-Lite | $0.10 | $0.40 | 1.0M |
| Gemini 2.0 Flash · Deprecated; scheduled for shutdown on June 1, 2026. | $0.10 | $0.40 | 1.0M |
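The Gemini Pro rows use tiered long-context pricing. A sketch of how the tier affects a single request's cost, assuming (as the row wording suggests) that the higher rate applies to the whole request once the prompt exceeds 200K tokens; verify against Google's current pricing page before relying on it:

```python
# Tiered pricing sketch using the Gemini 2.5 Pro rates from the table above.
LONG_CONTEXT_THRESHOLD = 200_000  # tokens

def gemini_25_pro_cost(prompt_tokens: int, output_tokens: int) -> float:
    """Per-request cost in USD under the two published tiers."""
    if prompt_tokens > LONG_CONTEXT_THRESHOLD:
        price_in, price_out = 2.50, 15.00   # long-context tier
    else:
        price_in, price_out = 1.25, 10.00   # standard tier
    return prompt_tokens / 1e6 * price_in + output_tokens / 1e6 * price_out

print(gemini_25_pro_cost(100_000, 8_000))  # standard tier
print(gemini_25_pro_cost(500_000, 8_000))  # long-context tier: 2x input rate
```

Crossing the threshold roughly doubles the input rate, so chunking very long documents into sub-200K prompts can be materially cheaper where the task allows it.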
xAI
4 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Grok 3 | $3.00 | $15.00 | 131K |
| Grok 3 Mini | $0.30 | $0.50 | 131K |
| Grok 4 · xAI's flagship reasoning model. Strong benchmark performance with a 256K context window. | $3.00 | $15.00 | 256K |
| Grok 4 Fast · Fast reasoning variant of Grok 4 with a 2M context window. Extremely cost-efficient at $0.20/M input. | $0.20 | $0.50 | 2.0M |
Meta (via providers)
2 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Llama 3.3 70B · Open-source; pricing varies by hosting provider. Shown: typical cloud API price. | $0.88 | $0.88 | 128K |
| Llama 3.1 405B · Open-source; pricing varies by hosting provider. | $3.50 | $3.50 | 128K |
Mistral
3 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Mistral Large 3 | $2.00 | $6.00 | 128K |
| Mistral Medium 3 | $1.00 | $3.00 | 128K |
| Mistral Small 3.1 | $0.20 | $0.60 | 128K |
DeepSeek
3 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| DeepSeek V3.2 | $0.27 | $1.10 | 128K |
| DeepSeek R1 · Reasoning model with chain-of-thought. | $0.55 | $2.19 | 128K |
| DeepSeek V4 · 1T parameters, hybrid reasoning + generation. Open-weight. | $0.30 | $0.50 | 1.0M |
Moonshot AI
1 model

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Kimi K2.5 · 76.8% SWE-bench. Strongest open-source model. Agent Swarm support. | $0.60 | $2.00 | 128K |
Alibaba
2 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Qwen 2.5 72B · Open-source; competitive with Llama 3.3 70B. | $0.40 | $1.20 | 128K |
| Qwen 2.5 Coder 32B · Code-specialized; open-source. | $0.20 | $0.60 | 128K |
Cohere
2 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Command R+ · Enterprise RAG-optimized with citation grounding. | $2.50 | $10.00 | 128K |
| Command R | $0.15 | $0.60 | 128K |
AI21 Labs
2 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Jamba 1.5 Large · Hybrid Mamba-Transformer (SSM) architecture. 256K context. | $2.00 | $8.00 | 256K |
| Jamba 1.5 Mini | $0.20 | $0.40 | 256K |
Amazon
3 models

| Model | Input $/M | Output $/M | Context |
|---|---|---|---|
| Amazon Nova Pro · Via AWS Bedrock. 300K context. | $0.80 | $3.20 | 300K |
| Amazon Nova Lite | $0.06 | $0.24 | 300K |
| Amazon Nova Micro · Text-only. Lowest-cost option on Bedrock. | $0.04 | $0.14 | 128K |
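Raw input price isn't the whole story: a model's effective rate depends on your input:output mix, because output tokens typically cost several times more than input tokens. A sketch of ranking a few models from the tables above by blended $/M; the shortlist and the 20% output share are illustrative assumptions, not a recommendation:

```python
# Blended $/M at a given output share, for a hypothetical shortlist
# drawn from the tables above (input rate, output rate).
MODELS = {
    "GPT-5 Nano":            (0.05, 0.40),
    "Gemini 2.5 Flash-Lite": (0.10, 0.40),
    "Amazon Nova Micro":     (0.04, 0.14),
    "DeepSeek V3.2":         (0.27, 1.10),
}

def blended_price(price_in: float, price_out: float,
                  output_share: float = 0.2) -> float:
    """$/M tokens assuming `output_share` of all tokens are output."""
    return (1 - output_share) * price_in + output_share * price_out

cheapest = min(MODELS, key=lambda m: blended_price(*MODELS[m]))
print(cheapest)  # Amazon Nova Micro
```

At a 20% output share, Nova Micro's blended rate works out to $0.06/M versus $0.12/M for GPT-5 Nano, but check capability fit (Nova Micro is text-only) before deciding on price alone.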