DevTk.AI

AI Model Pricing Calculator

Compare costs across 30+ AI models. Estimate your monthly spend instantly.

Daily input tokens: 100,000 tokens/day = 3,000,000 tokens/month

Daily output tokens: 50,000 tokens/day = 1,500,000 tokens/month

Filter by Provider
Showing 42 of 42 models
Prices updated: 2026-03-31
| Model | Provider | Monthly cost | Input $/1M | Output $/1M | Context (tokens) |
|---|---|---|---|---|---|
| Amazon Nova Micro (Cheapest) | Amazon | $0.3150 | $0.04 | $0.14 | 128,000 |
| Amazon Nova Lite | Amazon | $0.5400 | $0.06 | $0.24 | 300,000 |
| GPT-5 Nano | OpenAI | $0.7500 | $0.05 | $0.40 | 128,000 |
| Gemini 2.5 Flash-Lite | Google | $0.9000 | $0.10 | $0.40 | 1,000,000 |
| Gemini 2.0 Flash | Google | $0.9000 | $0.10 | $0.40 | 1,000,000 |
| Jamba 1.5 Mini | AI21 Labs | $1.20 | $0.20 | $0.40 | 256,000 |
| GPT-4o mini | OpenAI | $1.35 | $0.15 | $0.60 | 128,000 |
| Command R | Cohere | $1.35 | $0.15 | $0.60 | 128,000 |
| Grok 4 Fast | xAI | $1.35 | $0.20 | $0.50 | 2,000,000 |
| Mistral Small 3.1 | Mistral | $1.50 | $0.20 | $0.60 | 128,000 |
| Qwen 2.5 Coder 32B | Alibaba | $1.50 | $0.20 | $0.60 | 128,000 |
| Grok 3 Mini | xAI | $1.65 | $0.30 | $0.50 | 131,072 |
| DeepSeek V4 | DeepSeek | $1.65 | $0.30 | $0.50 | 1,000,000 |
| DeepSeek V3.2 | DeepSeek | $2.46 | $0.27 | $1.10 | 128,000 |
| Gemini 3.1 Flash-Lite | Google | $3.00 | $0.25 | $1.50 | 1,000,000 |
| Qwen 2.5 72B | Alibaba | $3.00 | $0.40 | $1.20 | 128,000 |
| GPT-5 Mini | OpenAI | $3.75 | $0.25 | $2.00 | 400,000 |
|  |  | $3.96 | $0.88 | $0.88 | 128,000 |
| Gemini 2.5 Flash | Google | $4.65 | $0.30 | $2.50 | 1,000,000 |
| Kimi K2.5 | Moonshot AI | $4.80 | $0.60 | $2.00 | 128,000 |
| DeepSeek R1 | DeepSeek | $4.94 | $0.55 | $2.19 | 128,000 |
| Amazon Nova Pro | Amazon | $7.20 | $0.80 | $3.20 | 300,000 |
| Mistral Medium 3 | Mistral | $7.50 | $1.00 | $3.00 | 128,000 |
| o3-mini | OpenAI | $9.90 | $1.10 | $4.40 | 200,000 |
| Claude Haiku 4.5 | Anthropic | $10.50 | $1.00 | $5.00 | 200,000 |
| Mistral Large 3 | Mistral | $15.00 | $2.00 | $6.00 | 128,000 |
| Llama 3.1 405B | Meta (via providers) | $15.75 | $3.50 | $3.50 | 128,000 |
|  |  | $18.00 | $2.00 | $8.00 | 200,000 |
| Jamba 1.5 Large | AI21 Labs | $18.00 | $2.00 | $8.00 | 256,000 |
| GPT-5 | OpenAI | $18.75 | $1.25 | $10.00 | 400,000 |
| Gemini 2.5 Pro | Google | $18.75 | $1.25 | $10.00 | 2,000,000 |
| GPT-5.3-Codex | OpenAI | $21.00 | $2.00 | $10.00 | 200,000 |
| GPT-4o | OpenAI | $22.50 | $2.50 | $10.00 | 128,000 |
| Command R+ | Cohere | $22.50 | $2.50 | $10.00 | 128,000 |
| Gemini 3.1 Pro | Google | $24.00 | $2.00 | $12.00 | 2,000,000 |
| GPT-5.4 | OpenAI | $30.00 | $2.50 | $15.00 | 1,100,000 |
| Claude Sonnet 4.6 | Anthropic | $31.50 | $3.00 | $15.00 | 1,000,000 |
| Grok 3 | xAI | $31.50 | $3.00 | $15.00 | 131,072 |
| Grok 4 | xAI | $31.50 | $3.00 | $15.00 | 256,000 |
| Claude Opus 4.6 | Anthropic | $52.50 | $5.00 | $25.00 | 1,000,000 |
|  |  | $135.00 | $15.00 | $60.00 | 200,000 |
| o3-pro | OpenAI | $180.00 | $20.00 | $80.00 | 200,000 |

How to Use This Tool

  1. Enter your estimated daily input tokens (the text you send to the AI) and daily output tokens (the AI's response length).
  2. Use the provider filter to narrow results to specific providers like OpenAI, Anthropic, Google, or others.
  3. Sort by monthly cost, input price, or output price to find the most cost-effective model for your use case.
  4. Click on a provider name to visit their official pricing page and sign up for API access.
  5. Compare multiple models side-by-side to find the best price-to-performance ratio for your specific workload.
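
The filter-and-sort steps above can be approximated offline with a few lines of Python. This is only a sketch: the `Model` class and the `monthly_cost` and `rank_models` helpers are hypothetical names, not part of the tool, and the four sample rows are copied from the listing on this page.

```python
from dataclasses import dataclass


@dataclass
class Model:
    name: str
    provider: str
    input_per_million: float   # dollars per 1M input tokens
    output_per_million: float  # dollars per 1M output tokens


# A small sample of rows taken from the listing on this page.
MODELS = [
    Model("GPT-4o", "OpenAI", 2.50, 10.00),
    Model("GPT-4o mini", "OpenAI", 0.15, 0.60),
    Model("Claude Haiku 4.5", "Anthropic", 1.00, 5.00),
    Model("Gemini 2.5 Flash", "Google", 0.30, 2.50),
]


def monthly_cost(model, daily_input, daily_output, days=30):
    """Estimated monthly spend in dollars for one model."""
    per_day = (daily_input * model.input_per_million
               + daily_output * model.output_per_million) / 1_000_000
    return per_day * days


def rank_models(models, daily_input, daily_output, providers=None):
    """Filter by provider (if given), then sort by monthly cost, cheapest first."""
    selected = [m for m in models if providers is None or m.provider in providers]
    costed = [(m.name, monthly_cost(m, daily_input, daily_output)) for m in selected]
    return sorted(costed, key=lambda pair: pair[1])


for name, cost in rank_models(MODELS, 100_000, 50_000, providers={"OpenAI", "Google"}):
    print(f"{name}: ${cost:.2f}/month")
```

Passing `providers=None` ranks every model in the list, which mirrors leaving the provider filter unset.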

Understanding AI API Pricing in 2026

AI API pricing is based on tokens processed, with separate rates for input tokens (your prompts) and output tokens (the model's responses). Prices are typically quoted per million tokens. For example, GPT-4o charges $2.50 per million input tokens and $10.00 per million output tokens, while Claude Sonnet 4 charges $3.00 and $15.00 respectively.

The AI pricing landscape has become increasingly competitive in 2026. Open-source models like DeepSeek V3 and Llama 3.3 are available at significantly lower costs through providers like Together.ai and Groq. Meanwhile, premium models like GPT-5 and Claude Opus 4 offer superior reasoning at higher price points. The right choice depends on your task complexity and budget.

Several factors beyond per-token pricing affect your total cost: Batch API discounts (typically 50% off for non-real-time processing), prompt caching (reduced rates for repeated prompt prefixes), and context window usage (longer conversations cost more because the conversation history is typically resent as input tokens on every request). Some providers also charge different rates for cached inputs and fresh inputs.
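
To see why longer conversations cost more, here is a rough sketch of how billed input tokens grow when the full history is resent on each turn. It assumes every turn has the same message and reply sizes and that nothing is truncated or cached; `conversation_input_tokens` is a hypothetical helper name.

```python
def conversation_input_tokens(turns, user_tokens_per_turn, reply_tokens_per_turn):
    """Total input tokens billed across a chat when full history is resent each turn."""
    total = 0
    history = 0  # tokens from all earlier messages in the conversation
    for _ in range(turns):
        # Each request carries the prior history plus the new user message.
        total += history + user_tokens_per_turn
        # The new message and the model's reply both join the history.
        history += user_tokens_per_turn + reply_tokens_per_turn
    return total


# 10 turns, 100-token user messages, 200-token replies
print(conversation_input_tokens(10, 100, 200))  # 14500
```

Ten independent 100-token requests would bill only 1,000 input tokens, so under these assumptions the resent history accounts for most of the input cost.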

For cost optimization, consider these strategies: Use smaller models for simple tasks (GPT-4o-mini or Claude Haiku for classification, summarization). Use prompt caching for system prompts that don't change. Batch non-urgent requests for 50% savings. Monitor token usage with observability tools like Helicone or Langfuse.

Last updated: February 2026

FAQ

How is the monthly cost calculated?

Monthly cost = (daily input tokens × input price per token × 30) + (daily output tokens × output price per token × 30). Prices are based on the latest published API pricing.
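
As a minimal sketch, the formula translates directly into Python. The function and parameter names are illustrative; the only conversion needed is from the quoted per-million price to a per-token price.

```python
def monthly_cost(daily_input_tokens, daily_output_tokens,
                 input_price_per_million, output_price_per_million,
                 days_per_month=30):
    """Estimated monthly spend in dollars, following the formula above."""
    # Prices are quoted per 1M tokens, so convert to a per-token rate first.
    input_price_per_token = input_price_per_million / 1_000_000
    output_price_per_token = output_price_per_million / 1_000_000
    input_cost = daily_input_tokens * input_price_per_token * days_per_month
    output_cost = daily_output_tokens * output_price_per_token * days_per_month
    return input_cost + output_cost


# Amazon Nova Lite ($0.06 input, $0.24 output) at 100,000 input
# and 50,000 output tokens per day
cost = monthly_cost(100_000, 50_000, 0.06, 0.24)
print(f"${cost:.2f}/month")  # $0.54/month
```

The result matches the $0.5400 shown for Amazon Nova Lite in the listing above.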

How often are prices updated?

We update pricing data regularly. The last update date is shown on the page. AI model prices change frequently, so always verify with the provider's official pricing page.

Which model is cheapest?

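It depends on your workload. At the workload shown above (100,000 input and 50,000 output tokens per day), Amazon Nova Micro has the lowest monthly cost at $0.3150. For simple tasks, GPT-4o mini and Claude 3.5 Haiku offer excellent price/performance. For complex reasoning, larger models like GPT-4o or Claude 3.5 Sonnet may be more cost-effective despite higher per-token costs.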

What is Batch API pricing?

Several providers offer Batch API pricing at ~50% discount for requests that don't need real-time responses. OpenAI's Batch API, Anthropic's Message Batches, and Google's batch endpoints all offer significant savings for bulk processing like document analysis, data extraction, or content generation jobs.
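
A rough way to estimate the effect is sketched below. It assumes a flat 50% discount on both input and output tokens for the batched share of traffic; actual discounts and eligibility vary by provider, and `cost_with_batching` is a hypothetical helper.

```python
def cost_with_batching(monthly_cost, batch_fraction, batch_discount=0.50):
    """Monthly cost when a fraction of the traffic moves to a discounted batch endpoint.

    batch_fraction: share of spend (0.0 to 1.0) that can tolerate delayed responses.
    batch_discount: assumed discount on batched requests (0.50 = 50% off).
    """
    if not 0.0 <= batch_fraction <= 1.0:
        raise ValueError("batch_fraction must be between 0 and 1")
    realtime_cost = monthly_cost * (1 - batch_fraction)
    batched_cost = monthly_cost * batch_fraction * (1 - batch_discount)
    return realtime_cost + batched_cost


# GPT-4o at $22.50/month with 60% of requests moved to batch processing
print(f"${cost_with_batching(22.50, 0.60):.2f}/month")  # $15.75/month
```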

How does prompt caching reduce costs?

Prompt caching (available from Anthropic and OpenAI) stores frequently used prompt prefixes on the provider's servers. When you send a request with a cached prefix, you pay a reduced rate (typically 75-90% less) for those cached tokens. This is especially valuable for applications with long system prompts or few-shot examples that repeat across requests.
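
The saving can be estimated with a short sketch. It treats the cached-token discount as a parameter (defaulting to 90%, the upper end of the range above) and ignores cache-write surcharges and cache expiry, which differ between providers; `input_cost_with_cache` is a hypothetical helper.

```python
def input_cost_with_cache(input_tokens, cached_fraction,
                          input_price_per_million, cache_discount=0.90):
    """Input cost in dollars when part of the prompt is served from the cache.

    cached_fraction: share of input tokens (0.0 to 1.0) that hit a cached prefix.
    cache_discount: assumed discount on cached tokens (0.90 = 90% off).
    """
    price_per_token = input_price_per_million / 1_000_000
    cached_tokens = input_tokens * cached_fraction
    fresh_tokens = input_tokens - cached_tokens
    fresh_cost = fresh_tokens * price_per_token
    cached_cost = cached_tokens * price_per_token * (1 - cache_discount)
    return fresh_cost + cached_cost


# 3,000,000 input tokens/month at $3.00 per 1M, with 80% hitting the cache
uncached = input_cost_with_cache(3_000_000, 0.0, 3.00)
cached = input_cost_with_cache(3_000_000, 0.8, 3.00)
print(f"${uncached:.2f} -> ${cached:.2f}")  # $9.00 -> $2.52
```

Output tokens are unaffected by prompt caching, so this only reduces the input side of the monthly bill.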

Which model offers the best value in 2026?

It depends on your use case. For simple tasks (classification, extraction), GPT-4o mini and Claude 3.5 Haiku offer excellent value at under $1 per million input tokens. For complex reasoning, Claude Sonnet 4 and GPT-4o balance capability with cost. For maximum quality regardless of price, GPT-5 and Claude Opus 4 lead the market.
