Gemini 2.5 Flash
Gemini 2.5 Flash by Google costs $0.15/M input tokens and $0.60/M output tokens, with a 1.0M-token context window. Updated February 2026.
Input Price: $0.15 per 1M tokens
Output Price: $0.60 per 1M tokens
Context Window: 1.0M tokens
Specifications
| Provider | Google |
| Model ID | gemini-2-5-flash |
| Input Price | $0.15 / 1M tokens |
| Output Price | $0.60 / 1M tokens |
| Context Window | 1.0M tokens |
| Max Output | 8K tokens |
| Capabilities | text, vision, audio, function_calling, structured_output |
| Release Date | 2025-05 |
Monthly Cost Estimates
Estimated monthly and annual costs at different daily usage levels, assuming a 50% input / 50% output token split, a 30-day month, and a 12-month year.
| Daily Tokens | Monthly Cost | Annual Cost |
|---|---|---|
| 10K | $0.11 | $1.35 |
| 50K | $0.56 | $6.75 |
| 100K | $1.13 | $13.50 |
| 500K | $5.63 | $67.50 |
| 1.0M | $11.25 | $135.00 |
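The figures in the table can be reproduced with a short calculation. The following is a minimal sketch, assuming the 50/50 split stated above plus a 30-day month and a 12-month year (inferred from the table's arithmetic, not stated by the page). The function names `monthly_cost`, `annual_cost`, and `usd` are illustrative, not part of any Google API.

```python
from decimal import Decimal, ROUND_HALF_UP

# Prices taken from the Specifications table (USD per 1M tokens).
INPUT_PRICE_PER_M = Decimal("0.15")
OUTPUT_PRICE_PER_M = Decimal("0.60")
MILLION = Decimal(1_000_000)


def monthly_cost(daily_tokens: int,
                 input_share: Decimal = Decimal("0.5"),
                 days_per_month: int = 30) -> Decimal:
    """Unrounded monthly cost in USD for a given daily token volume.

    Assumption: a 30-day month and a fixed input/output split.
    """
    monthly_tokens = Decimal(daily_tokens) * days_per_month
    input_tokens = monthly_tokens * input_share
    output_tokens = monthly_tokens - input_tokens
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / MILLION


def annual_cost(daily_tokens: int,
                input_share: Decimal = Decimal("0.5")) -> Decimal:
    """Unrounded annual cost, assumed here to be 12 times the monthly cost."""
    return monthly_cost(daily_tokens, input_share) * 12


def usd(amount: Decimal) -> Decimal:
    """Round to cents, rounding halves up."""
    return amount.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    for daily in (10_000, 50_000, 100_000, 500_000, 1_000_000):
        print(f"{daily:>9,} tokens/day: "
              f"${usd(monthly_cost(daily))}/month, ${usd(annual_cost(daily))}/year")
```

Changing `input_share` shows how sensitive the estimate is to the split: output tokens cost four times as much as input tokens at these rates, so output-heavy workloads land above the table's figures.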
About Gemini 2.5 Flash
Gemini 2.5 Flash is a large language model by Google. It features a 1.0M token context window with up to 8K tokens of output per request. The model supports 5 capabilities: text, vision, audio, function_calling, structured_output.
At $0.15 per million input tokens and $0.60 per million output tokens, Gemini 2.5 Flash is positioned as a cost-effective option in the Google lineup. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.
FAQ
How much does Gemini 2.5 Flash cost?
Gemini 2.5 Flash costs $0.15 per million input tokens and $0.60 per million output tokens. For a workload of 100K tokens/day with a 50% input / 50% output split, expect approximately $1.13/month.
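For a single request, the cost follows directly from the two per-token rates. This is a rough sketch using the prices quoted on this page; `request_cost` is a hypothetical helper, and the token counts in the usage line are made-up example values.

```python
# Prices quoted on this page (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Approximate USD cost of one request from its actual token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


if __name__ == "__main__":
    # Example values only: a 2,000-token prompt with a 500-token reply.
    print(f"${request_cost(2_000, 500):.6f}")
```

Unlike a fixed-split estimate, this uses measured token counts per request, which is usually closer to what a bill reflects.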
What is Gemini 2.5 Flash's context window?
Gemini 2.5 Flash supports a context window of 1.0M tokens. This means your combined input prompt and output response can be up to 1.0M tokens. The maximum output per response is 8K tokens.
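The two limits interact: the output is capped both by the per-response maximum and by whatever room the prompt leaves in the context window. Below is a minimal sketch of that check. It assumes "1.0M" means 1,000,000 tokens and "8K" means 8,000 tokens; the exact limits may differ (for example, powers of two), and `max_output_budget` is an illustrative name rather than an API call.

```python
# Assumption: rounded figures from this page, read as decimal values.
CONTEXT_WINDOW = 1_000_000  # "1.0M" tokens, combined input and output
MAX_OUTPUT = 8_000          # "8K" tokens per response


def max_output_budget(prompt_tokens: int) -> int:
    """Largest output length that fits both limits for a given prompt size.

    Returns 0 when the prompt alone fills or exceeds the context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for prompt in (10_000, 995_000, 1_200_000):
        print(prompt, max_output_budget(prompt))
```

For most prompts the per-response cap is the binding limit; the context window only matters once the prompt comes within 8K tokens of 1.0M.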
Is Gemini 2.5 Flash good for my use case?
Gemini 2.5 Flash supports text, vision, audio, function_calling, structured_output. As a budget-friendly model, it works well for high-volume tasks like classification, summarization, and simple generation. Use our Pricing Calculator to compare with alternatives.