Gemini 2.5 Flash-Lite
GoogleUpdated May 2026. Gemini 2.5 Flash-Lite by Google: $0.1/M cache-miss input, $0.4/M output tokens. 1.0M context, 66K max output. Vision & Audio. Free calculator + compare 40+ models.
Input Price
$0.1
cache miss / 1M tokens
Output Price
$0.4
per 1M tokens
Context Window
1.0M
tokens
Specifications
| Provider | |
| Model ID | gemini-2-5-flash-lite |
| Input Price | $0.1 / 1M cache-miss tokens |
| Output Price | $0.4 / 1M tokens |
| Context Window | 1.0M tokens |
| Max Output | 66K tokens |
| Capabilities | textvisionaudiofunction_callingstructured_output |
| Release Date | 2025-09 |
| Pricing Source | Official Google pricing |
| Price Verified | 2026-05-06 · Gemini active quotas are project-specific; check AI Studio before production planning. |
Monthly Cost Estimates
Estimated monthly costs based on different daily usage levels (assuming 50% input / 50% output split). Input estimates use cache-miss pricing, so cache-heavy workloads can be lower.
| Daily Tokens | Monthly Cost | Annual Cost |
|---|---|---|
| 10K | $0.07 | $0.90 |
| 50K | $0.38 | $4.50 |
| 100K | $0.75 | $9.00 |
| 500K | $3.75 | $45.00 |
| 1.0M | $7.50 | $90.00 |
About Gemini 2.5 Flash-Lite
Gemini 2.5 Flash-Lite is a large language model by Google. It features a 1.0M token context window with up to 66K tokens of output per request. The model supports 5 capabilities: text, vision, audio, function_calling, structured_output.
At $0.1 per million cache-miss input tokens and $0.4 per million output tokens, Gemini 2.5 Flash-Lite is positioned as a cost-effective option in the Google lineup. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.
Gemini 2.5 Flash-Lite Key Details
- Pricing: $0.1/M cache-miss input tokens, $0.4/M output tokens
- Context window: 1.0M tokens — one of the largest available
- Max output: 66K tokens per response
- Capabilities: text, vision, audio, function_calling, structured_output
- Released: 2025-09
Other Google Models
Similar Price Range
Related Tools
FAQ
How much does Gemini 2.5 Flash-Lite cost?
Gemini 2.5 Flash-Lite costs $0.1 per million cache-miss input tokens and $0.4 per million output tokens. For a typical workload of 100K input tokens/day and 50K output tokens/day, expect approximately $0.90/month before cache-hit savings.
What is Gemini 2.5 Flash-Lite's context window?
Gemini 2.5 Flash-Lite supports a context window of 1.0M tokens. This means your combined input prompt and output response can be up to 1.0M tokens. The maximum output per response is 66K tokens.
Is Gemini 2.5 Flash-Lite good for my use case?
Gemini 2.5 Flash-Lite supports text, vision, audio, function_calling, structured_output. As a budget-friendly model, it works well for high-volume tasks like classification, summarization, and simple generation. Use our Pricing Calculator to compare with alternatives.