What capabilities does Gemini 3.1 Flash-Lite have?

Gemini 3.1 Flash-Lite supports: text, vision, audio, function_calling. Fastest Gemini model, 2.5x faster than 2.5 Flash. Optimized for high-throughput, cost-efficient tasks with multimodal support.

Gemini 3.1 Flash-Lite

Google

Gemini 3.1 Flash-Lite by Google costs $0.25/M input, $1.50/M output tokens. 1.0M context window, 66K max output. Supports Vision & Audio. Updated March 2026. Free pricing calculator & comparison with 40+ AI models.

Input Price

$0.25

per 1M tokens

Output Price

$1.50

per 1M tokens

Context Window

1.0M

tokens

Specifications

Provider	Google
Model ID	gemini-3-1-flash-lite
Input Price	$0.25 / 1M tokens
Output Price	$1.5 / 1M tokens
Context Window	1.0M tokens
Max Output	66K tokens
Capabilities	textvisionaudiofunction_calling
Release Date	2026-03
Notes	Fastest Gemini model, 2.5x faster than 2.5 Flash. Optimized for high-throughput, cost-efficient tasks with multimodal support.

Monthly Cost Estimates

Estimated monthly costs based on different daily usage levels (assuming 50% input / 50% output split).

Daily Tokens	Monthly Cost	Annual Cost
10K	$0.26	$3.15
50K	$1.31	$15.75
100K	$2.63	$31.50
500K	$13.13	$157.50
1.0M	$26.25	$315.00

About Gemini 3.1 Flash-Lite

Gemini 3.1 Flash-Lite is a large language model by Google. It features a 1.0M token context window with up to 66K tokens of output per request. The model supports 4 capabilities: text, vision, audio, function_calling.

At $0.25 per million input tokens and $1.5 per million output tokens, Gemini 3.1 Flash-Lite is positioned as a cost-effective option in the Google lineup. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.

Gemini 3.1 Flash-Lite Key Details

Pricing: $0.25/M input tokens, $1.5/M output tokens
Context window: 1.0M tokens — one of the largest available
Max output: 66K tokens per response
Capabilities: text, vision, audio, function_calling
Highlights: Fastest Gemini model, 2.5x faster than 2.5 Flash. Optimized for high-throughput, cost-efficient tasks with multimodal support.
Released: 2026-03

Other Google Models

Gemini 3.1 Pro

$2 / $12 per 1M

Gemini 2.5 Pro

$1.25 / $10 per 1M

Gemini 2.5 Flash

$0.15 / $0.6 per 1M

Similar Price Range

GPT-5 Mini

OpenAI

$0.25 / $2 per 1M

DeepSeek V3.2

DeepSeek

$0.27 / $1.1 per 1M

Grok 3 Mini

xAI

$0.3 / $0.5 per 1M

Related Tools

AI Token Counter

Count tokens for Gemini 3.1 Flash-Lite

Pricing Calculator

Compare all model prices

System Prompt Generator

Build prompts for Gemini 3.1 Flash-Lite

FAQ

How much does Gemini 3.1 Flash-Lite cost?

Gemini 3.1 Flash-Lite costs $0.25 per million input tokens and $1.5 per million output tokens. For a typical workload of 100K tokens/day, expect approximately $3.00/month.

What is Gemini 3.1 Flash-Lite's context window?

Gemini 3.1 Flash-Lite supports a context window of 1.0M tokens. This means your combined input prompt and output response can be up to 1.0M tokens. The maximum output per response is 66K tokens.

Is Gemini 3.1 Flash-Lite good for my use case?

Gemini 3.1 Flash-Lite supports text, vision, audio, function_calling. As a budget-friendly model, it works well for high-volume tasks like classification, summarization, and simple generation. Use our Pricing Calculator to compare with alternatives.