What capabilities does Mistral Small 4 have?

Mistral Small 4 supports: text, vision, function_calling, structured_output. Current low-cost Mistral model for high-volume production routing.

Mistral Small 4

Q: What is Mistral Small 4's context window?

Mistral Small 4 supports a context window of 131K tokens with a maximum output of 33K tokens per response.

Mistral

Updated June 2026. Mistral Small 4 by Mistral: $0.1/M cache-miss input, $0.3/M output tokens. 131K context, 33K max output. Vision & Function Calling. Free calculator + compare 40+ models.

Input Price

$0.1

cache miss / 1M tokens

Output Price

$0.3

per 1M tokens

Context Window

131K

tokens

Specifications

Provider	Mistral
Model ID	mistral-small-4
Input Price	$0.1 / 1M cache-miss tokens
Output Price	$0.3 / 1M tokens
Context Window	131K tokens
Max Output	33K tokens
Capabilities	textvisionfunction_callingstructured_output
Release Date	2026-04
Pricing Source	Official Mistral pricing
Price Verified	2026-06-14 · Large 3, Medium 3.5, and Small 4 pricing refreshed from Mistral official docs.
Notes	Current low-cost Mistral model for high-volume production routing.

Monthly Cost Estimates

Estimated monthly costs based on different daily usage levels (assuming 50% input / 50% output split). Input estimates use cache-miss pricing, so cache-heavy workloads can be lower.

Daily Tokens	Monthly Cost	Annual Cost
10K	$0.06	$0.72
50K	$0.30	$3.60
100K	$0.60	$7.20
500K	$3.00	$36.00
1.0M	$6.00	$72.00

About Mistral Small 4

Mistral Small 4 is a large language model by Mistral. It features a 131K token context window with up to 33K tokens of output per request. The model supports 4 capabilities: text, vision, function_calling, structured_output.

At $0.1 per million cache-miss input tokens and $0.3 per million output tokens, Mistral Small 4 is positioned as a cost-effective option in the Mistral lineup. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.

Mistral Small 4 Key Details

Pricing: $0.1/M cache-miss input tokens, $0.3/M output tokens
Context window: 131K tokens — good for standard conversations and tasks
Max output: 33K tokens per response
Capabilities: text, vision, function_calling, structured_output
Highlights: Current low-cost Mistral model for high-volume production routing.
Released: 2026-04

Other Mistral Models

Mistral Large 3

$0.5 / $1.5 per 1M

Mistral Medium 3.5

$1.5 / $7.5 per 1M

Similar Price Range

Gemini 2.5 Flash-Lite

Google

$0.1 / $0.4 per 1M · cached input $0.01

DeepSeek V4 Flash

DeepSeek

$0.14 / $0.28 per 1M · cached input $0.0028

Xiaomi MiMo-V2.5

Xiaomi MiMo

$0.14 / $0.28 per 1M · cached input $0.0028

Related Tools

AI Token Counter

Count tokens for Mistral Small 4

Pricing Calculator

Compare all model prices

Throughput Planner

Plan RPM, TPM, and monthly cost for Mistral Small 4

FAQ

How much does Mistral Small 4 cost?

Mistral Small 4 costs $0.1 per million cache-miss input tokens and $0.3 per million output tokens. For a typical workload of 100K input tokens/day and 50K output tokens/day, expect approximately $0.75/month before cache-hit savings.

What is Mistral Small 4's context window?

Mistral Small 4 supports a context window of 131K tokens. This means your combined input prompt and output response can be up to 131K tokens. The maximum output per response is 33K tokens.

Is Mistral Small 4 good for my use case?

Mistral Small 4 supports text, vision, function_calling, structured_output. As a budget-friendly model, it works well for high-volume tasks like classification, summarization, and simple generation. Use our Pricing Calculator to compare with alternatives.