What capabilities does Xiaomi MiMo-V2.5-Flash have?

Xiaomi MiMo-V2.5-Flash supports: text, function_calling, structured_output. Low-cost MiMo model for high-throughput text and coding tasks. Cache writing is currently free for a limited time.

Xiaomi MiMo-V2.5-Flash

Xiaomi MiMo

Updated May 2026. Xiaomi MiMo-V2.5-Flash by Xiaomi MiMo: $0.1/M cache-miss input, $0.3/M output tokens. Cached input: $0.01/M. 256K context, 64K max output. Function Calling & JSON Mode. Free calculator + compare 40+ models.

Input Price

$0.1

cache miss / 1M tokens

Cached Input

$0.01

per 1M tokens

Output Price

$0.3

per 1M tokens

Context Window

256K

tokens

Specifications

Provider	Xiaomi MiMo
Model ID	mimo-v2.5-flash
Input Price	$0.1 / 1M cache-miss tokens
Cached Input Price	$0.01 / 1M tokens
Output Price	$0.3 / 1M tokens
Context Window	256K tokens
Max Output	64K tokens
Capabilities	textfunction_callingstructured_output
Release Date	2026-04
Pricing Source	Official Xiaomi MiMo pricing
Price Verified	2026-04-28
Notes	Low-cost MiMo model for high-throughput text and coding tasks. Cache writing is currently free for a limited time.

Monthly Cost Estimates

Estimated monthly costs based on different daily usage levels (assuming 50% input / 50% output split). Input estimates use cache-miss pricing, so cache-heavy workloads can be lower.

Daily Tokens	Monthly Cost	Annual Cost
10K	$0.06	$0.72
50K	$0.30	$3.60
100K	$0.60	$7.20
500K	$3.00	$36.00
1.0M	$6.00	$72.00

About Xiaomi MiMo-V2.5-Flash

Xiaomi MiMo-V2.5-Flash is a large language model by Xiaomi MiMo. It features a 256K token context window with up to 64K tokens of output per request. The model supports 3 capabilities: text, function_calling, structured_output.

At $0.1 per million cache-miss input tokens and $0.3 per million output tokens, Xiaomi MiMo-V2.5-Flash is positioned as a cost-effective option in the Xiaomi MiMo lineup. Repeated prefix input can be charged at $0.01 per million cached tokens. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.

Xiaomi MiMo-V2.5-Flash Key Details

Pricing: $0.1/M cache-miss input tokens, $0.01/M cached input tokens, $0.3/M output tokens
Context window: 256K tokens — suitable for large documents and codebases
Max output: 64K tokens per response
Capabilities: text, function_calling, structured_output
Highlights: Low-cost MiMo model for high-throughput text and coding tasks. Cache writing is currently free for a limited time.
Released: 2026-04

Other Xiaomi MiMo Models

Xiaomi MiMo-V2.5-Pro

$1 / $3 per 1M · cached input $0.2

Xiaomi MiMo-V2.5

$0.4 / $2 per 1M · cached input $0.08

Similar Price Range

Gemini 2.5 Flash-Lite

Google

$0.1 / $0.4 per 1M

Gemini 2.0 Flash

Google

$0.1 / $0.4 per 1M

DeepSeek V4 Flash

DeepSeek

$0.14 / $0.28 per 1M · cached input $0.0028

Related Tools

AI Token Counter

Count tokens for Xiaomi MiMo-V2.5-Flash

Pricing Calculator

Compare all model prices

Throughput Planner

Plan RPM, TPM, and monthly cost for Xiaomi MiMo-V2.5-Flash

FAQ

How much does Xiaomi MiMo-V2.5-Flash cost?

Xiaomi MiMo-V2.5-Flash costs $0.1 per million cache-miss input tokens and $0.3 per million output tokens. Cached input costs $0.01 per million tokens. For a typical workload of 100K input tokens/day and 50K output tokens/day, expect approximately $0.75/month before cache-hit savings.

What is Xiaomi MiMo-V2.5-Flash's context window?

Xiaomi MiMo-V2.5-Flash supports a context window of 256K tokens. This means your combined input prompt and output response can be up to 256K tokens. The maximum output per response is 64K tokens.

Is Xiaomi MiMo-V2.5-Flash good for my use case?

Xiaomi MiMo-V2.5-Flash supports text, function_calling, structured_output. As a budget-friendly model, it works well for high-volume tasks like classification, summarization, and simple generation. Use our Pricing Calculator to compare with alternatives.