DevTk.AI

Xiaomi MiMo-V2.5-Pro

Xiaomi MiMo

Updated May 2026. Xiaomi MiMo-V2.5-Pro by Xiaomi MiMo costs $1.00/M cache-miss input tokens and $3.00/M output tokens; cached input is $0.20/M. Above 256K input tokens, long-context pricing applies: $2.00/M cache-miss input, $0.40/M cached input, $6.00/M output. 1.0M context window, 128K max output. Supports function calling and JSON mode. Free calculator and comparison across 40+ models.

Input Price

$1.00

cache miss / 1M tokens

Cached Input

$0.2

per 1M tokens

Output Price

$3.00

per 1M tokens

Context Window

1.0M

tokens

Specifications

Provider: Xiaomi MiMo
Model ID: mimo-v2.5-pro
Input Price: $1 / 1M cache-miss tokens
Cached Input Price: $0.2 / 1M tokens
Output Price: $3 / 1M tokens
Long Context Threshold: 256K input tokens
Long Context Pricing: $2 / 1M cache-miss input, $0.4 / 1M cached input, $6 / 1M output
Context Window: 1.0M tokens
Max Output: 128K tokens
Capabilities: text, function_calling, structured_output
Release Date: 2026-04
Pricing Source: Official Xiaomi MiMo pricing
Price Verified: 2026-04-28
Notes: Open-sourced under MIT. Overseas API price shown for input up to 256K; 256K-1M input uses the long-context prices. Domestic pricing is ¥7/¥1.40/¥21 per 1M for miss/hit/output.
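
The two price tiers in the specifications can be combined into a per-request estimate. The following is a minimal sketch, not an official billing formula: the function name `request_cost` is hypothetical, "256K" is read as 256,000 tokens, and it assumes that once total input exceeds the threshold the whole request (cached input, cache-miss input, and output) is billed at long-context rates. Check the official pricing page before relying on either assumption.

```python
# Prices in USD per 1M tokens, taken from the specifications above.
STANDARD = {"miss": 1.00, "hit": 0.20, "output": 3.00}
LONG_CONTEXT = {"miss": 2.00, "hit": 0.40, "output": 6.00}

# Assumption: "256K" is read as 256,000 tokens (it could also mean 262,144).
LONG_CONTEXT_THRESHOLD = 256_000


def request_cost(miss_tokens: int, hit_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a single request.

    Assumption: if total input exceeds the threshold, every token in the
    request is billed at the long-context rates.
    """
    total_input = miss_tokens + hit_tokens
    rates = LONG_CONTEXT if total_input > LONG_CONTEXT_THRESHOLD else STANDARD
    return (
        miss_tokens * rates["miss"]
        + hit_tokens * rates["hit"]
        + output_tokens * rates["output"]
    ) / 1_000_000


if __name__ == "__main__":
    # A short request billed at standard rates.
    print(f"${request_cost(100_000, 0, 50_000):.2f}")
    # A 300K-token prompt billed at long-context rates.
    print(f"${request_cost(300_000, 0, 10_000):.2f}")
```

Under these assumptions a request just above the threshold costs roughly twice as much per token as one just below it, so trimming a prompt back under 256K input tokens can matter more than the raw token saving suggests.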

Monthly Cost Estimates

Estimated monthly costs at different daily usage levels, assuming a 50% input / 50% output split, a 30-day month, and standard (up to 256K input) rates. Input estimates use cache-miss pricing, so cache-heavy workloads can cost less.

Daily Tokens    Monthly Cost    Annual Cost
10K             $0.60           $7.20
50K             $3.00           $36.00
100K            $6.00           $72.00
500K            $30.00          $360.00
1.0M            $60.00          $720.00
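
The figures in this table can be reproduced with a few lines of arithmetic. This sketch assumes a 30-day month and an annual cost of twelve such months, both inferred from the table values rather than stated by the source; `monthly_cost` is a hypothetical helper name.

```python
INPUT_PRICE = 1.00    # USD per 1M cache-miss input tokens
OUTPUT_PRICE = 3.00   # USD per 1M output tokens
DAYS_PER_MONTH = 30   # assumption inferred from the table values


def monthly_cost(daily_tokens: int, input_share: float = 0.5) -> float:
    """Monthly USD cost for a daily token volume split between input and output."""
    input_tokens = daily_tokens * input_share
    output_tokens = daily_tokens - input_tokens
    daily = (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000
    return daily * DAYS_PER_MONTH


if __name__ == "__main__":
    for daily in (10_000, 50_000, 100_000, 500_000, 1_000_000):
        month = monthly_cost(daily)
        print(f"{daily:>9,}  ${month:.2f}  ${month * 12:.2f}")
```

Changing `input_share` shows how sensitive the estimate is to the input/output mix: output tokens cost three times as much as cache-miss input, so output-heavy workloads land above the table values.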

About Xiaomi MiMo-V2.5-Pro

Xiaomi MiMo-V2.5-Pro is a large language model by Xiaomi MiMo. It features a 1.0M token context window with up to 128K tokens of output per request. The model supports 3 capabilities: text, function_calling, structured_output.

At $1 per million cache-miss input tokens and $3 per million output tokens, Xiaomi MiMo-V2.5-Pro is positioned as a mid-range option in the Xiaomi MiMo lineup. Repeated prefix input can be charged at $0.2 per million cached tokens. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.

Xiaomi MiMo-V2.5-Pro Key Details

  • Pricing: $1/M cache-miss input tokens, $0.2/M cached input tokens, $3/M output tokens
  • Context window: 1.0M tokens, one of the largest available
  • Max output: 128K tokens per response
  • Capabilities: text, function_calling, structured_output
  • Highlights: Open-sourced under MIT. Overseas API price shown for input up to 256K; 256K-1M input uses the long-context prices. Domestic pricing is ¥7/¥1.40/¥21 per 1M for miss/hit/output.
  • Released: 2026-04

FAQ

How much does Xiaomi MiMo-V2.5-Pro cost?

Xiaomi MiMo-V2.5-Pro costs $1 per million cache-miss input tokens and $3 per million output tokens. Cached input costs $0.2 per million tokens. For a typical workload of 100K input tokens/day and 50K output tokens/day, expect approximately $7.50/month before cache-hit savings.
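
To see how cache hits change that $7.50 figure, the same workload can be priced with a share of the input billed at the cached rate. This is a rough sketch: `monthly_cost_with_cache` and its `cache_hit_rate` parameter are hypothetical, a 30-day month is assumed, and real hit rates depend on how much of each prompt is a repeated prefix.

```python
MISS_PRICE = 1.00    # USD per 1M cache-miss input tokens
HIT_PRICE = 0.20     # USD per 1M cached input tokens
OUTPUT_PRICE = 3.00  # USD per 1M output tokens


def monthly_cost_with_cache(
    input_per_day: int,
    output_per_day: int,
    cache_hit_rate: float,
    days: int = 30,  # assumption: 30-day month
) -> float:
    """Monthly USD cost when a fraction of input tokens is served from cache."""
    if not 0.0 <= cache_hit_rate <= 1.0:
        raise ValueError("cache_hit_rate must be between 0 and 1")
    hit_tokens = input_per_day * cache_hit_rate
    miss_tokens = input_per_day - hit_tokens
    daily = (
        miss_tokens * MISS_PRICE
        + hit_tokens * HIT_PRICE
        + output_per_day * OUTPUT_PRICE
    ) / 1_000_000
    return daily * days


if __name__ == "__main__":
    for rate in (0.0, 0.5, 1.0):
        cost = monthly_cost_with_cache(100_000, 50_000, rate)
        print(f"hit rate {rate:.0%}: ${cost:.2f}/month")
```

With no cache hits this reproduces the $7.50 estimate; at a 50% hit rate it drops to about $6.30. Output tokens dominate the bill in this workload, so caching alone cannot bring it below $4.50 of output cost per month.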

What is Xiaomi MiMo-V2.5-Pro's context window?

Xiaomi MiMo-V2.5-Pro supports a context window of 1.0M tokens. This means your combined input prompt and output response can be up to 1.0M tokens. The maximum output per response is 128K tokens.
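
A quick way to apply these two limits is to compute how much output a given prompt leaves room for. The sketch below reads "1.0M" as 1,000,000 tokens and "128K" as 128,000 tokens, which is an assumption (the exact limits may be powers of two); `max_output_budget` is a hypothetical helper.

```python
CONTEXT_WINDOW = 1_000_000  # assumption: "1.0M" read as 1,000,000 tokens
MAX_OUTPUT = 128_000        # assumption: "128K" read as 128,000 tokens


def max_output_budget(input_tokens: int) -> int:
    """Largest output length that fits alongside the prompt.

    Returns 0 when the prompt alone fills or exceeds the context window.
    """
    remaining = CONTEXT_WINDOW - input_tokens
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for prompt in (100_000, 950_000, 1_200_000):
        print(f"{prompt:>9,} input tokens -> up to {max_output_budget(prompt):,} output tokens")
```

The output cap only starts shrinking once the prompt exceeds 872,000 tokens under these assumptions; below that, the 128K per-response limit is the binding constraint.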

Is Xiaomi MiMo-V2.5-Pro good for my use case?

Xiaomi MiMo-V2.5-Pro supports text, function_calling, structured_output. As a mid-range model, it balances capability and cost for most production use cases. Use our Pricing Calculator to compare with alternatives.