What capabilities does Kimi K2.5 have?

Kimi K2.5 supports text, vision, video, and function_calling. It is the current K2.5 multimodal model and accepts text, image, and video input. Prices are China API prices in CNY.

Kimi K2.5

Moonshot AI

Updated July 2026. Kimi K2.5 by Moonshot AI costs ¥4.00 per 1M cache-miss input tokens, ¥0.7 per 1M cached input tokens, and ¥21.00 per 1M output tokens. It offers a 262K-token context window, a 33K-token maximum output, and vision and video input.

Input Price: ¥4.00 per 1M cache-miss tokens
Cached Input: ¥0.7 per 1M tokens
Output Price: ¥21.00 per 1M tokens
Context Window: 262K tokens

Specifications

Provider	Moonshot AI
Model ID	kimi-k2.5
Input Price	¥4.00 / 1M cache-miss tokens
Cached Input Price	¥0.7 / 1M tokens
Output Price	¥21.00 / 1M tokens
Context Window	262K tokens
Max Output	33K tokens
Capabilities	text, vision, video, function_calling
Release Date	2026-01
Pricing Source	Official Moonshot AI pricing
Price Verified	2026-07-19 · Kimi K3 global API pricing is published in USD. Earlier K2-series China API entries remain in CNY.
Notes	China API pricing in CNY. Current K2.5 multimodal model with text, image, and video input.

Monthly Cost Estimates

Estimated monthly costs at different daily usage levels, assuming a 50% input / 50% output token split. Monthly figures use a 30-day month, and annual figures are 12 times the monthly figure. Input is billed at the cache-miss price, so cache-heavy workloads can cost less.

Daily Tokens	Monthly Cost	Annual Cost
10K	¥3.75	¥45.00
50K	¥18.75	¥225.00
100K	¥37.50	¥450.00
500K	¥187.50	¥2250.00
1.0M	¥375.00	¥4500.00
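
The table can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the 30-day month and 12-month year are assumptions inferred from the table's figures rather than a stated billing rule.

```python
# Prices from the specifications above, in CNY per 1M tokens.
INPUT_PRICE_CNY = 4.00    # cache-miss input
OUTPUT_PRICE_CNY = 21.00  # output


def monthly_cost(daily_tokens, input_share=0.5, days_per_month=30):
    """Estimated monthly cost in CNY for a given daily token volume.

    input_share is the fraction of tokens that are input (0.5 matches the
    50/50 split in the table). days_per_month=30 is an assumption.
    """
    input_tokens = daily_tokens * input_share
    output_tokens = daily_tokens * (1 - input_share)
    daily_cost = (input_tokens * INPUT_PRICE_CNY
                  + output_tokens * OUTPUT_PRICE_CNY) / 1_000_000
    return daily_cost * days_per_month


def annual_cost(daily_tokens):
    """Annual cost as 12 monthly periods (assumed table convention)."""
    return 12 * monthly_cost(daily_tokens)


for daily in (10_000, 50_000, 100_000, 500_000, 1_000_000):
    print(f"{daily:>9,}  CNY {monthly_cost(daily):>8.2f}  "
          f"CNY {annual_cost(daily):>9.2f}")
```

Changing `input_share` shows how sensitive the estimate is to the input/output mix, since output tokens cost more than five times as much as cache-miss input tokens.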

About Kimi K2.5

Kimi K2.5 is a large language model by Moonshot AI. It features a 262K token context window with up to 33K tokens of output per request. The model supports 4 capabilities: text, vision, video, function_calling.

At ¥4.00 per million cache-miss input tokens and ¥21.00 per million output tokens, Kimi K2.5 is positioned as a mid-range option in the Moonshot AI lineup. Repeated prefix input can be charged at ¥0.7 per million cached tokens. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.

Kimi K2.5 Key Details

Pricing: ¥4.00/M cache-miss input tokens, ¥0.7/M cached input tokens, ¥21.00/M output tokens
Context window: 262K tokens — suitable for large documents and codebases
Max output: 33K tokens per response
Capabilities: text, vision, video, function_calling
Highlights: China API pricing in CNY. Current K2.5 multimodal model with text, image, and video input.
Released: 2026-01

Other Moonshot AI Models

Kimi K2.6

¥6.50 input / ¥27.00 output per 1M tokens · cached input ¥1.10

Kimi K2.7 Code

¥6.50 input / ¥27.00 output per 1M tokens · cached input ¥1.30

Kimi K3

$3.00 input / $15.00 output per 1M tokens · cached input $0.3

Similar Price Range

Qwen3.7 Plus

Alibaba

¥2.00 input / ¥8.00 output per 1M tokens

Qwen3.7 Max

Alibaba

¥12.00 input / ¥36.00 output per 1M tokens
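
To compare the CNY-priced models listed above on a single workload, a rough sketch like the following may help. It assumes each price pair lists the input price first and the output price second, uses a 30-day month, and ignores cached-input discounts. Kimi K3 is left out because its price is in USD and this page gives no exchange rate. The function name is hypothetical.

```python
# (input, output) prices in CNY per 1M tokens, copied from this page.
# Assumption: the first number in each pair is input, the second is output.
MODELS_CNY = {
    "Kimi K2.5": (4.00, 21.00),
    "Kimi K2.6": (6.50, 27.00),
    "Kimi K2.7 Code": (6.50, 27.00),
    "Qwen3.7 Plus": (2.00, 8.00),
    "Qwen3.7 Max": (12.00, 36.00),
}


def rank_monthly_costs(input_per_day, output_per_day, days=30):
    """Return {model: monthly cost in CNY}, cheapest first.

    Uses cache-miss input pricing only; days=30 is an assumption.
    """
    costs = {
        name: (input_per_day * in_price + output_per_day * out_price)
        / 1_000_000 * days
        for name, (in_price, out_price) in MODELS_CNY.items()
    }
    return dict(sorted(costs.items(), key=lambda item: item[1]))


for name, cost in rank_monthly_costs(100_000, 50_000).items():
    print(f"{name:<15} CNY {cost:7.2f}/month")
```

Price alone does not capture differences in capability or context length, so the ranking is only a starting point for a comparison.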

Related Tools

AI Token Counter

Count tokens for Kimi K2.5

Pricing Calculator

Compare all model prices

Throughput Planner

Plan RPM, TPM, and monthly cost for Kimi K2.5

FAQ

How much does Kimi K2.5 cost?

Kimi K2.5 costs ¥4.00 per million cache-miss input tokens and ¥21.00 per million output tokens. Cached input costs ¥0.7 per million tokens. For a typical workload of 100K input tokens/day and 50K output tokens/day, expect approximately ¥43.50/month before cache-hit savings.
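
As a rough illustration of how cache hits change that figure, the sketch below splits input tokens into cached and cache-miss portions. The function name and the `cache_hit_ratio` parameter are hypothetical, and the 30-day month is an assumption; actual cache-hit rates depend on how much of each prompt repeats a previously seen prefix.

```python
# Prices from this page, in CNY per 1M tokens.
INPUT_MISS_CNY = 4.00
INPUT_CACHED_CNY = 0.70
OUTPUT_CNY = 21.00


def monthly_cost_with_cache(input_per_day, output_per_day,
                            cache_hit_ratio=0.0, days=30):
    """Monthly cost in CNY when a fraction of input tokens hits the cache.

    cache_hit_ratio is the share of input tokens billed at the cached
    price (0.0 = no cache hits). days=30 is an assumption.
    """
    if not 0.0 <= cache_hit_ratio <= 1.0:
        raise ValueError("cache_hit_ratio must be between 0 and 1")
    cached = input_per_day * cache_hit_ratio
    missed = input_per_day - cached
    daily = (missed * INPUT_MISS_CNY
             + cached * INPUT_CACHED_CNY
             + output_per_day * OUTPUT_CNY) / 1_000_000
    return daily * days


no_cache = monthly_cost_with_cache(100_000, 50_000)
half_cached = monthly_cost_with_cache(100_000, 50_000, cache_hit_ratio=0.5)
print(f"no cache hits:  CNY {no_cache:.2f}")
print(f"50% cache hits: CNY {half_cached:.2f}")
```

With no cache hits this reproduces the ¥43.50 figure above, and with half of the input served from cache the same workload comes to about ¥38.55. Output tokens dominate the bill in this example, so caching reduces the total only modestly.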

What is Kimi K2.5's context window?

Kimi K2.5 supports a context window of 262K tokens. This means your combined input prompt and output response can be up to 262K tokens. The maximum output per response is 33K tokens.
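
A minimal sketch of that budget check follows. The constants use the rounded 262K and 33K figures from this page as plain thousands, which is an assumption: the exact limits enforced by the API may differ slightly. The function name is hypothetical.

```python
# Assumption: rounded figures from this page, not exact API limits.
CONTEXT_WINDOW_TOKENS = 262_000
MAX_OUTPUT_TOKENS = 33_000


def output_budget(prompt_tokens):
    """Largest output (in tokens) that still fits after the prompt.

    The result is capped by both the per-response output limit and the
    space left in the context window, and is never negative.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


for prompt in (100_000, 250_000, 300_000):
    print(f"prompt {prompt:>7,} tokens -> output budget {output_budget(prompt):,}")
```

A 100,000-token prompt leaves room for the full 33,000-token output, a 250,000-token prompt leaves only 12,000 tokens, and a 300,000-token prompt does not fit at all.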

Is Kimi K2.5 good for my use case?

Kimi K2.5 supports text, vision, video, function_calling. As a mid-range model, it balances capability and cost for most production use cases. Use our Pricing Calculator to compare with alternatives.