What capabilities does Qwen 2.5 Coder 32B have?

Qwen 2.5 Coder 32B supports: text, function_calling. Code-specialized. Open-source.

Qwen 2.5 Coder 32B

Alibaba legacy

Updated July 2026. Qwen 2.5 Coder 32B by Alibaba: $0.2/M cache-miss input, $0.6/M output tokens. 128K context, 8K max output. Function Calling. Free calculator + compare 40+ models.

Input Price

$0.2

cache miss / 1M tokens

Output Price

$0.6

per 1M tokens

Context Window

128K

tokens

Specifications

Provider	Alibaba
Model ID	qwen-2-5-coder-32b
Input Price	$0.2 / 1M cache-miss tokens
Output Price	$0.6 / 1M tokens
Context Window	128K tokens
Max Output	8K tokens
Capabilities	textfunction_calling
Release Date	2025-09
Pricing Source	Official Alibaba pricing
Price Verified	2026-06-14 · Alibaba Cloud Model Studio China pricing is published in CNY and is excluded from the USD comparison calculator.
Notes	Code-specialized. Open-source.

Monthly Cost Estimates

Estimated monthly costs based on different daily usage levels (assuming 50% input / 50% output split). Input estimates use cache-miss pricing, so cache-heavy workloads can be lower.

Daily Tokens	Monthly Cost	Annual Cost
10K	$0.12	$1.44
50K	$0.6	$7.20
100K	$1.20	$14.40
500K	$6.00	$72.00
1.0M	$12.00	$144.00

About Qwen 2.5 Coder 32B

Qwen 2.5 Coder 32B is a large language model by Alibaba. It features a 128K token context window with up to 8K tokens of output per request. The model supports 2 capabilities: text, function_calling.

At $0.2 per million cache-miss input tokens and $0.6 per million output tokens, Qwen 2.5 Coder 32B is positioned as a cost-effective option in the Alibaba lineup. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.

Qwen 2.5 Coder 32B Key Details

Pricing: $0.2/M cache-miss input tokens, $0.6/M output tokens
Context window: 128K tokens — good for standard conversations and tasks
Max output: 8K tokens per response
Capabilities: text, function_calling
Highlights: Code-specialized. Open-source.
Released: 2025-09

Other Alibaba Models

Qwen3.7 Plus

¥2.00 / ¥8.00 per 1M

Qwen3.7 Max

¥12.00 / ¥36.00 per 1M

Similar Price Range

GPT-5.4 Nano

OpenAI

$0.2 / $1.25 per 1M · cached input $0.02

Jamba 1.5 Mini

AI21 Labs

$0.2 / $0.4 per 1M

GPT-5 Mini

OpenAI

$0.25 / $2.00 per 1M · cached input $0.025

Related Tools

AI Token Counter

Count tokens for Qwen 2.5 Coder 32B

Pricing Calculator

Compare all model prices

Throughput Planner

Plan RPM, TPM, and monthly cost for Qwen 2.5 Coder 32B

FAQ

How much does Qwen 2.5 Coder 32B cost?

Qwen 2.5 Coder 32B costs $0.2 per million cache-miss input tokens and $0.6 per million output tokens. For a typical workload of 100K input tokens/day and 50K output tokens/day, expect approximately $1.50/month before cache-hit savings.

What is Qwen 2.5 Coder 32B's context window?

Qwen 2.5 Coder 32B supports a context window of 128K tokens. This means your combined input prompt and output response can be up to 128K tokens. The maximum output per response is 8K tokens.

Is Qwen 2.5 Coder 32B good for my use case?

Qwen 2.5 Coder 32B supports text, function_calling. As a budget-friendly model, it works well for high-volume tasks like classification, summarization, and simple generation. Use our Pricing Calculator to compare with alternatives.