Qwen 2.5 Coder 32B
AlibabaUpdated May 2026. Qwen 2.5 Coder 32B by Alibaba: $0.2/M cache-miss input, $0.6/M output tokens. 128K context, 8K max output. Function Calling. Free calculator + compare 40+ models.
Input Price
$0.2
cache miss / 1M tokens
Output Price
$0.6
per 1M tokens
Context Window
128K
tokens
Specifications
| Provider | Alibaba |
| Model ID | qwen-2-5-coder-32b |
| Input Price | $0.2 / 1M cache-miss tokens |
| Output Price | $0.6 / 1M tokens |
| Context Window | 128K tokens |
| Max Output | 8K tokens |
| Capabilities | textfunction_calling |
| Release Date | 2025-09 |
| Pricing Source | Official Alibaba pricing |
| Price Verified | 2026-02-26 · DashScope model pricing should be refreshed before new pricing content. |
| Notes | Code-specialized. Open-source. |
Monthly Cost Estimates
Estimated monthly costs based on different daily usage levels (assuming 50% input / 50% output split). Input estimates use cache-miss pricing, so cache-heavy workloads can be lower.
| Daily Tokens | Monthly Cost | Annual Cost |
|---|---|---|
| 10K | $0.12 | $1.44 |
| 50K | $0.60 | $7.20 |
| 100K | $1.20 | $14.40 |
| 500K | $6.00 | $72.00 |
| 1.0M | $12.00 | $144.00 |
About Qwen 2.5 Coder 32B
Qwen 2.5 Coder 32B is a large language model by Alibaba. It features a 128K token context window with up to 8K tokens of output per request. The model supports 2 capabilities: text, function_calling.
At $0.2 per million cache-miss input tokens and $0.6 per million output tokens, Qwen 2.5 Coder 32B is positioned as a cost-effective option in the Alibaba lineup. Use our Token Counter to estimate how many tokens your prompts use, and our Pricing Calculator to compare costs across all models.
Qwen 2.5 Coder 32B Key Details
- Pricing: $0.2/M cache-miss input tokens, $0.6/M output tokens
- Context window: 128K tokens — good for standard conversations and tasks
- Max output: 8K tokens per response
- Capabilities: text, function_calling
- Highlights: Code-specialized. Open-source.
- Released: 2025-09
Other Alibaba Models
Similar Price Range
Related Tools
FAQ
How much does Qwen 2.5 Coder 32B cost?
Qwen 2.5 Coder 32B costs $0.2 per million cache-miss input tokens and $0.6 per million output tokens. For a typical workload of 100K input tokens/day and 50K output tokens/day, expect approximately $1.50/month before cache-hit savings.
What is Qwen 2.5 Coder 32B's context window?
Qwen 2.5 Coder 32B supports a context window of 128K tokens. This means your combined input prompt and output response can be up to 128K tokens. The maximum output per response is 8K tokens.
Is Qwen 2.5 Coder 32B good for my use case?
Qwen 2.5 Coder 32B supports text, function_calling. As a budget-friendly model, it works well for high-volume tasks like classification, summarization, and simple generation. Use our Pricing Calculator to compare with alternatives.