Price Per Token
Qwen

Qwen3 32B API Pricing 2026

Compare pricing, benchmarks, and providers for Qwen3 32B. Find the best value for your use case.

112 out of our 301 tracked models have had a price change in February.

Last updated: February 15, 2026 at 08:07 AM

Overview

Qwen3 32B was released on April 28, 2025. Pricing starts at $0.080 per million input tokens and $0.240 per million output tokens. The model supports a context window of 40,960 tokens (about 41K). API access is available through Qwen.
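To illustrate how the per-million rates translate into a per-request cost, here is a minimal sketch. The helper name `request_cost` and the example token counts are assumptions made for illustration; only the two rates come from the starting prices quoted above.

```python
# Starting rates quoted above, in USD per 1M tokens.
INPUT_PER_M = 0.080
OUTPUT_PER_M = 0.240


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    total = input_tokens * INPUT_PER_M + output_tokens * OUTPUT_PER_M
    return total / 1_000_000


# Assumed example workload: 10,000 input tokens and 2,000 output tokens.
cost = request_cost(10_000, 2_000)
print(f"${cost:.6f}")  # prints $0.001280
```

At these rates, output tokens cost three times as much as input tokens, so workloads that generate long responses are priced mostly by their output.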

Context Window: 41K tokens
Pricing (per 1M tokens): Input $0.080, Output $0.240, Cached $0.040
Speed (median latency): Output 86 tok/s, TTFT 1.05s
Capabilities: Tools, Caching
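The cached rate and the speed figures can be combined into a rough per-request estimate. The sketch below rests on two assumptions that the page does not confirm: that cached tokens are a subset of the input tokens and are billed at the cached rate instead of the input rate, and that response time is roughly the median TTFT plus output length divided by the median output speed. The function names are hypothetical.

```python
# Figures from the summary above: list prices and median speed numbers.
INPUT_PER_M = 0.080    # USD per 1M uncached input tokens
CACHED_PER_M = 0.040   # USD per 1M cached input tokens
OUTPUT_PER_M = 0.240   # USD per 1M output tokens
TTFT_S = 1.05          # median time to first token, in seconds
OUTPUT_TOK_PER_S = 86  # median output speed, in tokens per second


def cost_with_cache(input_tokens: int, cached_tokens: int, output_tokens: int) -> float:
    """USD cost when `cached_tokens` of the input are billed at the cached rate.

    Assumption: cached tokens are a subset of the input tokens.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached = input_tokens - cached_tokens
    total = (
        uncached * INPUT_PER_M
        + cached_tokens * CACHED_PER_M
        + output_tokens * OUTPUT_PER_M
    )
    return total / 1_000_000


def estimated_seconds(output_tokens: int) -> float:
    """Rough response time: time to first token plus steady generation."""
    return TTFT_S + output_tokens / OUTPUT_TOK_PER_S


print(f"${cost_with_cache(10_000, 8_000, 500):.6f}")
print(f"{estimated_seconds(430):.2f} s")
```

Because both speed figures are medians, the time estimate is a typical value, not a bound; individual requests can be noticeably slower.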

Pricing Comparison

Compare Qwen3 32B with 10 similar models by price.

Current Pricing (per 1M tokens)

11 models

Provider | Model | Input $/M | Output $/M | Coding | MMLU | GPQA | Context
- | Qwen3 32B | $0.080 | $0.240 | 28.8 | 72.7 | 53.5 | 40,960
- | - | $0.060 | $0.240 | 16.7 | 59.0 | 43.3 | 300,000
- | - | $0.070 | $0.300 | 73.7 (single score, column not captured) | 262,144
- | - | $0.070 | $0.280 | - | - | - | 131,072
- | - | $0.070 | $0.280 | - | - | - | 120,000
- | - | $0.070 | $0.270 | 40.3 | 70.6 | 51.6 | 160,000
- | - | $0.071 | $0.100 | 52.4 | 82.8 | 75.3 | 262,144
- | - | $0.075 | $0.300 | - | - | - | 131,072
- | - | $0.075 | $0.300 | - | - | - | 262,144
- | - | $0.075 | $0.300 | 18.5 | 72.4 | 53.5 | 1,048,576
- | - | $0.080 | $0.330 | 51.5 | 77.7 | 65.9 | 262,144

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

Qwen3 32B is available from multiple providers with different pricing and availability.

Providers: 10
Cheapest Input/M: $0.080
Cheapest Output/M: $0.240
Best Price: Chutes

Provider Pricing

10 providers

Provider | Input $/M | Output $/M | Uptime
Chutes (Best Price) | $0.080 | $0.240 | 99.7%
DeepInfra (Best Price) | $0.080 | $0.280 | 100.0%
AtlasCloud | $0.100 | $1.200 | 99.8%
Novita | $0.100 | $0.450 | 100.0%
Alibaba | $0.104 | $0.416 | 100.0%
SiliconFlow | $0.140 | $0.570 | 96.3%
Nebius | $0.200 | $0.600 | 100.0%
Groq | $0.290 | $0.590 | 100.0%
SambaNova | $0.400 | $0.800 | -
Cerebras | $0.400 | $0.800 | 100.0%
Provider pricing data sourced from OpenRouter and Helicone
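The provider list is ordered by input price, which can be misleading when a provider pairs a low input rate with a high output rate. As a hedged sketch, the snippet below ranks the same ten providers by a blended price per 1M tokens. The 75% input share is an assumed workload mix, not a figure from this page, and the function names are hypothetical.

```python
# (provider, input USD per 1M, output USD per 1M), copied from the provider table.
PROVIDERS = [
    ("Chutes", 0.080, 0.240),
    ("DeepInfra", 0.080, 0.280),
    ("AtlasCloud", 0.100, 1.200),
    ("Novita", 0.100, 0.450),
    ("Alibaba", 0.104, 0.416),
    ("SiliconFlow", 0.140, 0.570),
    ("Nebius", 0.200, 0.600),
    ("Groq", 0.290, 0.590),
    ("SambaNova", 0.400, 0.800),
    ("Cerebras", 0.400, 0.800),
]


def blended_price(input_per_m: float, output_per_m: float, input_share: float = 0.75) -> float:
    """Weighted USD price per 1M tokens; `input_share` is the assumed input fraction."""
    return input_share * input_per_m + (1 - input_share) * output_per_m


def rank_providers(input_share: float = 0.75) -> list[tuple[str, float]]:
    """Return (provider, blended price) pairs sorted from cheapest to most expensive."""
    scored = [(name, blended_price(i, o, input_share)) for name, i, o in PROVIDERS]
    return sorted(scored, key=lambda pair: pair[1])


for name, price in rank_providers():
    print(f"{name:<12} ${price:.3f}")
```

With this assumed mix, Chutes stays cheapest, while AtlasCloud falls from third place by input price to eighth, because its $1.200 output rate dominates the blend. Changing `input_share` to match a real workload can reorder the list again.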

Cost vs. Quality

Compare Qwen3 32B's benchmark performance against all models.

[Scatter plot: cost vs. quality. Legend: Qwen3 32B, Qwen3 32B (Thinking), other models.]

Detailed Benchmark Scores

Benchmark | Standard | Thinking
Intelligence | 14.5 (27th) | 16.5 (19th)
Coding | - | 13.8 (19th)
Math | 19.7 (21st) | 73.0 (29th)
MMLU Pro | 72.7 (37th) | 79.8 (24th)
GPQA | 53.5 (35th) | 66.8 (16th)
LiveCodeBench | 28.8 (24th) | 54.6 (13th)
AIME | 30.3 (50th) | 80.7 (53rd)
Aider | 40.0 (20th) | -
Benchmark data from Artificial Analysis and HuggingFace Open LLM Leaderboard

Frequently Asked Questions

Qwen3 32B costs $0.000080 per 1,000 input tokens and $0.000240 per 1,000 output tokens.
Qwen3 32B is available from 10 providers. Chutes offers the best price.
We compare Qwen3 32B with 10 similarly priced models. See the benchmark scatter plot above to compare quality vs. cost.
