Price Per Token

Nvidia vs Qwen

Compare all Nvidia and Qwen models — pricing, benchmarks, and capabilities

108 out of our 483 tracked models have had a price change in March.

Overview

| Metric | Nvidia | Qwen |
|---|---|---|
| Models | 7 | 86 |
| Cheapest Input ($/1M tokens) | $0.04 | $0.03 |
| Max Context | 262K | 1.0M |
| Top Intelligence | 14.6 | 40.1 |

Flagship Model Comparison

Llama 3.3 Nemotron Super 49B V1.5 vs Qwen3.5 397B A17B

| Metric | Llama 3.3 Nemotron Super 49B V1.5 | Qwen3.5 397B A17B |
|---|---|---|
| Input $/1M tokens | $0.10 | $0.39 |
| Output $/1M tokens | $0.40 | $0.90 |
| Context Window | 131K | 262K |
| Intelligence | 14.6 | 40.1 |
| Coding | 10.5 | 37.4 |
| Math | 8.0 | N/A |
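
Because both prices are quoted per 1M tokens, the cost of a workload is (input tokens × input price + output tokens × output price) / 1,000,000. The following is a minimal sketch of that arithmetic using the two flagship prices from the table above. The function name `request_cost` and the example workload of 2M input and 500K output tokens are illustrative assumptions, not something this page defines.

```python
from decimal import Decimal

# USD per 1M tokens (input, output), copied from the flagship comparison.
PRICES = {
    "Llama 3.3 Nemotron Super 49B V1.5": (Decimal("0.10"), Decimal("0.40")),
    "Qwen3.5 397B A17B": (Decimal("0.39"), Decimal("0.90")),
}

TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of a workload at the listed per-1M-token prices."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_PRICE_UNIT


# Hypothetical workload: 2M input tokens and 500K output tokens.
for name in PRICES:
    print(f"{name}: ${request_cost(name, 2_000_000, 500_000):.2f}")
```

For this assumed workload the sketch gives $0.40 for the Nvidia flagship and $1.23 for the Qwen flagship. The ratio shifts with the input/output mix, since the two models differ more on input price than on output price.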

All Models Pricing

Nvidia Models

| Model | Input $/1M tokens | Output $/1M tokens | Context |
|---|---|---|---|
| Llama 3.3 Nemotron Super 49B V1.5 | $0.10 | $0.40 | 131K |
| Llama 3.1 Nemotron 70B Instruct | $0.90 | $0.90 | 131K |
| Nemotron Nano 9B V2 | $0.04 | $0.16 | 131K |
| Nemotron 3 Nano 30B A3B | $0.05 | $0.20 | 262K |
| Nemotron Nano 12B 2 VL | $0.20 | $0.20 | 131K |
| Nemotron Nano 12B V2 | $0.20 | $0.20 | 128K |
| Llama 3.1 Nemotron Ultra 253B v1 | $0.00 | $0.00 | 128K |

Qwen Models

| Model | Input $/1M tokens | Output $/1M tokens | Context |
|---|---|---|---|
| Qwen3.5 397B A17B | $0.39 | $0.90 | 262K |
| Qwen3.5-27B | $0.20 | $1.56 | 262K |
| Qwen3.5-122B-A10B | $0.26 | $2.08 | 262K |
| Qwen2.5 7B Instruct | $0.04 | $0.10 | 33K |
| Qwen3 Max Thinking | $0.78 | $3.90 | 262K |
| Qwen3.5-35B-A3B | $0.16 | $1.00 | 262K |
| Qwen3 Coder Next | $0.12 | $0.75 | 262K |
| Qwen3.5 9B | $0.05 | $0.15 | 262K |
| Qwen2.5-VL 7B Instruct | $0.20 | $0.20 | 33K |
| Qwen3 Max | $1.20 | $6.00 | 262K |
| Qwen3 235B A22B Instruct 2507 | $0.07 | $0.10 | 262K |
| Qwen3 Coder 480B A35B (exacto) | $0.22 | $0.90 | 262K |
| Qwen3 VL 235B A22B Instruct | $0.20 | $0.88 | 262K |
| Qwen3 Next 80B A3B Instruct | $0.09 | $0.78 | 262K |
| Qwen3 Coder 30B A3B Instruct | $0.07 | $0.27 | 160K |
| Qwen3 VL 32B Instruct | $0.10 | $0.42 | 131K |
| Qwen3 235B A22B | $0.40 | $0.80 | 131K |
| Qwen-Max | $1.04 | $4.16 | 33K |
| Qwen3 VL 30B A3B Instruct | $0.13 | $0.52 | 131K |
| Qwen2.5 72B Instruct | $0.12 | $0.39 | 33K |
| QwQ 32B | $0.15 | $0.40 | 33K |
| Qwen3 30B A3B Instruct 2507 | $0.09 | $0.30 | 262K |
| Qwen3 32B | $0.08 | $0.24 | 41K |
| Qwen3 VL 8B Instruct | $0.08 | $0.20 | 131K |
| Qwen2.5 VL 32B Instruct | $0.20 | $0.60 | 128K |
| Qwen2.5 Coder 32B Instruct | $0.20 | $0.20 | 33K |
| Qwen3 14B | $0.06 | $0.20 | 41K |
| Qwen3 30B A3B | $0.08 | $0.28 | 41K |
| Qwen3 4B | $0.20 | $0.20 | 41K |
| Qwen-Turbo | $0.03 | $0.13 | 131K |
| Qwen2 72B Instruct | $0.90 | $0.90 | 33K |
| Qwen3 Omni 30B A3B Instruct | $0.90 | $0.90 | 66K |
| Qwen3 8B | $0.05 | $0.20 | 41K |
| Qwen2.5 Coder 7B Instruct | $0.03 | $0.09 | 33K |
| Qwen3 30B A3B Thinking 2507 | $0.05 | $0.30 | 33K |
| Qwen3 Next 80B A3B Thinking | $0.10 | $0.30 | 128K |
| Qwen2 VL 2B Instruct | $0.10 | $0.10 | 33K |
| Qwen2.5 1.5B Instruct | $0.10 | $0.10 | 33K |
| Qwen2.5 Coder 3B Instruct | $0.10 | $0.10 | 33K |
| Qwen2.5 Coder 3B | $0.10 | $0.10 | 33K |
| Qwen2.5 VL 3B Instruct | $0.10 | $0.10 | 128K |
| Qwen1.5 0.5B | $0.10 | $0.10 | 33K |
| Qwen1.5 0.5B Chat | $0.10 | $0.10 | 33K |
| Qwen3.5-Flash | $0.10 | $0.40 | 1.0M |
| Qwen3 235B A22B Thinking 2507 | $0.11 | $0.60 | 262K |
| Qwen3 VL 8B Thinking | $0.12 | $1.36 | 131K |
| Qwen3 VL 30B A3B Thinking | $0.13 | $0.60 | 131K |
| Qwen VL Plus | $0.14 | $0.41 | 131K |
| Qwen3 Coder Flash | $0.20 | $0.97 | 1.0M |
| Qwen2.5 Coder 7B | $0.20 | $0.20 | 33K |
| Qwen2.5 Coder 14B Instruct | $0.20 | $0.20 | 33K |
| Qwen2 VL 7B Instruct | $0.20 | $0.20 | 131K |
| Qwen2 7B Instruct | $0.20 | $0.20 | 33K |
| CodeQwen 1.5 7B | $0.20 | $0.20 | 66K |
| Qwen2.5 14B | $0.20 | $0.20 | 131K |
| Qwen2.5 7B | $0.20 | $0.20 | 131K |
| Qwen2.5 Coder 0.5B | $0.20 | $0.20 | 33K |
| Qwen2.5 Coder 0.5B Instruct | $0.20 | $0.20 | 33K |
| Qwen2.5 Coder 14B | $0.20 | $0.20 | 33K |
| Qwen2.5 Coder 1.5B | $0.20 | $0.20 | 33K |
| Qwen3 0.6B | $0.20 | $0.20 | 41K |
| Qwen3 4B Instruct 2507 | $0.20 | $0.20 | 262K |
| Qwen2.5 Coder 1.5B Instruct | $0.20 | $0.20 | 33K |
| Qwen3 1.7B | $0.20 | $0.20 | 131K |
| Qwen2.5 14B Instruct | $0.20 | $0.20 | 33K |
| Qwen2.5 0.5B Instruct | $0.20 | $0.20 | 33K |
| Qwen Plus 0728 (thinking) | $0.26 | $0.78 | 1.0M |
| Qwen3 VL 235B A22B Thinking | $0.26 | $0.90 | 131K |
| Qwen3.5 Plus | $0.26 | $1.56 | 1.0M |
| Qwen1.5 14B Chat | $0.30 | $0.30 | 33K |
| Qwen-Plus | $0.40 | $1.20 | 1.0M |
| Qwen2 VL 72B Instruct | $0.45 | $0.45 | 131K |
| Qwen3 Coder Plus | $0.65 | $3.25 | 1.0M |
| Qwen VL Max | $0.80 | $3.20 | 131K |
| Qwen2.5 VL 72B Instruct | $0.80 | $0.80 | 33K |
| Qwen2.5 Coder 32B Instruct 128K | $0.90 | $0.90 | 131K |
| Qwen3 Coder 480B Instruct BF16 | $0.90 | $0.90 | 262K |
| Qwen2.5 72B | $0.90 | $0.90 | 131K |
| Qwen 1.5 72B Chat | $0.90 | $0.90 | 33K |
| Qwen2.5 32B | $0.90 | $0.90 | 131K |
| Qwen2.5 32B Instruct | $0.90 | $0.90 | 128K |
| Qwen2.5 Coder 32B | $0.90 | $0.90 | 33K |
| Qwen2.5 Coder 32B Instruct 32K RoPE | $0.90 | $0.90 | 33K |
| Qwen2.5 Coder 32B Instruct 64K | $0.90 | $0.90 | 66K |
| Qwen2.5 Math 72B Instruct | $0.90 | $0.90 | 4K |
| QwQ 32B Preview | $0.90 | $0.90 | 33K |

Benchmark Comparison

Best Scores by Provider

| Benchmark | Nvidia | Qwen |
|---|---|---|
| Intelligence | 14.6 (Llama 3.3 Nemotron Super 49B V1.5) | 40.1 (Qwen3.5 397B A17B) |
| Coding | 15.8 (Nemotron 3 Nano 30B A3B) | 37.4 (Qwen3.5 397B A17B) |
| Math | 62.3 (Nemotron Nano 9B V2) | 82.3 (Qwen3 Max Thinking) |
| MMLU Pro | 73.9 (Nemotron Nano 9B V2) | 83.8 (Qwen3 Max) |
| GPQA | 55.7 (Nemotron Nano 9B V2) | 86.1 (Qwen3.5 397B A17B) |
| LiveCodeBench | 70.1 (Nemotron Nano 9B V2) | 68.4 (Qwen3 Next 80B A3B Instruct) |
| Aider | 54.9 (Llama 3.1 Nemotron 70B Instruct) | 72.9 (Qwen2.5 Coder 32B Instruct) |
| AIME | 24.7 (Llama 3.1 Nemotron 70B Instruct) | 72.7 (Qwen3 30B A3B Instruct 2507) |
| BBH | N/A | 35.9 (Qwen2.5-VL 7B Instruct) |

Capabilities

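| Capability | Nvidia | Qwen |
|---|---|---|
| Vision | — | ✓ (23 models) |
| Tool Calls | ✓ (7 models) | ✓ (69 models) |
| Reasoning | ✓ (5 models) | ✓ (24 models) |
| Audio Input | — | — |
| Audio Output | — | — |
| PDF Input | — | — |
| Web Search (2 models, one provider; column unclear) | ? | ? |
| Prompt Caching | — | ✓ (13 models) |
| Open Source Models | ✓ (5 models) | ✓ (70 models) |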

Model-Level Comparisons

Compare specific models head-to-head:

Frequently Asked Questions

How many models do Nvidia and Qwen offer?
Nvidia has 7 models, while Qwen has 86 models available through API providers.

Which provider is cheaper?
Nvidia's cheapest model starts at $0.04 per 1M input tokens, while Qwen's cheapest starts at $0.03 per 1M input tokens.

Which provider is better?
It depends on your needs. Compare specific models from each provider using the head-to-head comparisons above, and weigh pricing, benchmark performance, context window size, and supported capabilities.
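
One way to weigh pricing against context window size is to filter models by a minimum context length and rank the rest by a blended price. The sketch below does this for a handful of rows copied from the pricing tables. The `Model` class, the `cheapest` function, and the 75% input / 25% output weighting are illustrative assumptions rather than anything this page prescribes, so adjust the weighting to your own traffic mix.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens
    context_k: int       # context window in thousands of tokens


# A small sample of rows from the pricing tables (1.0M context written as 1000).
MODELS = [
    Model("Nemotron Nano 9B V2", 0.04, 0.16, 131),
    Model("Nemotron 3 Nano 30B A3B", 0.05, 0.20, 262),
    Model("Llama 3.3 Nemotron Super 49B V1.5", 0.10, 0.40, 131),
    Model("Qwen-Turbo", 0.03, 0.13, 131),
    Model("Qwen3.5 9B", 0.05, 0.15, 262),
    Model("Qwen3 235B A22B Instruct 2507", 0.07, 0.10, 262),
    Model("Qwen3.5-Flash", 0.10, 0.40, 1000),
]


def cheapest(models, min_context_k, input_share=0.75):
    """Return models with enough context, cheapest blended price first.

    input_share is the assumed fraction of tokens that are input tokens.
    """
    def blended(m):
        return input_share * m.input_per_m + (1 - input_share) * m.output_per_m

    eligible = [m for m in models if m.context_k >= min_context_k]
    return sorted(eligible, key=blended)


# Hypothetical requirement: at least a 200K-token context window.
for m in cheapest(MODELS, min_context_k=200):
    print(m.name, m.context_k)
```

Under these assumptions, Qwen3.5 9B ranks first for a 200K requirement, and only Qwen3.5-Flash from this sample remains once the requirement exceeds 262K. Benchmark scores and capabilities are deliberately left out of the ranking and would need to be checked separately.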