Price Per Token

Nvidia vs Qwen

Compare all Nvidia and Qwen models: pricing, benchmarks, and capabilities

Overview

Metric | Nvidia | Qwen
Models | 9 | 99
Cheapest Input (per 1M tokens) | $0.04 | $0.01
Max Context | 262K | 1.0M
Top Intelligence | 36.0 | 56.6

Flagship Model Comparison

Nemotron-3 Super 120B A12B vs Qwen3.7 Max

Metric | Nemotron-3 Super 120B A12B | Qwen3.7 Max
Input $/1M tokens | $0.10 | $1.25
Output $/1M tokens | $0.50 | $3.75
Context Window | N/A | 1.0M
Intelligence | 36.0 | 56.6
Coding | 31.2 | 50.1
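
Because both prices are quoted per 1M tokens, the cost of a single request depends on how many input and output tokens it uses. The following is a minimal sketch of that arithmetic using the flagship prices listed above. The `Pricing` class, the `request_cost` function, and the 10,000-input / 2,000-output workload are hypothetical names and numbers chosen for illustration; they do not come from either provider's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-million-token prices in USD, as listed in the comparison above."""
    input_per_million: float
    output_per_million: float


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request (hypothetical helper)."""
    total = (input_tokens * pricing.input_per_million
             + output_tokens * pricing.output_per_million)
    return total / 1_000_000


# Flagship prices copied from the table above.
NEMOTRON_3_SUPER = Pricing(input_per_million=0.10, output_per_million=0.50)
QWEN_3_7_MAX = Pricing(input_per_million=1.25, output_per_million=3.75)

if __name__ == "__main__":
    # Assumed workload: 10,000 input tokens and 2,000 output tokens per request.
    for name, pricing in [("Nemotron-3 Super 120B A12B", NEMOTRON_3_SUPER),
                          ("Qwen3.7 Max", QWEN_3_7_MAX)]:
        cost = request_cost(pricing, input_tokens=10_000, output_tokens=2_000)
        print(f"{name}: ${cost:.4f} per request")
```

For this assumed workload the Nemotron request costs about $0.002 and the Qwen request about $0.02, a tenfold difference. The ratio shifts with the input/output mix, since the two models differ more on input price (12.5x) than on output price (7.5x).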

All Models Pricing

Nvidia Models

Model | Input/1M | Output/1M | Context
Nemotron-3 Super 120B A12B | $0.10 | $0.50 | N/A
Llama 3.3 Nemotron Super 49B V1.5 | $0.10 | $0.40 | 131K
Llama 3.1 Nemotron 70B Instruct | $0.90 | $0.90 | 131K
Nemotron Nano 9B V2 | $0.04 | $0.16 | 131K
Nemotron 3 Nano 30B A3B | $0.05 | $0.20 | 262K
Nemotron Nano 12B 2 VL | $0.20 | $0.20 | 131K
Nemotron 3 Super 120B A12B | $0.09 | $0.45 | N/A
Nemotron Nano 12B V2 | $0.20 | $0.20 | 128K
Llama 3.1 Nemotron Ultra 253B v1 | $0.00 | $0.00 | 128K

Qwen Models

Model | Input/1M | Output/1M | Context
Qwen3.7 Max | $1.25 | $3.75 | 1.0M
Qwen3.5 397B A17B | $0.39 | $0.90 | 262K
Qwen3.5-27B | $0.20 | $0.90 | 262K
Qwen3.5-122B-A10B | $0.26 | $0.90 | 262K
Qwen2.5 7B Instruct | $0.04 | $0.10 | 33K
Qwen3 Max Thinking | $0.78 | $3.90 | 262K
Qwen3.6 35B A3B | $0.14 | $0.90 | 262K
Qwen3.5-35B-A3B | $0.14 | $0.90 | 262K
Qwen3 Coder Next | $0.11 | $0.80 | 262K
Qwen3.5 9B | $0.04 | $0.15 | 262K
Qwen2.5-VL 7B Instruct | $0.20 | $0.20 | 33K
Qwen3 Max | $0.78 | $3.90 | 262K
Qwen3 235B A22B Instruct 2507 | $0.07 | $0.10 | 262K
Qwen3 Coder 480B A35B (exacto) | $0.22 | $0.90 | 262K
Qwen3.5 4B (Non-reasoning) | $0.03 | $0.15 | N/A
Qwen3 VL 235B A22B Instruct | $0.20 | $0.88 | 262K
Qwen3 Next 80B A3B Instruct | $0.09 | $0.78 | 262K
Qwen3 Coder 30B A3B Instruct | $0.07 | $0.27 | 160K
Qwen3 VL 32B Instruct | $0.10 | $0.42 | 131K
Qwen3 235B A22B | $0.46 | $0.90 | 131K
Qwen-Max | $1.04 | $4.16 | 33K
Qwen3 VL 30B A3B Instruct | $0.13 | $0.52 | 131K
Qwen2.5 72B Instruct | $0.36 | $0.40 | 33K
QwQ 32B | $0.90 | $0.90 | 33K
Qwen3 30B A3B Instruct 2507 | $0.04 | $0.17 | 262K
Qwen3.5 2B (Non-reasoning) | $0.02 | $0.10 | N/A
Qwen3 32B | $0.08 | $0.28 | 41K
Qwen3 VL 8B Instruct | $0.08 | $0.20 | 131K
Qwen2.5 VL 32B Instruct | $0.90 | $0.90 | 128K
Qwen2.5 Coder 32B Instruct | $0.66 | $0.80 | 33K
Qwen3 14B | $0.08 | $0.20 | 41K
Qwen3 30B A3B | $0.08 | $0.28 | 41K
Qwen3 4B | $0.20 | $0.20 | 41K
Qwen-Turbo | $0.03 | $0.13 | 131K
Qwen2 72B Instruct | $0.90 | $0.90 | 33K
Qwen3 Omni 30B A3B Instruct | $0.90 | $0.90 | 66K
Qwen3 8B | $0.05 | $0.20 | 41K
Qwen2.5 Coder 7B Instruct | $0.20 | $0.20 | 33K
Qwen3.5 0.8B | $0.01 | $0.05 | N/A
Qwen3 1.7B | $0.20 | $0.20 | 131K
Qwen2 1.5B Instruct | $0.02 | $0.02 | N/A
Qwen3.5-Flash | $0.07 | $0.26 | 1.0M
Qwen3 30B A3B Thinking 2507 | $0.08 | $0.40 | 33K
Qwen3 Next 80B A3B Thinking | $0.10 | $0.30 | 128K
Qwen2 VL 2B Instruct | $0.10 | $0.10 | 33K
Qwen2.5 Coder 3B Instruct | $0.10 | $0.10 | 33K
Qwen2.5 Coder 3B | $0.10 | $0.10 | 33K
Qwen2.5 VL 3B Instruct | $0.10 | $0.10 | 128K
Qwen3 235B A22B Thinking 2507 | $0.10 | $0.10 | 262K
Qwen1.5 0.5B | $0.10 | $0.10 | 33K
Qwen1.5 0.5B Chat | $0.10 | $0.10 | 33K
Qwen3 VL 8B Thinking | $0.12 | $1.36 | 131K
Qwen3 VL 30B A3B Thinking | $0.13 | $0.90 | 131K
Qwen VL Plus | $0.14 | $0.41 | 131K
Qwen3 Coder Flash | $0.20 | $0.97 | 1.0M
Qwen2.5 Coder 7B | $0.20 | $0.20 | 33K
Qwen2.5 Coder 14B Instruct | $0.20 | $0.20 | 33K
Qwen2 VL 7B Instruct | $0.20 | $0.20 | 131K
Qwen2 7B Instruct | $0.20 | $0.20 | 33K
CodeQwen 1.5 7B | $0.20 | $0.20 | 66K
Qwen2.5 14B | $0.20 | $0.20 | 131K
Qwen2.5 1.5B Instruct | $0.20 | $0.20 | 33K
Qwen2.5 7B | $0.20 | $0.20 | 131K
Qwen2.5 Coder 0.5B | $0.20 | $0.20 | 33K
Qwen2.5 Coder 0.5B Instruct | $0.20 | $0.20 | 33K
Qwen2.5 Coder 14B | $0.20 | $0.20 | 33K
Qwen2.5 Coder 1.5B | $0.20 | $0.20 | 33K
Qwen3 0.6B | $0.20 | $0.20 | 41K
Qwen3 4B Instruct 2507 | $0.20 | $0.20 | 262K
Qwen2.5 Coder 1.5B Instruct | $0.20 | $0.20 | 33K
Qwen2.5 14B Instruct | $0.20 | $0.20 | 33K
Qwen2.5 0.5B Instruct | $0.20 | $0.20 | 33K
Qwen2.5 VL 72B Instruct | $0.25 | $0.75 | 33K
Qwen Plus 0728 (thinking) | $0.26 | $0.78 | 1.0M
Qwen-Plus | $0.26 | $0.78 | 1.0M
Qwen3.5 Plus | $0.26 | $1.56 | 1.0M
Qwen3 VL 235B A22B Thinking | $0.26 | $0.90 | 131K
Qwen1.5 14B Chat | $0.30 | $0.30 | 33K
Qwen2 VL 72B Instruct | $0.45 | $0.45 | 131K
Qwen VL Max | $0.52 | $2.08 | 131K
Qwen3 Coder Plus | $0.65 | $3.25 | 1.0M
Qwen2.5 Coder 32B Instruct 128K | $0.90 | $0.90 | 131K
Qwen3 Coder 480B Instruct BF16 | $0.90 | $0.90 | 262K
Qwen2.5 72B | $0.90 | $0.90 | 131K
Qwen 1.5 72B Chat | $0.90 | $0.90 | 33K
Qwen2.5 32B | $0.90 | $0.90 | 131K
Qwen2.5 32B Instruct | $0.90 | $0.90 | 128K
Qwen2.5 Coder 32B | $0.90 | $0.90 | 33K
Qwen2.5 Coder 32B Instruct 32K RoPE | $0.90 | $0.90 | 33K
Qwen2.5 Coder 32B Instruct 64K | $0.90 | $0.90 | 66K
Qwen2.5 Math 72B Instruct | $0.90 | $0.90 | 4K
QwQ 32B Preview | $0.90 | $0.90 | 33K
Qwen2 1.5B | $0.00 | $0.00 | N/A
Qwen3 0.6B Base | $0.00 | $0.00 | N/A
Qwen3 14B Base | $0.00 | $0.00 | N/A
Qwen3 1.7B Base | $0.00 | $0.00 | N/A
Qwen3 30B A3B Base | $0.00 | $0.00 | N/A
Qwen3 4B Base | $0.00 | $0.00 | N/A
Qwen3 8B Base | $0.00 | $0.00 | N/A

Benchmark Comparison

Best Scores by Provider

Benchmark | Nvidia | Qwen
Intelligence | 36.0 (Nemotron-3 Super 120B A12B) | 56.6 (Qwen3.7 Max)
Coding | 31.2 (Nemotron-3 Super 120B A12B) | 50.1 (Qwen3.7 Max)
Math | 62.3 (Nemotron Nano 9B V2) | 82.3 (Qwen3 Max Thinking)
MMLU Pro | 73.9 (Nemotron Nano 9B V2) | 83.8 (Qwen3 Max)
GPQA | 80.0 (Nemotron-3 Super 120B A12B) | 92.3 (Qwen3.7 Max)
LiveCodeBench | 70.1 (Nemotron Nano 9B V2) | 68.4 (Qwen3 Next 80B A3B Instruct)
Aider | 54.9 (Llama 3.1 Nemotron 70B Instruct) | 72.9 (Qwen2.5 Coder 32B Instruct)
AIME | 24.7 (Llama 3.1 Nemotron 70B Instruct) | 72.7 (Qwen3 30B A3B Instruct 2507)
BBH | N/A | 35.9 (Qwen2.5-VL 7B Instruct)

Capabilities

Capability | Nvidia | Qwen
Vision | - | ✓ (24 models)
Tool Calls | ✓ (7 models) | ✓ (74 models)
Reasoning | ✓ (5 models) | ✓ (24 models)
Audio Input | - | -
Audio Output | - | -
PDF Input | - | -
Web Search ✓ (2 models)
Prompt Caching | - | ✓ (16 models)
Open Source Models | ✓ (5 models) | ✓ (83 models)

Model-Level Comparisons

Compare specific models head-to-head:

Frequently Asked Questions

Nvidia has 9 models while Qwen has 99 models available through API providers.
Nvidia's cheapest model starts at $0.04/M input tokens, while Qwen's cheapest starts at $0.01/M input tokens.
It depends on your needs. Compare specific models from each provider using the head-to-head comparisons above. Consider factors like pricing, benchmark performance, context window size, and supported capabilities.
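
One way to weigh pricing against context window size is to filter models by a minimum context and then pick the cheapest. The sketch below does this for a small sample of rows copied from the pricing tables above. The `Model` type and the `cheapest_with_context` function are hypothetical names used only for illustration, and the context sizes are assumed to be round figures (for example, 262K taken as 262,000 tokens), since the page lists only rounded values.

```python
from typing import NamedTuple, Optional, Sequence


class Model(NamedTuple):
    name: str
    input_usd: float               # USD per 1M input tokens
    output_usd: float              # USD per 1M output tokens
    context_tokens: Optional[int]  # None where the page lists N/A


# A sample of rows from the tables above; context sizes are rounded assumptions.
MODELS = [
    Model("Nemotron Nano 9B V2", 0.04, 0.16, 131_000),
    Model("Nemotron 3 Nano 30B A3B", 0.05, 0.20, 262_000),
    Model("Qwen3.5 9B", 0.04, 0.15, 262_000),
    Model("Qwen3.5-Flash", 0.07, 0.26, 1_000_000),
    Model("Qwen3.7 Max", 1.25, 3.75, 1_000_000),
]


def cheapest_with_context(models: Sequence[Model], min_context: int) -> Optional[Model]:
    """Return the cheapest model whose context is at least min_context.

    Models are ranked by input price, with output price as the tie-breaker.
    Models with an unknown context window are skipped.
    """
    eligible = [m for m in models
                if m.context_tokens is not None and m.context_tokens >= min_context]
    return min(eligible, key=lambda m: (m.input_usd, m.output_usd), default=None)


if __name__ == "__main__":
    for needed in (200_000, 500_000):
        pick = cheapest_with_context(MODELS, needed)
        print(needed, pick.name if pick else "no match")
```

Ranking by input price first is only one possible choice. A workload that generates long outputs may be better served by ranking on output price or on the estimated cost of a typical request.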