Llama 3.3 Nemotron Super 49B V1.5 vs Qwen3.5 397B A17B

Key Takeaways

Llama 3.3 Nemotron Super 49B V1.5 wins:

Cheaper input tokens
Cheaper output tokens
Faster response time
Better at math
Has reasoning mode

Qwen3.5 397B A17B wins:

Larger context window
Higher intelligence benchmark
Better at coding
Supports vision

Price Advantage

Llama 3.3 Nemotron Super 49B V1.5

Benchmark Advantage

Qwen3.5 397B A17B

Context Window

Qwen3.5 397B A17B

Speed

Llama 3.3 Nemotron Super 49B V1.5

Pricing Comparison

Price Comparison

Metric	Llama 3.3 Nemotron Super 49B V1.5	Qwen3.5 397B A17B	Winner
Input (per 1M tokens)	$0.10	$0.39	Llama 3.3 Nemotron Super 49B V1.5
Output (per 1M tokens)	$0.40	$0.90	Llama 3.3 Nemotron Super 49B V1.5
Cache Read (per 1M)	N/A	$0.45	Qwen3.5 397B A17B

Using a 3:1 input/output ratio, Llama 3.3 Nemotron Super 49B V1.5 is 66% cheaper overall.

Llama 3.3 Nemotron Super 49B V1.5 Providers

No provider data available

Qwen3.5 397B A17B Providers

No provider data available

Benchmark Comparison

7

Benchmarks Compared

0

Llama 3.3 Nemotron Super 49B V1.5 Wins

3

Qwen3.5 397B A17B Wins

Benchmark Scores

Benchmark	Llama 3.3 Nemotron Super 49B V1.5	Qwen3.5 397B A17B	Winner
Intelligence Index Overall intelligence score	14.6	40.1
Coding Index Code generation & understanding	10.5	37.4
Math Index Mathematical reasoning	8.0	-	-
MMLU Pro Academic knowledge	69.2	-	-
GPQA Graduate-level science	48.1	86.1
LiveCodeBench Competitive programming	29.0	-	-
AIME Competition math	13.7	-	-

Qwen3.5 397B A17B significantly outperforms in coding benchmarks.

Cost vs Quality

X-axis:

Y-axis:

Loading chart...

Llama 3.3 Nemotron Super 49B V1.5

Other models

Context & Performance

Context Window

Llama 3.3 Nemotron Super 49B V1.5

131,072

tokens

Qwen3.5 397B A17B

262,144

tokens

Qwen3.5 397B A17B has a 50% larger context window.

Speed Performance

Metric	Llama 3.3 Nemotron Super 49B V1.5	Qwen3.5 397B A17B	Winner
Tokens/second	82.7 tok/s	55.5 tok/s
Time to First Token	0.24s	1.55s

Llama 3.3 Nemotron Super 49B V1.5 responds 49% faster on average.

Capabilities

Feature Comparison

Feature	Llama 3.3 Nemotron Super 49B V1.5	Qwen3.5 397B A17B
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

Property	Llama 3.3 Nemotron Super 49B V1.5	Qwen3.5 397B A17B
License	Proprietary	Open Source
Author	Nvidia	Qwen
Released	Oct 2025	Feb 2026

Llama 3.3 Nemotron Super 49B V1.5 Modalities

Input

text

Output

text

Qwen3.5 397B A17B Modalities

Input

textimagevideo

Output

text

Related Comparisons

Compare Llama 3.3 Nemotron Super 49B V1.5 with:

Compare Qwen3.5 397B A17B with:

See all model comparisons

Key Takeaways

Llama 3.3 Nemotron Super 49B V1.5 wins:

Qwen3.5 397B A17B wins:

Pricing Comparison

Price Comparison

Llama 3.3 Nemotron Super 49B V1.5 Providers

Qwen3.5 397B A17B Providers

Benchmark Comparison

Benchmark Scores

Cost vs Quality

Context & Performance

Context Window

Speed Performance

Capabilities

Feature Comparison

License & Release

Llama 3.3 Nemotron Super 49B V1.5 Modalities

Qwen3.5 397B A17B Modalities

Related Comparisons

Compare Llama 3.3 Nemotron Super 49B V1.5 with:

Compare Qwen3.5 397B A17B with:

Frequently Asked Questions

Tools

Directories

Models & Pricing

Endpoints

Rankings

News