Price Per TokenPrice Per Token
Qwen
Qwen
vs
Qwen
Qwen

Qwen3 VL 8B Instruct vs Qwen3.5 397B A17B

A detailed comparison of pricing, benchmarks, and capabilities

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

107 out of our 482 tracked models have had a price change in March.

Get our weekly newsletter on pricing changes, new releases, and tools.

Key Takeaways

Qwen3 VL 8B Instruct wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Faster response time
  • Better at math

Qwen3.5 397B A17B wins:

  • Larger context window
  • Higher intelligence benchmark
  • Better at coding
Price Advantage
Qwen3 VL 8B Instruct
Benchmark Advantage
Qwen3.5 397B A17B
Context Window
Qwen3.5 397B A17B
Speed
Qwen3 VL 8B Instruct

Pricing Comparison

Price Comparison

MetricQwen3 VL 8B InstructQwen3.5 397B A17BWinner
Input (per 1M tokens)$0.08$0.39 Qwen3 VL 8B Instruct
Output (per 1M tokens)$0.20$0.90 Qwen3 VL 8B Instruct
Cache Read (per 1M)$0.10$0.45 Qwen3 VL 8B Instruct
Using a 3:1 input/output ratio, Qwen3 VL 8B Instruct is 79% cheaper overall.

Qwen3 VL 8B Instruct Providers

No provider data available

Qwen3.5 397B A17B Providers

No provider data available

Benchmark Comparison

6
Benchmarks Compared
0
Qwen3 VL 8B Instruct Wins
3
Qwen3.5 397B A17B Wins

Benchmark Scores

BenchmarkQwen3 VL 8B InstructQwen3.5 397B A17BWinner
Intelligence Index
Overall intelligence score
14.340.1
Coding Index
Code generation & understanding
7.337.4
Math Index
Mathematical reasoning
27.3--
MMLU Pro
Academic knowledge
68.6--
GPQA
Graduate-level science
42.786.1
LiveCodeBench
Competitive programming
33.2--
Qwen3.5 397B A17B significantly outperforms in coding benchmarks.

Cost vs Quality

X-axis:
Y-axis:
Loading chart...
Qwen3 VL 8B Instruct
Other models

Context & Performance

Context Window

Qwen3 VL 8B Instruct
131,072
tokens
Qwen3.5 397B A17B
262,144
tokens
Qwen3.5 397B A17B has a 50% larger context window.

Speed Performance

MetricQwen3 VL 8B InstructQwen3.5 397B A17BWinner
Tokens/second138.4 tok/s52.8 tok/s
Time to First Token1.05s1.61s
Qwen3 VL 8B Instruct responds 162% faster on average.

Capabilities

Feature Comparison

FeatureQwen3 VL 8B InstructQwen3.5 397B A17B
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyQwen3 VL 8B InstructQwen3.5 397B A17B
LicenseOpen SourceOpen Source
AuthorQwenQwen
ReleasedOct 2025Feb 2026

Qwen3 VL 8B Instruct Modalities

Input
imagetext
Output
text

Qwen3.5 397B A17B Modalities

Input
textimagevideo
Output
text

Related Comparisons

Compare Qwen3 VL 8B Instruct with:

Compare Qwen3.5 397B A17B with:

Frequently Asked Questions

Qwen3 VL 8B Instruct has cheaper input pricing at $0.08/M tokens. Qwen3 VL 8B Instruct has cheaper output pricing at $0.20/M tokens.
Qwen3.5 397B A17B scores higher on coding benchmarks with a score of 37.4, compared to Qwen3 VL 8B Instruct's score of 7.3.
Qwen3 VL 8B Instruct has a 131,072 token context window, while Qwen3.5 397B A17B has a 262,144 token context window.
Qwen3 VL 8B Instruct supports vision. Qwen3.5 397B A17B supports vision.
Advertise with us