Price Per Token
Google vs Qwen

Gemini 3 Pro Preview vs Qwen3 VL 8B Thinking

A detailed comparison of pricing, benchmarks, and capabilities

Key Takeaways

Gemini 3 Pro Preview wins:

  • Larger context window
  • Reported response speed (no speed data is listed for Qwen3 VL 8B Thinking)
  • Reported intelligence, coding, and math benchmark scores (none are listed for Qwen3 VL 8B Thinking)

Qwen3 VL 8B Thinking wins:

  • Cheaper input tokens
  • Cheaper output tokens

Price Advantage: Qwen3 VL 8B Thinking
Benchmark Advantage: Gemini 3 Pro Preview
Context Window: Gemini 3 Pro Preview
Speed: Gemini 3 Pro Preview

Pricing Comparison

Price Comparison

Metric | Gemini 3 Pro Preview | Qwen3 VL 8B Thinking | Winner
Input (per 1M tokens) | $2.00 | $0.12 | Qwen3 VL 8B Thinking
Output (per 1M tokens) | $12.00 | $1.36 | Qwen3 VL 8B Thinking
Cache Read (per 1M tokens) | $0.20 | N/A | Gemini 3 Pro Preview
Cache Write (per 1M tokens) | $0.375 | N/A | Gemini 3 Pro Preview

Using a 3:1 input/output ratio, Qwen3 VL 8B Thinking is 90% cheaper overall.
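
The blended figure can be reproduced from the listed input and output prices. The sketch below is a minimal illustration, assuming "3:1 input/output ratio" means three input tokens for every output token; the function name `blended_price` is hypothetical and not part of any pricing API.

```python
def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Listed prices in USD per 1M tokens.
gemini = blended_price(2.00, 12.00)  # (3 * 2.00 + 12.00) / 4
qwen = blended_price(0.12, 1.36)     # (3 * 0.12 + 1.36) / 4

# Fraction saved by choosing the cheaper model at this mix.
savings = 1 - qwen / gemini

print(f"Gemini 3 Pro Preview: ${gemini:.2f} per 1M blended tokens")
print(f"Qwen3 VL 8B Thinking: ${qwen:.2f} per 1M blended tokens")
print(f"Qwen3 VL 8B Thinking is about {savings:.0%} cheaper")
```

Changing `input_parts` and `output_parts` shows how the gap shifts for workloads that are more output-heavy than the 3:1 mix assumed here.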

Gemini 3 Pro Preview Providers

Google AI Studio $2.00 (Cheapest)
Google $2.00 (Cheapest)

Qwen3 VL 8B Thinking Providers

Alibaba $0.12 (Cheapest)

Benchmark Comparison

Benchmarks compared: 6
Gemini 3 Pro Preview wins: 0
Qwen3 VL 8B Thinking wins: 0

Benchmark Scores

Benchmark | Gemini 3 Pro Preview | Qwen3 VL 8B Thinking | Winner
Intelligence Index (overall intelligence score) | 48.4 | - | -
Coding Index (code generation & understanding) | 46.5 | - | -
Math Index (mathematical reasoning) | 95.7 | - | -
MMLU Pro (academic knowledge) | 89.8 | - | -
GPQA (graduate-level science) | 90.8 | - | -
LiveCodeBench (competitive programming) | 91.7 | - | -

Only Gemini 3 Pro Preview has reported scores on these benchmarks, so no head-to-head winner can be declared for coding or any other category.

Cost vs Quality

[Chart: cost vs quality, with Gemini 3 Pro Preview plotted against other models]

Context & Performance

Context Window

Gemini 3 Pro Preview: 1,048,576 tokens (max output: 65,536 tokens)
Qwen3 VL 8B Thinking: 131,072 tokens (max output: 32,768 tokens)
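Gemini 3 Pro Preview's context window is 8 times the size of Qwen3 VL 8B Thinking's (700% larger); put the other way, Qwen3 VL 8B Thinking's window is about 88% smaller.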
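
"Percent larger" and "percent smaller" use different baselines and are easy to confuse. The following sketch computes both from the listed context window sizes; the variable names are illustrative only.

```python
# Context window sizes in tokens, as listed above.
gemini_ctx = 1_048_576
qwen_ctx = 131_072

# How many times larger the bigger window is.
ratio = gemini_ctx / qwen_ctx

# Percent larger: difference relative to the smaller window.
pct_larger = (gemini_ctx - qwen_ctx) / qwen_ctx * 100

# Percent smaller: difference relative to the larger window.
pct_smaller = (gemini_ctx - qwen_ctx) / gemini_ctx * 100

print(f"Ratio: {ratio:.0f}x")
print(f"Gemini 3 Pro Preview is {pct_larger:.0f}% larger")
print(f"Qwen3 VL 8B Thinking is {pct_smaller:.1f}% smaller")
```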

Speed Performance

Metric | Gemini 3 Pro Preview | Qwen3 VL 8B Thinking | Winner
Tokens/second | 128.3 tok/s | N/A | -
Time to First Token | 31.86s | N/A | -
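
The two speed figures can be combined into a rough end-to-end estimate for a response of a given length. This is a simplified sketch, assuming generation proceeds at a constant rate after the first token and ignoring network and queueing variation; the function name `estimated_response_seconds` is hypothetical.

```python
def estimated_response_seconds(output_tokens, ttft_seconds=31.86, tokens_per_second=128.3):
    """Rough total latency: time to first token plus steady-state generation time.

    Defaults are the Gemini 3 Pro Preview figures listed above.
    """
    return ttft_seconds + output_tokens / tokens_per_second


for n in (100, 1_000, 10_000):
    print(f"{n:>6} output tokens: about {estimated_response_seconds(n):.1f}s")
```

For short responses the time to first token dominates the total, so throughput matters mainly for long outputs.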

Capabilities

Feature Comparison

Feature | Gemini 3 Pro Preview | Qwen3 VL 8B Thinking
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

Property | Gemini 3 Pro Preview | Qwen3 VL 8B Thinking
License | Proprietary | Open Source
Author | Google | Qwen
Released | Nov 2025 | Oct 2025

Gemini 3 Pro Preview Modalities

Input: text, image, file, audio, video
Output: text

Qwen3 VL 8B Thinking Modalities

Input: image, text
Output: text

Frequently Asked Questions

Qwen3 VL 8B Thinking has cheaper input pricing at $0.12/M tokens. Qwen3 VL 8B Thinking has cheaper output pricing at $1.36/M tokens.
Gemini 3 Pro Preview scores 46.5 on the Coding Index. No coding score is reported for Qwen3 VL 8B Thinking, so the two cannot be compared directly.
Gemini 3 Pro Preview has a 1,048,576 token context window, while Qwen3 VL 8B Thinking has a 131,072 token context window.
Gemini 3 Pro Preview supports vision. Qwen3 VL 8B Thinking supports vision.