Key Takeaways
GPT-5 Mini wins:
- Larger context window
- Faster response time
- Higher intelligence benchmark
- Better at coding
- Better at math
Qwen3 VL 8B Thinking wins:
- Cheaper input tokens
- Cheaper output tokens
- Has reasoning mode
Price Advantage
Qwen3 VL 8B Thinking
Benchmark Advantage
GPT-5 Mini
Context Window
GPT-5 Mini
Speed
GPT-5 Mini
Pricing Comparison
Price Comparison
| Metric | GPT-5 Mini | Qwen3 VL 8B Thinking | Winner |
|---|---|---|---|
| Input (per 1M tokens) | $0.25 | $0.12 | Qwen3 VL 8B Thinking |
| Output (per 1M tokens) | $2.00 | $1.36 | Qwen3 VL 8B Thinking |
| Cache Read (per 1M) | $0.03 | N/A | GPT-5 Mini |
Using a 3:1 input/output ratio, Qwen3 VL 8B Thinking is 38% cheaper overall.
GPT-5 Mini Providers
OpenAI $0.25 (Cheapest)
Azure $0.25 (Cheapest)
Qwen3 VL 8B Thinking Providers
Alibaba $0.12 (Cheapest)
Benchmark Comparison
6
Benchmarks Compared
0
GPT-5 Mini Wins
0
Qwen3 VL 8B Thinking Wins
Benchmark Scores
| Benchmark | GPT-5 Mini | Qwen3 VL 8B Thinking | Winner |
|---|---|---|---|
Intelligence Index Overall intelligence score | 41.2 | - | - |
Coding Index Code generation & understanding | 35.3 | - | - |
Math Index Mathematical reasoning | 90.7 | - | - |
MMLU Pro Academic knowledge | 83.7 | - | - |
GPQA Graduate-level science | 82.8 | - | - |
LiveCodeBench Competitive programming | 83.8 | - | - |
GPT-5 Mini significantly outperforms in coding benchmarks.
Cost vs Quality
X-axis:
Y-axis:
Loading chart...
GPT-5 Mini
Other models
Context & Performance
Context Window
GPT-5 Mini
400,000
tokens
Max output: 128,000 tokens
Qwen3 VL 8B Thinking
131,072
tokens
Max output: 32,768 tokens
GPT-5 Mini has a 67% larger context window.
Speed Performance
| Metric | GPT-5 Mini | Qwen3 VL 8B Thinking | Winner |
|---|---|---|---|
| Tokens/second | 72.2 tok/s | N/A | |
| Time to First Token | 111.02s | N/A |
Capabilities
Feature Comparison
| Feature | GPT-5 Mini | Qwen3 VL 8B Thinking |
|---|---|---|
| Vision (Image Input) | ||
| Tool/Function Calls | ||
| Reasoning Mode | ||
| Audio Input | ||
| Audio Output | ||
| PDF Input | ||
| Prompt Caching | ||
| Web Search |
License & Release
| Property | GPT-5 Mini | Qwen3 VL 8B Thinking |
|---|---|---|
| License | Proprietary | Open Source |
| Author | OpenAI | Qwen |
| Released | Aug 2025 | Oct 2025 |
GPT-5 Mini Modalities
Input
textimagefile
Output
text
Qwen3 VL 8B Thinking Modalities
Input
imagetext
Output
text
