Key Takeaways
Qwen3 VL 32B Instruct wins:
- Cheaper input tokens
- Cheaper output tokens
- Larger context window
- Faster response time
Grok 4 wins:
- Higher intelligence benchmark
- Better at coding
- Better at math
Price Advantage: Qwen3 VL 32B Instruct
Benchmark Advantage: Grok 4
Context Window: Qwen3 VL 32B Instruct
Speed: Qwen3 VL 32B Instruct
Pricing Comparison
| Metric | Qwen3 VL 32B Instruct | Grok 4 | Winner |
|---|---|---|---|
| Input (per 1M tokens) | $0.50 | $3.00 | Qwen3 VL 32B Instruct |
| Output (per 1M tokens) | $1.50 | $15.00 | Qwen3 VL 32B Instruct |
| Cache Read (per 1M tokens) | N/A | $0.75 | Grok 4 |
Using a 3:1 input/output ratio, Qwen3 VL 32B Instruct is 88% cheaper overall.
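The 88% figure can be reproduced from the table. The sketch below assumes "3:1 input/output ratio" means three input tokens for every output token and that the blended price is a simple weighted average; the function name `blended_price` is illustrative and does not come from the page.

```python
# Prices in USD per 1M tokens, copied from the pricing table.
PRICES = {
    "Qwen3 VL 32B Instruct": {"input": 0.50, "output": 1.50},
    "Grok 4": {"input": 3.00, "output": 15.00},
}


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for an input:output token mix.

    Assumption: the mix is input_parts input tokens per output_parts output tokens.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


qwen_blended = blended_price(
    PRICES["Qwen3 VL 32B Instruct"]["input"],
    PRICES["Qwen3 VL 32B Instruct"]["output"],
)
grok_blended = blended_price(PRICES["Grok 4"]["input"], PRICES["Grok 4"]["output"])

# Relative saving of the cheaper model, as a percentage.
savings_pct = (1 - qwen_blended / grok_blended) * 100

print(f"Qwen3 VL 32B Instruct blended: ${qwen_blended:.2f} per 1M tokens")
print(f"Grok 4 blended: ${grok_blended:.2f} per 1M tokens")
print(f"Savings: {savings_pct:.1f}%")
```

Under these assumptions the saving comes out at 87.5%, which rounds to the 88% quoted above. Changing `input_parts` and `output_parts` shows how the gap shifts for output-heavy workloads.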
Qwen3 VL 32B Instruct Providers
Together $0.50 (Cheapest)
Grok 4 Providers
Vercel $3.00 (Cheapest)
xAI $3.00 (Cheapest)
Benchmark Comparison
Benchmarks Compared: 8
Qwen3 VL 32B Instruct Wins: 0
Grok 4 Wins: 6
Benchmark Scores
| Benchmark | Qwen3 VL 32B Instruct | Grok 4 | Winner |
|---|---|---|---|
| Intelligence Index (overall intelligence score) | 17.2 | 41.4 | Grok 4 |
| Coding Index (code generation & understanding) | 15.6 | 40.5 | Grok 4 |
| Math Index (mathematical reasoning) | 68.3 | 92.7 | Grok 4 |
| MMLU Pro (academic knowledge) | 79.1 | 86.6 | Grok 4 |
| GPQA (graduate-level science) | 67.1 | 87.7 | Grok 4 |
| LiveCodeBench (competitive programming) | 51.4 | 81.9 | Grok 4 |
| Aider (real-world code editing) | - | 79.6 | - |
| AIME (competition math) | - | 94.3 | - |
Grok 4 significantly outperforms Qwen3 VL 32B Instruct on the coding benchmarks (Coding Index: 40.5 vs. 15.6; LiveCodeBench: 81.9 vs. 51.4).
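The win counts (8 benchmarks compared, 0 wins for Qwen3 VL 32B Instruct, 6 for Grok 4) are consistent with a tally that skips any benchmark where one model has no score. The sketch below assumes that rule and that a higher score is better on every benchmark; the page does not state either explicitly, and the helper `tally_wins` is hypothetical.

```python
# (Qwen3 VL 32B Instruct, Grok 4) scores from the table; None marks a missing score.
SCORES = {
    "Intelligence Index": (17.2, 41.4),
    "Coding Index": (15.6, 40.5),
    "Math Index": (68.3, 92.7),
    "MMLU Pro": (79.1, 86.6),
    "GPQA": (67.1, 87.7),
    "LiveCodeBench": (51.4, 81.9),
    "Aider": (None, 79.6),
    "AIME": (None, 94.3),
}


def tally_wins(scores):
    """Count wins per model, skipping benchmarks with a missing score.

    Assumption: higher is better, and a row with a missing score has no winner.
    """
    wins = {"Qwen3 VL 32B Instruct": 0, "Grok 4": 0}
    skipped = 0
    for qwen_score, grok_score in scores.values():
        if qwen_score is None or grok_score is None:
            skipped += 1
        elif qwen_score > grok_score:
            wins["Qwen3 VL 32B Instruct"] += 1
        elif grok_score > qwen_score:
            wins["Grok 4"] += 1
        # Equal scores count as a tie and add to neither total.
    return wins, skipped


wins, skipped = tally_wins(SCORES)
print(wins, "skipped:", skipped)
```

With the scores above this gives 0 wins for Qwen3 VL 32B Instruct, 6 for Grok 4, and 2 skipped rows (Aider and AIME), matching the summary counts.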
Cost vs Quality
[Chart: cost vs. quality, Qwen3 VL 32B Instruct compared with other models]
Context & Performance
Context Window
Qwen3 VL 32B Instruct: 262,144 tokens
Grok 4: 256,000 tokens
Qwen3 VL 32B Instruct has a 2% larger context window.
Speed Performance
| Metric | Qwen3 VL 32B Instruct | Grok 4 | Winner |
|---|---|---|---|
| Tokens/second | 78.2 tok/s | 33.9 tok/s | Qwen3 VL 32B Instruct |
| Time to First Token | 1.07s | 8.24s | Qwen3 VL 32B Instruct |
Qwen3 VL 32B Instruct generates tokens about 131% faster (78.2 vs. 33.9 tok/s) and returns its first token in 1.07s versus 8.24s for Grok 4.
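To see what these numbers mean for a full response, one rough model is: total time = time to first token + output tokens / throughput. The sketch below uses that simplification, which ignores network latency, queueing, and variation between requests; the 500-token response length is an arbitrary example, not a figure from the page.

```python
# Speed figures from the table: time to first token (seconds) and tokens per second.
SPEED = {
    "Qwen3 VL 32B Instruct": {"ttft_s": 1.07, "tokens_per_s": 78.2},
    "Grok 4": {"ttft_s": 8.24, "tokens_per_s": 33.9},
}


def estimated_response_seconds(ttft_s, tokens_per_s, output_tokens):
    """Rough end-to-end time: wait for the first token, then stream the rest.

    Assumption: throughput is constant over the whole response.
    """
    return ttft_s + output_tokens / tokens_per_s


OUTPUT_TOKENS = 500  # hypothetical response length

qwen_total = estimated_response_seconds(**SPEED["Qwen3 VL 32B Instruct"], output_tokens=OUTPUT_TOKENS)
grok_total = estimated_response_seconds(**SPEED["Grok 4"], output_tokens=OUTPUT_TOKENS)

# Throughput gain of Qwen3 VL 32B Instruct over Grok 4, as a percentage.
throughput_gain_pct = (
    SPEED["Qwen3 VL 32B Instruct"]["tokens_per_s"] / SPEED["Grok 4"]["tokens_per_s"] - 1
) * 100

print(f"Qwen3 VL 32B Instruct: {qwen_total:.1f}s for {OUTPUT_TOKENS} tokens")
print(f"Grok 4: {grok_total:.1f}s for {OUTPUT_TOKENS} tokens")
print(f"Throughput gain: {throughput_gain_pct:.0f}%")
```

The throughput gain works out to about 131%, which is the figure quoted above. The end-to-end gap is wider than the throughput ratio alone suggests because the time to first token also differs.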
Capabilities
Feature Comparison
| Feature | Qwen3 VL 32B Instruct | Grok 4 |
|---|---|---|
| Vision (Image Input) | | |
| Tool/Function Calls | | |
| Reasoning Mode | | |
| Audio Input | | |
| Audio Output | | |
| PDF Input | | |
| Prompt Caching | | |
| Web Search | | |
License & Release
| Property | Qwen3 VL 32B Instruct | Grok 4 |
|---|---|---|
| License | Open Source | Proprietary |
| Author | Qwen | xAI |
| Released | Oct 2025 | Jul 2025 |
Qwen3 VL 32B Instruct Modalities
Input: text, image
Output: text
Grok 4 Modalities
Input: image, text
Output: text
