Key Takeaways
Llama 3.2 90B Vision Instruct wins:
- Cheaper input tokens
- Cheaper output tokens
- Larger context window
- Faster response time
- Supports vision
- Supports tool calls
MiMo v2 Pro wins:
- Higher intelligence benchmark
- Better at coding
Price Advantage
Llama 3.2 90B Vision Instruct
Benchmark Advantage
MiMo v2 Pro
Context Window
Llama 3.2 90B Vision Instruct
Speed
Llama 3.2 90B Vision Instruct
Pricing Comparison
Benchmark Comparison
Context & Performance
Capabilities
Feature Comparison
| Feature | Llama 3.2 90B Vision Instruct | MiMo v2 Pro |
|---|---|---|
Vision (Image Input) | ||
Tool/Function Calls | ||
Reasoning Mode | ||
Audio Input | ||
Audio Output | ||
PDF Input | ||
Prompt Caching | ||
Web Search |
License & Release
| Property | Llama 3.2 90B Vision Instruct | MiMo v2 Pro |
|---|---|---|
| License | Open Source | Proprietary |
| Author | Meta-llama | Xiaomi |
Llama 3.2 90B Vision Instruct Modalities
Input
Output
MiMo v2 Pro Modalities
Input
Output