Key Takeaways
Llama 3.2 90B Vision Instruct wins:
- Cheaper output tokens
- Faster response time
MiMo v2 Omni wins:
- Cheaper input tokens
- Larger context window
- Higher intelligence benchmark
- Better at coding
Price Advantage
Llama 3.2 90B Vision Instruct
Benchmark Advantage
MiMo v2 Omni
Context Window
MiMo v2 Omni
Speed
Llama 3.2 90B Vision Instruct
Pricing Comparison
Benchmark Comparison
Context & Performance
Capabilities
Feature Comparison
| Feature | Llama 3.2 90B Vision Instruct | MiMo v2 Omni |
|---|---|---|
Vision (Image Input) | ||
Tool/Function Calls | ||
Reasoning Mode | ||
Audio Input | ||
Audio Output | ||
PDF Input | ||
Prompt Caching | ||
Web Search |
License & Release
| Property | Llama 3.2 90B Vision Instruct | MiMo v2 Omni |
|---|---|---|
| License | Open Source | Proprietary |
| Author | Meta-llama | Xiaomi |
| Released | Unknown | Mar 2026 |
Llama 3.2 90B Vision Instruct Modalities
Input
Output
MiMo v2 Omni Modalities
Input
textaudioimagevideo
Output
text