Overview
Microsoft
Models
7
Cheapest Input
$0.05
Max Context
131K
Top Intelligence
33.1
Nvidia
Models
7
Cheapest Input
$0.04
Max Context
262K
Top Intelligence
14.6
Flagship Model Comparison
WizardLM-2 8x22BvsLlama 3.3 Nemotron Super 49B V1.5
| Metric | WizardLM-2 8x22B | Llama 3.3 Nemotron Super 49B V1.5 |
|---|---|---|
| Input $/1M tokens | $0.62 | $0.10 |
| Output $/1M tokens | $0.62 | $0.40 |
| Context Window | 66K | 131K |
| Intelligence | 33.1 | 14.6 |
| Coding | N/A | 10.5 |
| Math | N/A | 8.0 |
All Models Pricing
Microsoft Models
| Model | Input/1M | Output/1M | Context |
|---|---|---|---|
| WizardLM-2 8x22B | $0.62 | $0.62 | 66K |
| Phi-3 Medium 128K Instruct | $1.00 | $1.00 | 128K |
| Phi-3 Mini 128K Instruct | $0.10 | $0.10 | 128K |
| Phi 4 | $0.06 | $0.14 | 16K |
| Phi 4 Multimodal Instruct | $0.05 | $0.10 | 131K |
| Phi 4 Reasoning Plus | $0.07 | $0.35 | 32K |
| Phi-3.5 Mini 128K Instruct | $0.10 | $0.10 | 128K |
Nvidia Models
| Model | Input/1M | Output/1M | Context |
|---|---|---|---|
| Llama 3.3 Nemotron Super 49B V1.5 | $0.10 | $0.40 | 131K |
| Llama 3.1 Nemotron 70B Instruct | $0.90 | $0.90 | 131K |
| Nemotron Nano 9B V2 | $0.04 | $0.16 | 131K |
| Nemotron 3 Nano 30B A3B | $0.05 | $0.20 | 262K |
| Nemotron Nano 12B 2 VL | $0.20 | $0.20 | 131K |
| Nemotron Nano 12B V2 | $0.20 | $0.20 | 128K |
| Llama 3.1 Nemotron Ultra 253B v1 | $0.00 | $0.00 | 128K |
Benchmark Comparison
Best Scores by Provider
| Benchmark | Microsoft | Nvidia |
|---|---|---|
| Intelligence | 33.1 WizardLM-2 8x22B | 14.6 Llama 3.3 Nemotron Super 49B V1.5 |
| Coding | 11.2 Phi 4 | 15.8 Nemotron 3 Nano 30B A3B |
| Math | 18.0 Phi 4 | 62.3 Nemotron Nano 9B V2 |
| MMLU Pro | 71.4 Phi 4 | 73.9 Nemotron Nano 9B V2 |
| GPQA | 57.5 Phi 4 | 55.7 Nemotron Nano 9B V2 |
| LiveCodeBench | 23.1 Phi 4 | 70.1 Nemotron Nano 9B V2 |
| Aider | 44.4 WizardLM-2 8x22B | 54.9 Llama 3.1 Nemotron 70B Instruct |
| AIME | 14.3 Phi 4 | 24.7 Llama 3.1 Nemotron 70B Instruct |
| BBH | 48.6 WizardLM-2 8x22B | N/A |
Capabilities
| Capability | Microsoft | Nvidia |
|---|---|---|
| Vision | ✓ | ✓ |
| Tool Calls | ✓ (7 models) | ✓ (7 models) |
| Reasoning | ✓ (2 models) | ✓ (5 models) |
| Audio Input | — | — |
| Audio Output | — | — |
| PDF Input | — | — |
| Web Search | — | — |
| Prompt Caching | — | — |
| Open Source Models | ✓ (7 models) | ✓ (5 models) |
Model-Level Comparisons
Compare specific models head-to-head:
WizardLM-2 8x22BvsLlama 3.3 Nemotron Super 49B V1.5
WizardLM-2 8x22BvsLlama 3.1 Nemotron 70B Instruct
WizardLM-2 8x22BvsNemotron Nano 9B V2
Phi-3 Medium 128K InstructvsLlama 3.3 Nemotron Super 49B V1.5
Phi-3 Medium 128K InstructvsLlama 3.1 Nemotron 70B Instruct
Phi-3 Medium 128K InstructvsNemotron Nano 9B V2
Phi-3 Mini 128K InstructvsLlama 3.3 Nemotron Super 49B V1.5
Phi-3 Mini 128K InstructvsLlama 3.1 Nemotron 70B Instruct
Phi-3 Mini 128K InstructvsNemotron Nano 9B V2