MMMU (Massive Multi-discipline Multimodal Understanding) is a benchmark that tests vision-language models on expert-level tasks.
Data from LayerLens
Models: 49
Best Score: 79.2
Average: 58.2
Std Dev: 14.9
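As a rough check, the summary figures can be recomputed from the MMMU column of the leaderboard table. The sketch below copies the 49 scores by hand and uses Python's `statistics` module. The helper name `summarize` is hypothetical, and using the population standard deviation (`pstdev`) is an assumption, since the page does not say which form it reports.

```python
from statistics import mean, pstdev

# MMMU scores copied from the leaderboard table, in table order.
scores = [
    79.2, 79.1, 78.1, 77.6, 76.8, 76.4, 76.3, 76.3, 75.3, 75.3,
    72.9, 71.7, 69.3, 69.0, 68.2, 66.9, 66.9, 65.2, 65.2, 62.3,
    60.7, 60.4, 59.1, 58.7, 58.1, 57.4, 55.9, 54.3, 53.9, 53.0,
    53.0, 52.9, 52.4, 52.1, 52.1, 50.7, 50.1, 49.1, 49.0, 49.0,
    48.2, 45.2, 42.3, 42.0, 39.7, 37.9, 31.3, 21.8, 13.1,
]


def summarize(values):
    """Return the headline statistics for a list of benchmark scores."""
    return {
        "models": len(values),
        "best": max(values),
        "average": round(mean(values), 1),
        # Assumption: population standard deviation, not the sample form.
        "std_dev": pstdev(values),
    }


summary = summarize(scores)
print(summary)
```

The count, best score, and rounded average match the figures above. The standard deviation comes out close to the reported 14.9, with the exact last digit depending on which form is used.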
| Provider | Model | Input $/M | Output $/M | MMMU |
|---|---|---|---|---|
| | | $1.100 | $4.400 | 79.2 |
| | | $1.250 | $10.000 | 79.1 |
| | | $0.400 | $3.200 | 78.1 |
| | | $0.300 | $2.400 | 77.6 |
| | | $0.100 | $0.400 | 76.8 |
| | | $0.250 | $1.000 | 76.4 |
| | | $5.000 | $25.000 | 76.3 |
| | | $5.000 | $25.000 | 76.3 |
| | | $0.250 | $2.000 | 75.3 |
| | | $3.000 | $15.000 | 75.3 |
| | | $3.000 | $15.000 | 72.9 |
| | | $3.000 | $15.000 | 71.7 |
| | | $2.000 | $8.000 | 69.3 |
| | | $0.100 | $0.400 | 69.0 |
| | | $0.200 | $0.880 | 68.2 |
| | | $3.000 | $15.000 | 66.9 |
| | | $3.000 | $15.000 | 66.9 |
| | | $1.000 | $5.000 | 65.2 |
| | | $1.000 | $5.000 | 65.2 |
| | | $1.750 | $14.000 | 62.3 |
| | | $1.250 | $10.000 | 60.7 |
| | | $0.200 | $0.500 | 60.4 |
| | | $2.500 | $10.000 | 59.1 |
| | | $0.400 | $2.000 | 58.7 |
| | | $0.400 | $2.000 | 58.1 |
| | | $1.200 | $6.000 | 57.4 |
| | | $0.060 | $0.180 | 55.9 |
| | | $0.800 | $4.000 | 54.3 |
| | | $0.040 | $0.150 | 53.9 |
| | | $0.300 | $2.500 | 53.0 |
| | | $0.300 | $2.500 | 53.0 |
| | | $0.150 | $0.400 | 52.9 |
| | | $0.300 | $0.500 | 52.4 |
| | | $0.080 | $0.240 | 52.1 |
| | | $0.080 | $0.240 | 52.1 |
| | | $0.700 | $2.500 | 50.7 |
| | | $0.800 | $3.200 | 50.1 |
| | | $3.000 | $15.000 | 49.1 |
| | | $0.080 | $0.280 | 49.0 |
| | | $0.080 | $0.280 | 49.0 |
| | | $0.320 | $0.890 | 48.2 |
| | | $4.000 | $4.000 | 45.2 |
| | | $0.080 | $0.300 | 42.3 |
| | | $2.500 | $10.000 | 42.0 |
| | | $0.060 | $0.140 | 39.7 |
| | | $2.500 | $10.000 | 37.9 |
| | | $0.150 | $0.600 | 31.3 |
| | | $0.550 | $3.500 | 21.8 |
| | | $0.400 | $2.400 | 13.1 |
Pricing from OpenRouter. Benchmarks from Artificial Analysis.

This leaderboard shows all models with MMMU benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
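One way to compare performance against cost is to divide each MMMU score by a blended token price. The sketch below is illustrative only: `Entry`, `blended_price`, and `points_per_dollar` are hypothetical names, prices are assumed to be USD per million tokens, and the 75% input / 25% output mix is an assumed workload, not something the leaderboard specifies. The four sample rows are taken from the table and identified by their row position, since this copy of the table carries no model names.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    position: int      # row position in the leaderboard (1 = top)
    input_usd: float   # input price, assumed USD per million tokens
    output_usd: float  # output price, assumed USD per million tokens
    mmmu: float        # MMMU score


def blended_price(entry, input_share=0.75):
    """Weighted price per million tokens; input_share is an assumed mix."""
    return input_share * entry.input_usd + (1.0 - input_share) * entry.output_usd


def points_per_dollar(entry, input_share=0.75):
    """MMMU points per blended dollar; higher means cheaper per point."""
    return entry.mmmu / blended_price(entry, input_share)


# Sample rows copied from the table above.
entries = [
    Entry(1, 1.10, 4.40, 79.2),
    Entry(2, 1.25, 10.00, 79.1),
    Entry(5, 0.10, 0.40, 76.8),
    Entry(7, 5.00, 25.00, 76.3),
]

ranked = sorted(entries, key=points_per_dollar, reverse=True)
for e in ranked:
    print(f"row {e.position}: {e.mmmu} MMMU, "
          f"${blended_price(e):.3f}/M blended, "
          f"{points_per_dollar(e):.1f} points per dollar")
```

Under these assumptions the fifth row ranks first on cost efficiency despite scoring 2.4 points below the top entry. A different input/output mix would shift the ordering, so `input_share` should be set to match the actual workload.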
Built by @aellman
2026 68 Ventures, LLC. All rights reserved.