Price Per Token

MMMU Leaderboard

MMMU (Massive Multi-discipline Multimodal Understanding) is a benchmark that tests vision-language models on expert-level tasks.

Data from LayerLens

Models: 49
Best Score: 79.2
Average: 58.2
Std Dev: 14.9
Categories: Multimodal

Input $/M   Output $/M   MMMU
$1.100      $4.400       79.2
$1.250      $10.000      79.1
$0.400      $3.200       78.1
$0.300      $2.400       77.6
$0.100      $0.400       76.8
$0.250      $1.000       76.4
$5.000      $25.000      76.3
$5.000      $25.000      76.3
$0.250      $2.000       75.3
$3.000      $15.000      75.3
$3.000      $15.000      72.9
$3.000      $15.000      71.7
$2.000      $8.000       69.3
$0.100      $0.400       69.0
$0.200      $0.880       68.2
$3.000      $15.000      66.9
$3.000      $15.000      66.9
$1.000      $5.000       65.2
$1.000      $5.000       65.2
$1.750      $14.000      62.3
$1.250      $10.000      60.7
$0.200      $0.500       60.4
$2.500      $10.000      59.1
$0.400      $2.000       58.7
$0.400      $2.000       58.1
$1.200      $6.000       57.4
$0.060      $0.180       55.9
$0.800      $4.000       54.3
$0.040      $0.150       53.9
$0.300      $2.500       53.0
$0.300      $2.500       53.0
$0.150      $0.400       52.9
$0.300      $0.500       52.4
$0.080      $0.240       52.1
$0.080      $0.240       52.1
$0.700      $2.500       50.7
$0.800      $3.200       50.1
$3.000      $15.000      49.1
$0.080      $0.280       49.0
$0.080      $0.280       49.0
$0.320      $0.890       48.2
$4.000      $4.000       45.2
$0.080      $0.300       42.3
$2.500      $10.000      42.0
$0.060      $0.140       39.7
$2.500      $10.000      37.9
$0.150      $0.600       31.3
$0.550      $3.500       21.8
$0.400      $2.400       13.1

Pricing from OpenRouter. Benchmarks from Artificial Analysis.
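
The summary figures at the top of the page (49 models, best 79.2, average 58.2, standard deviation 14.9) can be checked against the MMMU column. The sketch below is one way to do that with Python's standard library. The names `SCORES` and `summarize` are illustrative, and the use of the population standard deviation is an assumption, since the page does not say which formula it applies.

```python
import statistics

# MMMU scores copied from the leaderboard table, highest to lowest.
SCORES = [
    79.2, 79.1, 78.1, 77.6, 76.8, 76.4, 76.3, 76.3, 75.3, 75.3,
    72.9, 71.7, 69.3, 69.0, 68.2, 66.9, 66.9, 65.2, 65.2, 62.3,
    60.7, 60.4, 59.1, 58.7, 58.1, 57.4, 55.9, 54.3, 53.9, 53.0,
    53.0, 52.9, 52.4, 52.1, 52.1, 50.7, 50.1, 49.1, 49.0, 49.0,
    48.2, 45.2, 42.3, 42.0, 39.7, 37.9, 31.3, 21.8, 13.1,
]


def summarize(scores):
    """Return the count, best score, mean, and spread of a list of scores."""
    return {
        "models": len(scores),
        "best": max(scores),
        "average": statistics.mean(scores),
        # Assumption: population formula (divide by n), not the sample formula.
        "std_dev": statistics.pstdev(scores),
    }


summary = summarize(SCORES)
for name, value in summary.items():
    print(f"{name}: {value:.1f}")
```

To try the sample formula instead, replace `statistics.pstdev` with `statistics.stdev`; with 49 scores the two differ only slightly.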

93 out of our 301 tracked models have had a price change in March.

About MMMU

This leaderboard lists every tracked model with an MMMU score, ranked from highest to lowest. Input and output prices, in US dollars per million tokens, are included so you can compare performance against cost.
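
One way to compare performance against cost is to keep only the rows that no other row beats on both price and score (a Pareto frontier). The sketch below uses a handful of rows copied from the table. The `Row` type, the `blended_price` helper, and the 75% input share are assumptions made for illustration; they are not part of the leaderboard, and a different token mix may change which rows survive.

```python
from typing import NamedTuple


class Row(NamedTuple):
    input_per_m: float   # USD per million input tokens
    output_per_m: float  # USD per million output tokens
    mmmu: float          # MMMU score


# A few rows copied from the leaderboard (model names are not in this text).
ROWS = [
    Row(1.10, 4.40, 79.2),
    Row(1.25, 10.00, 79.1),
    Row(0.40, 3.20, 78.1),
    Row(0.10, 0.40, 76.8),
    Row(5.00, 25.00, 76.3),
    Row(0.04, 0.15, 53.9),
    Row(2.50, 10.00, 42.0),
]


def blended_price(row, input_share=0.75):
    """Hypothetical blend: weight the two prices by an assumed token mix."""
    return input_share * row.input_per_m + (1.0 - input_share) * row.output_per_m


def pareto_frontier(rows, input_share=0.75):
    """Keep rows that no cheaper (or equally priced) row matches or beats on score."""
    frontier = []
    best_score = float("-inf")
    # Walk from cheapest to most expensive; on a price tie, higher score first.
    ordered = sorted(rows, key=lambda r: (blended_price(r, input_share), -r.mmmu))
    for row in ordered:
        if row.mmmu > best_score:
            frontier.append(row)
            best_score = row.mmmu
    return frontier


for row in pareto_frontier(ROWS):
    print(f"${blended_price(row):.4f} blended per M tokens -> MMMU {row.mmmu}")
```

Rows that drop out of the frontier cost more than some other row in the sample without scoring higher on MMMU, under the assumed token mix.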