American Invitational Mathematics Examination 2024 problems testing olympiad-level mathematical reasoning.
Data from Artificial Analysis
As of May 19, 2026, the top-scoring model on AIME 2024 is GPT-5 at 95.7%, followed by Grok 4 at 94.3% and o4 Mini at 94.0%. 123 models have been evaluated on this benchmark.
Last updated: May 19, 2026
Models
123
Best Score
95.7
Average
41.6
Std Dev
31.0
Provider | Model | Input $/M | Output $/M | AIME 2024 | Actions |
|---|---|---|---|---|---|
$1.250 | $10.000 | 95.7 | |||
$3.000 | $15.000 | 94.3 | |||
$1.100 | $4.400 | 94.0 | |||
$0.149 | $0.900 | 94.0 | |||
$0.250 | $0.500 | 93.3 | |||
$1.250 | $10.000 | 91.7 | |||
$0.080 | $0.300 | 90.7 | |||
$2.000 | $8.000 | 90.3 | |||
$0.500 | $2.150 | 89.3 | |||
$1.000 | $10.000 | 88.7 | |||
$0.600 | $2.200 | 87.3 | |||
$1.250 | $10.000 | 87.0 | |||
$1.100 | $4.400 | 86.0 | |||
$0.100 | $0.400 | 86.0 | |||
$0.400 | $2.200 | 84.7 | |||
$1.250 | $10.000 | 84.3 | |||
$0.300 | $2.500 | 84.3 | |||
$0.455 | $0.900 | 84.0 | |||
$1.250 | $10.000 | 83.0 | |||
$0.300 | $2.500 | 82.3 | |||
$0.400 | $2.200 | 81.3 | |||
$0.080 | $0.280 | 80.7 | |||
$2.000 | $8.000 | 79.0 | |||
$0.900 | $0.900 | 78.0 | |||
$3.000 | $15.000 | 77.3 | |||
$1.100 | $4.400 | 77.0 | |||
$0.060 | $0.200 | 76.3 | |||
$15.000 | $75.000 | 75.7 | |||
$0.080 | $0.280 | 75.3 | |||
$0.050 | $0.200 | 74.7 | |||
$0.090 | $0.300 | 72.7 | |||
$15.000 | $60.000 | 72.3 | |||
$0.071 | $0.100 | 71.7 | |||
$0.100 | $0.400 | 70.3 | |||
$0.550 | $2.200 | 69.3 | |||
$0.550 | $2.200 | 69.3 | |||
$0.290 | $0.290 | 68.7 | |||
$0.550 | $2.000 | 68.3 | |||
$0.130 | $0.850 | 67.3 | |||
$0.700 | $0.800 | 67.0 | |||
$0.200 | $0.200 | 65.7 | |||
$0.100 | $0.400 | 58.3 | |||
$15.000 | $75.000 | 56.3 | |||
$0.200 | $0.770 | 52.0 | |||
$0.300 | $2.500 | 50.0 | |||
$0.100 | $0.400 | 50.0 | |||
$0.280 | $0.900 | 49.3 | |||
$1.000 | $1.000 | 48.7 | |||
$3.000 | $15.000 | 48.7 | |||
$0.220 | $0.900 | 47.7 | |||
$0.900 | $0.900 | 45.3 | |||
$0.400 | $2.000 | 44.0 | |||
$2.000 | $8.000 | 43.7 | |||
$0.300 | $2.500 | 43.3 | |||
$0.200 | $0.800 | 43.0 | |||
$3.000 | $15.000 | 40.7 | |||
$0.150 | $0.600 | 39.0 | |||
$1.250 | $10.000 | 36.7 | |||
$0.100 | $0.400 | 33.0 | |||
$3.000 | $15.000 | 33.0 | |||
$0.455 | $0.900 | 32.7 | |||
$0.075 | $0.200 | 32.3 | |||
$0.075 | $0.300 | 30.3 | |||
$0.080 | $0.280 | 30.3 | |||
$0.100 | $0.320 | 30.0 | |||
$0.070 | $0.270 | 29.7 | |||
$3.000 | $15.000 | 29.0 | |||
$0.080 | $0.300 | 28.3 | |||
$0.060 | $0.200 | 28.0 | |||
$0.075 | $0.300 | 27.7 | |||
$0.080 | $0.280 | 26.0 | |||
$0.080 | $0.160 | 25.3 | |||
$0.200 | $0.770 | 25.3 | |||
$0.900 | $0.900 | 24.7 | |||
$0.050 | $0.200 | 24.3 | |||
$0.050 | $0.200 | 23.7 | |||
$1.040 | $4.160 | 23.3 | |||
$3.000 | $15.000 | 22.3 | |||
$0.040 | $0.130 | 22.0 | |||
$0.900 | $0.900 | 21.3 | |||
$0.200 | $0.200 | 21.3 | |||
$0.200 | $0.200 | 21.3 | |||
$0.200 | $0.200 | 21.3 | |||
$0.100 | $0.400 | 19.3 | |||
$0.340 | $0.390 | 17.3 | |||
$2.500 | $12.500 | 17.0 | |||
$0.360 | $0.400 | 16.0 | |||
$3.000 | $15.000 | 15.7 | |||
$5.000 | $15.000 | 15.0 | |||
$0.065 | $0.140 | 14.3 | |||
$0.060 | $0.120 | 13.7 | |||
$0.100 | $0.400 | 13.7 | |||
$0.200 | $0.600 | 13.0 | |||
$0.660 | $0.800 | 12.0 | |||
$0.033 | $0.130 | 12.0 | |||
$0.150 | $0.600 | 11.7 | |||
$2.000 | $6.000 | 11.0 | |||
$0.900 | $0.900 | 11.0 | |||
$0.800 | $3.200 | 10.7 | |||
$0.060 | $0.240 | 10.7 | |||
$2.500 | $10.000 | 9.7 | |||
$0.060 | $0.060 | 9.3 | |||
$2.000 | $6.000 | 9.3 | |||
$0.100 | $0.300 | 9.3 | |||
$0.035 | $0.140 | 8.0 | |||
$0.050 | $0.080 | 8.0 | |||
$0.020 | $0.050 | 7.7 | |||
$2.000 | $6.000 | 7.0 | |||
$0.030 | $0.050 | 6.7 | |||
$0.400 | $2.000 | 6.7 | |||
$0.040 | $0.080 | 6.3 | |||
$2.000 | $8.000 | 5.7 | |||
$0.200 | $0.200 | 5.3 | |||
$0.800 | $4.000 | 3.3 | |||
$0.300 | $0.300 | 2.3 | |||
$0.250 | $1.250 | 1.0 | |||
$0.070 | $0.280 | 0.3 | |||
$0.140 | $0.420 | - | |||
$0.500 | $1.500 | - | |||
$1.200 | $1.200 | - | |||
$0.510 | $0.740 | - | |||
$0.040 | $0.040 | - | |||
$0.020 | $0.020 | - |
Pricing from OpenRouter. Benchmarks from Artificial Analysis.
Get our weekly newsletter on pricing changes, new releases, and tools.
American Invitational Mathematics Examination 2024 problems testing olympiad-level mathematical reasoning.
This leaderboard shows all models with AIME 2024 benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
Built by @aellman
2026 68 Ventures, LLC. All rights reserved.