Price Per TokenPrice Per Token

MultiChallenge Leaderboard

MultiChallenge — general knowledge benchmark with diverse challenge types.

Data from LayerLens

Models

2

Best Score

37.4

Average

28.2

Std Dev

9.2

Categories
General Knowledge
Provider
Model
Input $/M
Output $/M
MultiChallenge
Actions
$0.400
$2.000
37.4
$0.800
$3.200
19.0

Pricing from OpenRouter. Benchmarks from Artificial Analysis.

108 out of our 483 tracked models have had a price change in March.

Get our weekly newsletter on pricing changes, new releases, and tools.

About MultiChallenge

MultiChallenge — general knowledge benchmark with diverse challenge types.

This leaderboard shows all models with MultiChallenge benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.

Frequently Asked Questions

MultiChallenge — general knowledge benchmark with diverse challenge types.
As of March 15, 2026, Mistral Medium 3.1 leads the MultiChallenge leaderboard with a score of 37.4. Rankings change as new models are released and evaluated.
Currently 2 models have been evaluated on MultiChallenge, with an average score of 28.2 and standard deviation of 9.2.
Benchmark scores are updated when new evaluations are published by our data sources (Artificial Analysis and LayerLens). Pricing data is refreshed daily from OpenRouter.