Price Per Token

MMLU-Pro Leaderboard

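MMLU-Pro is a harder successor to the Massive Multitask Language Understanding (MMLU) benchmark. The original MMLU tests knowledge across 57 subjects spanning STEM, humanities, social sciences, and professional domains. MMLU-Pro covers 14 broader disciplines and gives each question ten answer choices instead of four.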

About MMLU-Pro

This leaderboard shows all models with MMLU-Pro benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
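
As a rough illustration of comparing performance against cost, the sketch below ranks a few entries by score and computes a simple score-per-dollar ratio. The model names, scores, and prices are made-up placeholders, not values from this leaderboard, and the `ModelEntry` fields and the `score_per_dollar` metric are assumptions chosen for the example rather than anything this site defines.

```python
from dataclasses import dataclass


@dataclass
class ModelEntry:
    name: str
    mmlu_pro: float        # benchmark score in percent (placeholder values)
    price_per_mtok: float  # USD per million tokens (hypothetical single price)


def rank_by_score(entries):
    """Return entries ordered from highest to lowest MMLU-Pro score."""
    return sorted(entries, key=lambda e: e.mmlu_pro, reverse=True)


def score_per_dollar(entry):
    """Score points per USD per million tokens; free models map to infinity."""
    if entry.price_per_mtok == 0:
        return float("inf")
    return entry.mmlu_pro / entry.price_per_mtok


# Placeholder data for illustration only.
entries = [
    ModelEntry("model-b", 75.0, 2.5),
    ModelEntry("model-a", 80.0, 10.0),
    ModelEntry("model-c", 60.0, 0.5),
]

for e in rank_by_score(entries):
    print(f"{e.name}: score={e.mmlu_pro}, score per dollar={score_per_dollar(e):.1f}")
```

A single ratio like this hides differences between input and output token pricing, so it is best treated as a first-pass filter rather than a final ranking.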