Price Per Token

MMLU-Pro Leaderboard

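MMLU-Pro is a harder, more reasoning-focused successor to the Massive Multitask Language Understanding (MMLU) benchmark. It tests knowledge across 14 subject areas spanning STEM, humanities, social sciences, and professional domains, with up to ten answer options per question. (The figure of 57 subjects belongs to the original MMLU, not MMLU-Pro.)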

About MMLU-Pro


This leaderboard shows all models with MMLU-Pro benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
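
As a rough illustration of comparing performance against cost, the sketch below ranks entries by score and computes a simple score-per-dollar figure. The model names, scores, and prices are made-up placeholders rather than leaderboard data, and `ModelEntry`, `rank_by_score`, and `points_per_dollar` are hypothetical names chosen for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelEntry:
    """One leaderboard row (hypothetical structure for illustration)."""
    name: str
    mmlu_pro_score: float          # accuracy, in percent
    usd_per_million_tokens: float  # assumed blended price


def rank_by_score(entries):
    """Return entries sorted from highest to lowest MMLU-Pro score."""
    return sorted(entries, key=lambda e: e.mmlu_pro_score, reverse=True)


def points_per_dollar(entry):
    """Score points per USD spent on one million tokens."""
    if entry.usd_per_million_tokens == 0:
        return float("inf")  # free models have unbounded value on this metric
    return entry.mmlu_pro_score / entry.usd_per_million_tokens


# Placeholder values only; these are not real benchmark results or prices.
entries = [
    ModelEntry("model-b", 70.0, 2.0),
    ModelEntry("model-a", 80.0, 10.0),
    ModelEntry("model-c", 60.0, 0.5),
]

ranked = rank_by_score(entries)
best_value = max(entries, key=points_per_dollar)

for e in ranked:
    print(f"{e.name}: {e.mmlu_pro_score:.1f}% at ${e.usd_per_million_tokens:.2f}/M tokens")
print("Best value:", best_value.name)
```

A single ratio like this is only one way to weigh score against price; a small score gap can matter more than a large price gap for some workloads, so treat it as a starting point rather than a verdict.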

Built by @aellman

© 2026 68 Ventures, LLC. All rights reserved.