Competition mathematics problems requiring multi-step reasoning, covering algebra, geometry, number theory, and calculus.

Best LLMs for OpenClaw— Vote for which model works best with OpenClaw
112 out of our 301 tracked models have had a price change in February.
Get our weekly newsletter on pricing changes, new releases, and tools.
Competition mathematics problems requiring multi-step reasoning, covering algebra, geometry, number theory, and calculus.
This leaderboard shows all models with MATH (Hard) benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
Built by @aellman
2026 68 Ventures, LLC. All rights reserved.