American Invitational Mathematics Examination problems testing olympiad-level mathematical reasoning with integer answers from 000-999.

Best LLMs for OpenClaw— Vote for which model works best with OpenClaw
112 out of our 301 tracked models have had a price change in February.
Get our weekly newsletter on pricing changes, new releases, and tools.
American Invitational Mathematics Examination problems testing olympiad-level mathematical reasoning with integer answers from 000-999.
This leaderboard shows all models with AIME 2024 benchmark scores, ranked from highest to lowest. Pricing data is included to help you compare performance against cost.
Built by @aellman
2026 68 Ventures, LLC. All rights reserved.