Track the evolution of AI models over time. Compare benchmark scores, pricing trends, and the open source vs closed source frontier.
How does model performance correlate with API pricing? Higher benchmark scores with lower prices indicate better value.
Track the frontier of AI capabilities as new models are released each month.
Provider frontier comparison - MMLU-Pro benchmark (includes reasoning variants)
Provider frontier comparison - Graduate-level science (includes reasoning variants)
Provider frontier comparison - Real-world coding (includes reasoning variants)
Provider frontier comparison - Competitive programming (includes reasoning variants)
Provider frontier comparison - Competition math (includes reasoning variants)
Provider frontier comparison - Maximum context window
Compare the frontier capabilities between open source and proprietary models over time.
Comparing the best open source and closed source models each month
Graduate-level science benchmark comparison
Real-world coding benchmark comparison
Competitive programming benchmark comparison
Competition math benchmark comparison
Maximum context window comparison
Compare the frontier AI capabilities by country of origin. Track which nations lead in different benchmarks over time.
Best model per country each month
Graduate-level science benchmark by country
Real-world coding benchmark by country
Competitive programming benchmark by country
Competition math benchmark by country
Maximum context window by country
Data includes 302 models from 2023-05 to 2026-02
Benchmark data sourced from Artificial Analysis. Pricing data updated daily.
Built by @aellman
2026 68 Ventures, LLC. All rights reserved.