50k devs visit Price Per Token every month. Become a sponsor

|Follow:

LLM Trends

Track the evolution of AI models over time. Compare benchmark and pricing trends, and the open source vs closed source frontier.

Frontier Pricing Benchmark vs Price Over Time Open vs Closed By Country

Frontier Model Pricing

How has the cost of frontier output tokens evolved for each major AI lab? Tracks each lab's flagship model line over time.

Frontier Output Token Cost Over Time

Flagship model output price per 1M tokens by lab — excludes mini, fast, and reasoning-only variants

Benchmark vs Price

How does model performance correlate with API pricing? Higher benchmark scores with lower prices indicate better value.

MMLU-Pro vs Price

Loading chart...

GPQA vs Price

Loading chart...

Aider (Coding) vs Price

Loading chart...

LiveCodeBench vs Price

Loading chart...

MATH Hard vs Price

Loading chart...

Context Length vs Price

Loading chart...

Benchmarks Over Time

Track the frontier of AI capabilities as new models are released each month.

MMLU-Pro Frontier Over Time

Provider frontier comparison - MMLU-Pro benchmark (includes reasoning variants)

GPQA Frontier Over Time

Provider frontier comparison - Graduate-level science (includes reasoning variants)

Aider Frontier Over Time

Provider frontier comparison - Real-world coding (includes reasoning variants)

LiveCodeBench Frontier Over Time

Provider frontier comparison - Competitive programming (includes reasoning variants)

MATH Hard Frontier Over Time

Provider frontier comparison - Competition math (includes reasoning variants)

Context Length Frontier Over Time

Provider frontier comparison - Maximum context window

Open Source vs Closed Source

Compare the frontier capabilities between open source and proprietary models over time.

MMLU-Pro: Open vs Closed

Comparing the best open source and closed source models each month

GPQA: Open vs Closed

Graduate-level science benchmark comparison

Aider: Open vs Closed

Real-world coding benchmark comparison

LiveCodeBench: Open vs Closed

Competitive programming benchmark comparison

MATH Hard: Open vs Closed

Competition math benchmark comparison

Context Length: Open vs Closed

Maximum context window comparison

By Country

Compare the frontier AI capabilities by country of origin. Track which nations lead in different benchmarks over time.

MMLU-Pro: By Country

Best model per country each month

GPQA: By Country

Graduate-level science benchmark by country

Aider: By Country

Real-world coding benchmark by country

LiveCodeBench: By Country

Competitive programming benchmark by country

MATH Hard: By Country

Competition math benchmark by country

Context Length: By Country

Maximum context window by country

Data includes 315 models from 2023-05 to 2026-05

Benchmark data sourced from Artificial Analysis. Pricing data updated daily.

Built by @aellman

Tools

Directories

Models & Pricing

Endpoints

Rankings

News

Advertise | Terms of Service | Privacy Policy