Price Per TokenPrice Per Token
Google
Google
vs
OpenAI
OpenAI

Gemini 2.5 Flash vs GPT-4

A detailed comparison of pricing, benchmarks, and capabilities

108 out of our 483 tracked models have had a price change in March.

Get our weekly newsletter on pricing changes, new releases, and tools.

Key Takeaways

Gemini 2.5 Flash wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Larger context window
  • Faster response time
  • Higher intelligence benchmark
  • Better at coding
  • Better at math
  • Supports vision
  • Has reasoning mode

GPT-4 wins:

  • No clear advantages in compared metrics
Price Advantage
Gemini 2.5 Flash
Benchmark Advantage
Gemini 2.5 Flash
Context Window
Gemini 2.5 Flash
Speed
Gemini 2.5 Flash

Pricing Comparison

Price Comparison

MetricGemini 2.5 FlashGPT-4Winner
Input (per 1M tokens)$0.30$30.00 Gemini 2.5 Flash
Output (per 1M tokens)$2.50$60.00 Gemini 2.5 Flash
Cache Read (per 1M)$0.03N/A Gemini 2.5 Flash
Cache Write (per 1M)$0.08N/A Gemini 2.5 Flash
Using a 3:1 input/output ratio, Gemini 2.5 Flash is 98% cheaper overall.

Gemini 2.5 Flash Providers

No provider data available

GPT-4 Providers

No provider data available

Benchmark Comparison

8
Benchmarks Compared
2
Gemini 2.5 Flash Wins
1
GPT-4 Wins

Benchmark Scores

BenchmarkGemini 2.5 FlashGPT-4Winner
Intelligence Index
Overall intelligence score
20.612.8
Coding Index
Code generation & understanding
17.813.1
Math Index
Mathematical reasoning
60.3--
MMLU Pro
Academic knowledge
80.9--
GPQA
Graduate-level science
68.3--
LiveCodeBench
Competitive programming
49.5--
Aider
Real-world code editing
55.167.7
AIME
Competition math
50.0--
Gemini 2.5 Flash shows stronger mathematical reasoning abilities.

Cost vs Quality

X-axis:
Y-axis:
Loading chart...
Gemini 2.5 Flash
Other models

Context & Performance

Context Window

Gemini 2.5 Flash
1,048,576
tokens
GPT-4
8,191
tokens
Gemini 2.5 Flash has a 99% larger context window.

Speed Performance

MetricGemini 2.5 FlashGPT-4Winner
Tokens/second205.9 tok/s33.4 tok/s
Time to First Token0.40s0.98s
Gemini 2.5 Flash responds 517% faster on average.

Capabilities

Feature Comparison

FeatureGemini 2.5 FlashGPT-4
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyGemini 2.5 FlashGPT-4
LicenseProprietaryProprietary
AuthorGoogleOpenAI
ReleasedJun 2025May 2023

Gemini 2.5 Flash Modalities

Input
fileimagetextaudiovideo
Output
text

GPT-4 Modalities

Input
text
Output
text

Related Comparisons

Compare Gemini 2.5 Flash with:

Compare GPT-4 with:

Frequently Asked Questions

Gemini 2.5 Flash has cheaper input pricing at $0.30/M tokens. Gemini 2.5 Flash has cheaper output pricing at $2.50/M tokens.
Gemini 2.5 Flash scores higher on coding benchmarks with a score of 17.8, compared to GPT-4's score of 13.1.
Gemini 2.5 Flash has a 1,048,576 token context window, while GPT-4 has a 8,191 token context window.
Gemini 2.5 Flash supports vision. GPT-4 does not support vision.