Price Per TokenPrice Per Token
Google
Google
vs
Google
Google

Gemini 2.0 Flash Lite vs Gemma 4 31B Instruct

A detailed comparison of pricing, benchmarks, and capabilities

Get our weekly newsletter on pricing changes, new releases, and tools.

Join the Price Per Token Community

Key Takeaways

Gemini 2.0 Flash Lite wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Larger context window

Gemma 4 31B Instruct wins:

  • Faster response time
  • Higher intelligence benchmark
  • Better at coding
Price Advantage
Gemini 2.0 Flash Lite
Benchmark Advantage
Gemma 4 31B Instruct
Context Window
Gemini 2.0 Flash Lite
Speed
Gemma 4 31B Instruct

Pricing Comparison

Benchmark Comparison

Context & Performance

Capabilities

Feature Comparison

FeatureGemini 2.0 Flash LiteGemma 4 31B Instruct
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyGemini 2.0 Flash LiteGemma 4 31B Instruct
LicenseProprietaryProprietary
AuthorGoogleGoogle
ReleasedFeb 2025Apr 2026

Gemini 2.0 Flash Lite Modalities

Input
textimagefileaudiovideo
Output
text

Gemma 4 31B Instruct Modalities

Input
imagetextvideo
Output
text

Frequently Asked Questions

Gemini 2.0 Flash Lite has cheaper input pricing at $0.07/M tokens. Gemini 2.0 Flash Lite has cheaper output pricing at $0.30/M tokens.
Gemini 2.0 Flash Lite has a 1,048,576 token context window, while Gemma 4 31B Instruct has a 262,144 token context window.
Gemini 2.0 Flash Lite supports vision. Gemma 4 31B Instruct supports vision.