Price Per TokenPrice Per Token
Google

Gemma 3 4B API Pricing 2026

Compare pricing, benchmarks, and providers for Gemma 3 4B. Find the best value for your use case.

Sponsor Price Per Token Reach 5000+ developers comparing LLM APIs

116 out of our 296 tracked models have had a price change in January.

Make informed model choices with updates on pricing, new releases, and tools with our weekly newsletter.

Last updated: January 16, 2026 at 08:05 AM

Overview

Gemma 3 4B was released on March 13, 2025. Pricing starts at $0.017 per million input tokens and $0.068 per million output tokens. The model supports a context window of up to 96K tokens. API access is available through Google.

Context Window
96K
tokens
Pricing
Input$0.017
Output$0.068
per 1M tokens
Speed
Output37 tok/s
TTFT1.01s
median latency
Capabilities
VisionTools

Pricing Comparison

Compare Gemma 3 4B with 10 similar models by price.

Current Pricing (per 1M tokens)

11 models

Provider
Model
Input $/M
Output $/M
Coding
MMLU
GPQA
Context
Actions
$0.017
$0.068
11.2
41.7
29.1
96,000
Try
$0.010
$0.020
15.1
50.5
34.4
32,768
Try
$0.010
$0.020
32,768
Try
$0.017
$0.110
131,000
Try
$0.020
$0.100
19.5
58.0
38.2
32,768
Try
$0.020
$0.100
77.7
74.8
68.8
131,072
Try
$0.020
$0.040
14.6
48.8
29.6
32,768
Try
$0.020
$0.060
131,072
Try
$0.020
$0.020
8.3
34.7
25.5
131,072
Try
$0.020
$0.050
11.6
47.6
25.9
16,384
Try
$0.020
$0.040
26.4
8.7
131,072
Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

Gemma 3 4B is available from multiple providers with different pricing and availability.

3
Providers
$0.017
Cheapest Input/M
$0.068
Cheapest Output/M
Chutes
Best Price

Provider Pricing

3 models

Provider
Input $/M
Output $/M
Uptime
C
Chutes
Best Price
$0.017
$0.068
100.0%
OpenRouter
OpenRouter
$0.017
$0.068
-
D
DeepInfra
$0.040
$0.080
100.0%
Provider pricing data sourced from OpenRouter and Helicone

Cost vs. Quality

Compare Gemma 3 4B's benchmark performance against all models.

X-axis:
Y-axis:
Loading chart...
Gemma 3 4B
Other models

Detailed Benchmark Scores

Intelligence
6.6 (0th pct)
Coding
2.9 (2th pct)
Math
12.7 (12th pct)
MMLU Pro
41.7 (10th pct)
GPQA
29.1 (9th pct)
LiveCodeBench
11.2 (4th pct)
AIME
6.3 (8th pct)
Aider
-
Benchmark data from Artificial Analysis and HuggingFace Open LLM Leaderboard

Try Gemma 3 4B

Use our Calculator

Estimate your costs based on expected token usage.

Open Cost Calculator

Try in Playground

Test Gemma 3 4B directly in your browser.

Open Playground

Frequently Asked Questions

Gemma 3 4B costs $0.000017 per 1,000 input tokens and $0.000068 per 1,000 output tokens.
Gemma 3 4B is available from 3 provider(s). Chutes offers the best price.
We compare Gemma 3 4B with 10 similarly-priced models. See the benchmark scatter plot above to compare quality vs cost.

All Google Models

See pricing for all Google models.

View all Google models →