Price Per Token

Llama 2 7B Chat API Pricing 2026

Compare pricing, benchmarks, and providers for Llama 2 7B Chat. Find the best value for your use case.

Last updated: April 23, 2026 at 08:37 AM

Overview

Llama 2 7B Chat pricing starts at $0.200 per million input tokens and $0.200 per million output tokens. The model supports a context window of up to 4K (4,096) tokens. API access is available through Meta-llama.
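As a rough illustration of how these per-million prices translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` is hypothetical, and the check that input plus output tokens must fit in the 4,096-token window is an assumption about how the context limit is applied, not something this page states.

```python
# Prices quoted on this page, in USD per 1 million tokens.
INPUT_PRICE_PER_M = 0.200
OUTPUT_PRICE_PER_M = 0.200

# Context window listed on this page, in tokens.
CONTEXT_WINDOW = 4096


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumption: input and output tokens share the same context window.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if input_tokens + output_tokens > CONTEXT_WINDOW:
        raise ValueError("request exceeds the 4,096-token context window")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # 3,000 prompt tokens plus 1,000 completion tokens.
    print(f"${estimate_cost(3000, 1000):.6f}")
```

Under these prices, a request with 3,000 input tokens and 1,000 output tokens comes to about $0.0008.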

Context Window
4K tokens
Pricing (per 1M tokens)
Input: $0.200
Output: $0.200
Cached: $0.100
Speed (median latency)
Output: 116 tok/s
TTFT: 1.91s
Capabilities
Text only
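The speed figures above can be combined into a back-of-the-envelope response time: time to first token plus the time to stream the remaining output. The sketch below assumes the listed values are steady medians and ignores network and queueing variation; `estimate_response_seconds` is a hypothetical helper name.

```python
# Median figures listed on this page.
TTFT_SECONDS = 1.91            # time to first token
OUTPUT_TOKENS_PER_SEC = 116.0  # output streaming speed


def estimate_response_seconds(output_tokens: int) -> float:
    """Approximate total response time in seconds (hypothetical helper).

    Assumption: generation runs at the median output speed once the
    first token has arrived.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return TTFT_SECONDS + output_tokens / OUTPUT_TOKENS_PER_SEC


if __name__ == "__main__":
    print(f"{estimate_response_seconds(500):.2f} s")
```

For a 500-token reply this works out to roughly 6.2 seconds, most of it spent streaming output rather than waiting for the first token.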

Pricing Comparison

Compare Llama 2 7B Chat with 0 similar models by price.

Current Pricing (per 1M tokens)

1 model

Model: Llama 2 7B Chat
Input $/M: $0.200
Output $/M: $0.200
Coding: 0.2
MMLU: 16.4
GPQA: 22.7
Context: 4,096

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

No multi-provider data is available for Llama 2 7B Chat. The model may only be available from a single provider.

Cost vs. Quality

Compare Llama 2 7B Chat's benchmark performance against all models.

[Chart: Llama 2 7B Chat plotted against other models; axis selections not preserved]

Detailed Benchmark Scores

Intelligence: 9.7 (9th percentile)
Coding: -
Math: -
MMLU Pro: 16.4 (1st percentile)
GPQA: 22.7 (7th percentile)
LiveCodeBench: 0.2 (0th percentile)
AIME: - (0th percentile)
Aider: -

Benchmark data from Artificial Analysis and the Hugging Face Open LLM Leaderboard.

Frequently Asked Questions

Llama 2 7B Chat costs $0.000200 per 1,000 input tokens and $0.000200 per 1,000 output tokens.
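The per-1,000-token figure is simply the per-million price divided by 1,000. A small sketch of that conversion, using `Decimal` to avoid floating-point rounding in the displayed value (the function name is hypothetical):

```python
from decimal import Decimal


def per_million_to_per_thousand(price_per_million: str) -> Decimal:
    """Convert a USD price per 1M tokens to USD per 1,000 tokens."""
    return Decimal(price_per_million) / Decimal(1000)


if __name__ == "__main__":
    # Input and output are both listed at $0.200 per 1M tokens.
    per_thousand = per_million_to_per_thousand("0.200")
    print(f"${per_thousand:.6f} per 1,000 tokens")
```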
