Price Per TokenPrice Per Token
Meta-llama

Llama 3.1 8B Instruct API Pricing 2026

Compare pricing, benchmarks, and providers for Llama 3.1 8B Instruct. Find the best value for your use case.

Sponsor Price Per Token Reach 5000+ developers comparing LLM APIs

125 out of our 298 tracked models have had a price change in January.

Get our weekly newsletter on pricing changes, new releases, and tools.

Last updated: January 23, 2026 at 08:08 AM

Overview

Llama 3.1 8B Instruct was released on July 23, 2024. Pricing starts at $0.020 per million input tokens and $0.050 per million output tokens. The model supports a context window of up to 16K tokens. API access is available through Meta-llama.

Context Window
16K
tokens
Pricing
Input$0.020
Output$0.050
per 1M tokens
Speed
Output163 tok/s
TTFT0.35s
median latency
Capabilities
Tools

Pricing Comparison

Compare Llama 3.1 8B Instruct with 10 similar models by price.

Current Pricing (per 1M tokens)

11 models

Provider
Model
Input $/M
Output $/M
Coding
MMLU
GPQA
Context
Actions
$0.020
$0.050
11.6
47.6
25.9
16,384
Try
$0.010
$0.020
15.1
50.5
34.4
32,768
Try
$0.010
$0.020
32,768
Try
$0.017
$0.110
131,000
Try
$0.017
$0.068
11.2
41.7
29.1
96,000
Try
$0.020
$0.100
19.5
58.0
38.2
32,768
Try
$0.020
$0.100
77.7
74.8
68.8
131,072
Try
$0.020
$0.040
14.6
48.8
29.6
32,768
Try
$0.020
$0.060
131,072
Try
$0.020
$0.020
8.3
34.7
25.5
131,072
Try
$0.020
$0.040
26.4
8.7
131,072
Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

Llama 3.1 8B Instruct is available from multiple providers with different pricing and availability.

10
Providers
$0.020
Cheapest Input/M
$0.050
Cheapest Output/M
Novita
Best Price

Provider Pricing

10 models

Provider
Input $/M
Output $/M
Uptime
N
Novita
Best Price
$0.020
$0.050
99.9%
N
Nebius
$0.030
$0.090
100.0%
D
DeepInfra
$0.030
$0.050
97.7%
Groq
Groq
$0.050
$0.080
100.0%
S
SiliconFlow
$0.060
$0.060
99.9%
F
Friendli
$0.100
$0.100
100.0%
H
Hyperbolic
$0.100
$0.100
100.0%
S
SambaNova
$0.100
$0.200
99.2%
C
Cerebras
$0.100
$0.100
99.9%
C
Cloudflare
$0.150
$0.290
-
Provider pricing data sourced from OpenRouter and Helicone

Cost vs. Quality

Compare Llama 3.1 8B Instruct's benchmark performance against all models.

X-axis:
Y-axis:
Loading chart...
Llama 3.1 8B Instruct
Other models

Detailed Benchmark Scores

Intelligence
12.2 (15th pct)
Coding
4.9 (7th pct)
Math
4.3 (6th pct)
MMLU Pro
47.6 (12th pct)
GPQA
25.9 (8th pct)
LiveCodeBench
11.6 (5th pct)
AIME
7.7 (11th pct)
Aider
37.6 (15th pct)
Benchmark data from Artificial Analysis and HuggingFace Open LLM Leaderboard

Try Llama 3.1 8B Instruct

Use our Calculator

Estimate your costs based on expected token usage.

Open Cost Calculator

Try in Playground

Test Llama 3.1 8B Instruct directly in your browser.

Open Playground

Frequently Asked Questions

Llama 3.1 8B Instruct costs $0.000020 per 1,000 input tokens and $0.000050 per 1,000 output tokens.
Llama 3.1 8B Instruct is available from 10 provider(s). Novita offers the best price.
We compare Llama 3.1 8B Instruct with 10 similarly-priced models. See the benchmark scatter plot above to compare quality vs cost.

All Meta-llama Models

See pricing for all Meta-llama models.

View all Meta-llama models →