Price Per TokenPrice Per Token
Meta-llama

Llama 3.1 8B Instruct API Pricing 2026

Compare pricing, benchmarks, and providers for Llama 3.1 8B Instruct. Find the best value for your use case.

OpenClaw

Best LLMs for OpenClaw Vote for which model works best with OpenClaw

112 out of our 301 tracked models have had a price change in February.

Get our weekly newsletter on pricing changes, new releases, and tools.

Last updated: February 15, 2026 at 08:07 AM

Overview

Llama 3.1 8B Instruct was released on July 23, 2024. Pricing starts at $0.020 per million input tokens and $0.050 per million output tokens. The model supports a context window of up to 16K tokens. API access is available through Meta-llama.

Context Window
16K
tokens
Pricing
Input$0.020
Output$0.050
per 1M tokens
Speed
Output162 tok/s
TTFT0.33s
median latency
Capabilities
Tools

Pricing Comparison

Compare Llama 3.1 8B Instruct with 10 similar models by price.

Current Pricing (per 1M tokens)

11 models

Provider
Model
Input $/M
Output $/M
Coding
MMLU
GPQA
Context
Actions
$0.020
$0.050
11.6
47.6
25.9
16,384
Try
$0.000
$0.000
131,072
Try
$0.000
$0.000
131,072
Try
$0.010
$0.020
15.1
50.5
34.4
32,768
Try
$0.010
$0.020
32,768
Try
$0.017
$0.110
131,000
Try
$0.017
$0.068
11.2
41.7
29.1
96,000
Try
$0.020
$0.100
19.5
58.0
38.2
32,768
Try
$0.020
$0.040
14.6
48.8
29.6
32,768
Try
$0.020
$0.060
131,072
Try
$0.020
$0.020
8.3
34.7
25.5
131,072
Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

Llama 3.1 8B Instruct is available from multiple providers with different pricing and availability.

10
Providers
$0.020
Cheapest Input/M
$0.050
Cheapest Output/M
Nebius
Best Price

Provider Pricing

10 models

Provider
Input $/M
Output $/M
Uptime
N
Nebius
Best Price
$0.020
$0.060
99.9%
D
DeepInfra
Best Price
$0.020
$0.050
99.6%
N
Novita
Best Price
$0.020
$0.050
99.9%
Groq
Groq
$0.050
$0.080
100.0%
S
SiliconFlow
$0.060
$0.060
100.0%
F
Friendli
$0.100
$0.100
99.9%
H
Hyperbolic
$0.100
$0.100
99.8%
S
SambaNova
$0.100
$0.200
99.3%
C
Cerebras
$0.100
$0.100
99.6%
C
Cloudflare
$0.150
$0.290
100.0%
Provider pricing data sourced from OpenRouter and Helicone

Community Rankings

How does Llama 3.1 8B Instruct perform? Vote based on your experience.

Cost vs. Quality

Compare Llama 3.1 8B Instruct's benchmark performance against all models.

X-axis:
Y-axis:
Loading chart...
Llama 3.1 8B Instruct
Other models

Detailed Benchmark Scores

Intelligence
11.7 (12th pct)
Coding
4.9 (5th pct)
Math
4.3 (5th pct)
MMLU Pro
47.6 (11th pct)
GPQA
25.9 (8th pct)
LiveCodeBench
11.6 (4th pct)
AIME
7.7 (10th pct)
Aider
37.6 (15th pct)
Benchmark data from Artificial Analysis and HuggingFace Open LLM Leaderboard

Try Llama 3.1 8B Instruct

Use our Calculator

Estimate your costs based on expected token usage.

Open Cost Calculator

Try in Playground

Test Llama 3.1 8B Instruct directly in your browser.

Open Playground

Frequently Asked Questions

Llama 3.1 8B Instruct costs $0.000020 per 1,000 input tokens and $0.000050 per 1,000 output tokens.
Llama 3.1 8B Instruct is available from 10 provider(s). Nebius offers the best price.
We compare Llama 3.1 8B Instruct with 10 similarly-priced models. See the benchmark scatter plot above to compare quality vs cost.

All Meta-llama Models

See pricing for all Meta-llama models.

View all Meta-llama models →