Price Per TokenPrice Per Token
Meta-llama

Llama 3.3 70B Instruct API Pricing 2026

Compare pricing, benchmarks, and providers for Llama 3.3 70B Instruct. Find the best value for your use case.

OpenClaw

Best LLMs for OpenClaw Vote for which model works best with OpenClaw

112 out of our 301 tracked models have had a price change in February.

Get our weekly newsletter on pricing changes, new releases, and tools.

Last updated: February 15, 2026 at 08:07 AM

Overview

Llama 3.3 70B Instruct was released on December 6, 2024. Pricing starts at $0.100 per million input tokens and $0.320 per million output tokens. The model supports a context window of up to 131K tokens. API access is available through Meta-llama.

Context Window
131K
tokens
Pricing
Input$0.100
Output$0.320
per 1M tokens
Speed
Output104 tok/s
TTFT0.49s
median latency
Capabilities
Tools

Pricing Comparison

Compare Llama 3.3 70B Instruct with 10 similar models by price.

Current Pricing (per 1M tokens)

11 models

Provider
Model
Input $/M
Output $/M
Coding
MMLU
GPQA
Context
Actions
$0.100
$0.320
28.8
71.3
49.8
131,072
Try
$0.100
$0.200
26.6
52.2
40.0
65,536
Try
$0.100
$0.300
32,000
Try
$0.100
$0.300
32,768
Try
$0.100
$0.100
131,072
Try
$0.100
$0.400
29.0
69.2
48.1
131,072
Try
$0.100
$0.100
128,000
Try
$0.100
$0.400
40.0
72.4
47.4
1,048,576
Try
$0.100
$0.300
25.4
62.2
41.4
131,072
Try
$0.100
$0.400
32.6
65.7
51.2
1,047,576
Try
$0.100
$0.400
33.4
77.9
62.3
1,048,576
Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

Llama 3.3 70B Instruct is available from multiple providers with different pricing and availability.

15
Providers
$0.100
Cheapest Input/M
$0.320
Cheapest Output/M
DeepInfra
Best Price

Provider Pricing

15 models

Provider
Input $/M
Output $/M
Uptime
D
DeepInfra
Best Price
$0.100
$0.320
-
N
Novita
$0.135
$0.400
99.9%
P
Parasail
$0.220
$0.500
100.0%
N
Nebius
$0.250
$0.750
99.8%
C
Crusoe
$0.250
$0.750
100.0%
C
Cloudflare
$0.290
$2.250
99.2%
H
Hyperbolic
$0.400
$0.400
99.8%
Groq
Groq
$0.590
$0.790
99.8%
F
Friendli
$0.600
$0.600
100.0%
S
SambaNova
$0.600
$1.200
98.4%
W
WandB
$0.710
$0.710
100.0%
Google
Google
$0.720
$0.720
100.0%
C
Cerebras
$0.850
$1.200
99.3%
T
Together
$0.880
$0.880
100.0%
Llama
Llama
$N/A
$N/A
-
Provider pricing data sourced from OpenRouter and Helicone

Community Rankings

How does Llama 3.3 70B Instruct perform? Vote based on your experience.

Cost vs. Quality

Compare Llama 3.3 70B Instruct's benchmark performance against all models.

X-axis:
Y-axis:
Loading chart...
Llama 3.3 70B Instruct
Other models

Detailed Benchmark Scores

Intelligence
14.2 (25th pct)
Coding
10.7 (18th pct)
Math
7.7 (9th pct)
MMLU Pro
71.3 (35th pct)
GPQA
49.8 (29th pct)
LiveCodeBench
28.8 (24th pct)
AIME
30.0 (50th pct)
Aider
59.4 (48th pct)
Benchmark data from Artificial Analysis and HuggingFace Open LLM Leaderboard

Try Llama 3.3 70B Instruct

Use our Calculator

Estimate your costs based on expected token usage.

Open Cost Calculator

Try in Playground

Test Llama 3.3 70B Instruct directly in your browser.

Open Playground

Frequently Asked Questions

Llama 3.3 70B Instruct costs $0.000100 per 1,000 input tokens and $0.000320 per 1,000 output tokens.
Llama 3.3 70B Instruct is available from 15 provider(s). DeepInfra offers the best price.
We compare Llama 3.3 70B Instruct with 10 similarly-priced models. See the benchmark scatter plot above to compare quality vs cost.

All Meta-llama Models

See pricing for all Meta-llama models.

View all Meta-llama models →