Price Per Token

Llama 3.1 8B Instruct vs Llama 3.3 70B Instruct

A detailed comparison of pricing, benchmarks, and capabilities

Key Takeaways

Llama 3.1 8B Instruct wins:

  • Cheaper input tokens
  • Cheaper output tokens
  • Faster response time

Llama 3.3 70B Instruct wins:

  • Larger context window
  • Higher intelligence benchmark
  • Better at coding
  • Better at math

Price Advantage: Llama 3.1 8B Instruct
Benchmark Advantage: Llama 3.3 70B Instruct
Context Window: Llama 3.3 70B Instruct
Speed: Llama 3.1 8B Instruct

Pricing Comparison

Price Comparison

Metric | Llama 3.1 8B Instruct | Llama 3.3 70B Instruct | Winner
Input (per 1M tokens) | $0.02 | $0.10 | Llama 3.1 8B Instruct
Output (per 1M tokens) | $0.05 | $0.32 | Llama 3.1 8B Instruct
Using a 3:1 input/output ratio, Llama 3.1 8B Instruct is 82% cheaper overall.
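
That 82% figure is plain arithmetic on the prices in the table above. Below is a minimal Python sketch of the blended-price calculation, assuming the stated 3:1 input/output ratio:

```python
# Blended price per 1M tokens at a 3:1 input/output ratio, using the
# prices listed in the table above (USD per 1M tokens).
PRICES = {
    "Llama 3.1 8B Instruct": {"input": 0.02, "output": 0.05},
    "Llama 3.3 70B Instruct": {"input": 0.10, "output": 0.32},
}

def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens, assuming `ratio` input tokens per output token."""
    return (ratio * input_price + output_price) / (ratio + 1)

blended = {name: blended_price(p["input"], p["output"]) for name, p in PRICES.items()}
cheaper, pricier = sorted(blended, key=blended.get)
savings = 1 - blended[cheaper] / blended[pricier]

print(f"{cheaper}: ${blended[cheaper]:.4f} per 1M blended tokens")   # $0.0275
print(f"{pricier}: ${blended[pricier]:.4f} per 1M blended tokens")   # $0.1550
print(f"{cheaper} is {savings:.0%} cheaper overall")                 # 82% cheaper
```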

Llama 3.1 8B Instruct Providers

Nebius $0.02 (Cheapest)
DeepInfra $0.02 (Cheapest)
Novita $0.02 (Cheapest)
Groq $0.05
SiliconFlow $0.06

Llama 3.3 70B Instruct Providers

DeepInfra $0.10 (Cheapest)
Novita $0.14
Parasail $0.22
Nebius $0.25
Crusoe $0.25
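
The "(Cheapest)" tags above simply mark the lowest-priced provider in each list. A small illustrative sketch, with the per-1M-input-token prices hard-coded from the two lists above:

```python
# Provider prices (USD per 1M input tokens), copied from the lists above.
PROVIDERS = {
    "Llama 3.1 8B Instruct": {
        "Nebius": 0.02, "DeepInfra": 0.02, "Novita": 0.02, "Groq": 0.05, "SiliconFlow": 0.06,
    },
    "Llama 3.3 70B Instruct": {
        "DeepInfra": 0.10, "Novita": 0.14, "Parasail": 0.22, "Nebius": 0.25, "Crusoe": 0.25,
    },
}

for model, providers in PROVIDERS.items():
    lowest = min(providers.values())
    cheapest = [name for name, price in providers.items() if price == lowest]
    print(f"{model}: ${lowest:.2f}/1M input via {', '.join(cheapest)}")
```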

Benchmark Comparison

Benchmarks compared: 8
Llama 3.1 8B Instruct wins: 0
Llama 3.3 70B Instruct wins: 8

Benchmark Scores

Benchmark | Llama 3.1 8B Instruct | Llama 3.3 70B Instruct | Winner
Intelligence Index (overall intelligence score) | 11.7 | 14.2 | Llama 3.3 70B Instruct
Coding Index (code generation & understanding) | 4.9 | 10.7 | Llama 3.3 70B Instruct
Math Index (mathematical reasoning) | 4.3 | 7.7 | Llama 3.3 70B Instruct
MMLU Pro (academic knowledge) | 47.6 | 71.3 | Llama 3.3 70B Instruct
GPQA (graduate-level science) | 25.9 | 49.8 | Llama 3.3 70B Instruct
LiveCodeBench (competitive programming) | 11.6 | 28.8 | Llama 3.3 70B Instruct
Aider (real-world code editing) | 37.6 | 59.4 | Llama 3.3 70B Instruct
AIME (competition math) | 7.7 | 30.0 | Llama 3.3 70B Instruct
Llama 3.3 70B Instruct outperforms on all eight benchmarks, with especially large margins in coding and competition math.

Cost vs Quality

[Interactive chart: cost vs. quality, plotting Llama 3.1 8B Instruct against other tracked models.]

Context & Performance

Context Window

Llama 3.1 8B Instruct: 16,384 tokens (max output: 16,384 tokens)
Llama 3.3 70B Instruct: 131,072 tokens (max output: 16,384 tokens)
Llama 3.3 70B Instruct's context window is 8x larger (131,072 vs 16,384 tokens).
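
To judge whether a prompt fits either window before sending it, a rough token estimate is usually enough. The sketch below uses the listed window sizes and the common rule of thumb of roughly 4 characters per token; the exact count depends on the Llama tokenizer, so treat the result as an approximation.

```python
# Rough context-window fit check using the window sizes listed above.
# The ~4 characters per token estimate is a rule of thumb; exact counts
# depend on the Llama tokenizer.
CONTEXT_WINDOWS = {
    "Llama 3.1 8B Instruct": 16_384,
    "Llama 3.3 70B Instruct": 131_072,
}

def fits(prompt: str, window: int, reserved_output: int = 1_024) -> bool:
    """True if the estimated prompt tokens plus room for the reply fit in the window."""
    estimated_prompt_tokens = len(prompt) // 4
    return estimated_prompt_tokens + reserved_output <= window

long_document = "x" * 70_000  # ~70k characters -> roughly 17.5k tokens
for model, window in CONTEXT_WINDOWS.items():
    print(f"{model}: fits = {fits(long_document, window)}")
# Llama 3.1 8B Instruct: fits = False
# Llama 3.3 70B Instruct: fits = True
```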

Speed Performance

Metric | Llama 3.1 8B Instruct | Llama 3.3 70B Instruct | Winner
Tokens/second | 162.2 tok/s | 104.4 tok/s | Llama 3.1 8B Instruct
Time to First Token | 0.33s | 0.49s | Llama 3.1 8B Instruct
Llama 3.1 8B Instruct generates tokens about 55% faster and reaches its first token sooner.
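
Throughput and time to first token combine into end-to-end latency. The rough sketch below estimates total response time from the figures above; real-world latency also varies with provider load and prompt length.

```python
# Estimated end-to-end latency = time to first token + decode time,
# using the throughput and TTFT figures from the table above.
SPEED = {
    "Llama 3.1 8B Instruct": {"ttft_s": 0.33, "tokens_per_s": 162.2},
    "Llama 3.3 70B Instruct": {"ttft_s": 0.49, "tokens_per_s": 104.4},
}

def response_time(ttft_s: float, tokens_per_s: float, output_tokens: int) -> float:
    """Seconds until the full reply is generated."""
    return ttft_s + output_tokens / tokens_per_s

for model, s in SPEED.items():
    t = response_time(s["ttft_s"], s["tokens_per_s"], output_tokens=500)
    print(f"{model}: ~{t:.1f}s for a 500-token reply")
# Llama 3.1 8B Instruct: ~3.4s
# Llama 3.3 70B Instruct: ~5.3s
```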

Capabilities

Feature Comparison

Feature | Llama 3.1 8B Instruct | Llama 3.3 70B Instruct
Vision (Image Input) | No | No
Tool/Function Calls | - | -
Reasoning Mode | - | -
Audio Input | No | No
Audio Output | No | No
PDF Input | No | No
Prompt Caching | - | -
Web Search | - | -

License & Release

Property | Llama 3.1 8B Instruct | Llama 3.3 70B Instruct
License | Open Source | Open Source
Author | Meta-llama | Meta-llama
Released | Jul 2024 | Dec 2024

Llama 3.1 8B Instruct Modalities

Input: text
Output: text

Llama 3.3 70B Instruct Modalities

Input: text
Output: text

Frequently Asked Questions

Which model is cheaper?
Llama 3.1 8B Instruct is cheaper on both input ($0.02/M tokens vs $0.10/M) and output ($0.05/M tokens vs $0.32/M).

Which model is better at coding?
Llama 3.3 70B Instruct scores higher on coding benchmarks, with a Coding Index of 10.7 versus Llama 3.1 8B Instruct's 4.9.

Which model has the larger context window?
Llama 3.1 8B Instruct has a 16,384-token context window, while Llama 3.3 70B Instruct has a 131,072-token context window.

Do these models support vision?
No. Neither Llama 3.1 8B Instruct nor Llama 3.3 70B Instruct supports image input.