Price Per TokenPrice Per Token
Meta-llama

Llama 3.1 405B Instruct API Pricing 2026

Compare pricing, benchmarks, and providers for Llama 3.1 405B Instruct. Find the best value for your use case.

OpenClaw

Best LLMs for OpenClaw Vote for which model works best with OpenClaw

112 out of our 301 tracked models have had a price change in February.

Get our weekly newsletter on pricing changes, new releases, and tools.

Last updated: February 15, 2026 at 08:07 AM

Overview

Llama 3.1 405B Instruct was released on July 23, 2024. Pricing starts at $4.00 per million input tokens and $4.00 per million output tokens. The model supports a context window of up to 131K tokens. API access is available through Meta-llama.

Context Window
131K
tokens
Pricing
Input$4.00
Output$4.00
per 1M tokens
Speed
Output25 tok/s
TTFT0.79s
median latency
Capabilities
Tools

Pricing Comparison

Compare Llama 3.1 405B Instruct with 10 similar models by price.

Current Pricing (per 1M tokens)

11 models

Provider
Model
Input $/M
Output $/M
Coding
MMLU
GPQA
Context
Actions
$4.000
$4.000
30.5
73.2
51.5
131,000
Try
$3.000
$15.000
81.9
86.6
87.7
256,000
Try
$3.000
$15.000
42.5
79.9
69.3
131,072
Try
$3.000
$15.000
44.9
83.7
68.3
1,000,000
Try
$3.000
$15.000
27.5
75.5
57.8
200,000
Try
$3.000
$15.000
39.4
80.3
65.6
200,000
Try
$3.000
$3.000
16,000
Try
$3.000
$5.000
16,384
Try
$3.750
$7.500
6,144
Try
$4.000
$8.000
131,072
Try
$4.000
$4.000
32,768
Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

Llama 3.1 405B Instruct is available from multiple providers with different pricing and availability.

2
Providers
$4.000
Cheapest Input/M
$4.000
Cheapest Output/M
Hyperbolic
Best Price

Provider Pricing

2 models

Provider
Input $/M
Output $/M
Uptime
H
Hyperbolic
Best Price
$4.000
$4.000
-
Google
Google
$5.000
$16.000
100.0%
Provider pricing data sourced from OpenRouter and Helicone

Community Rankings

How does Llama 3.1 405B Instruct perform? Vote based on your experience.

Cost vs. Quality

Compare Llama 3.1 405B Instruct's benchmark performance against all models.

X-axis:
Y-axis:
Loading chart...
Llama 3.1 405B Instruct
Other models

Detailed Benchmark Scores

Intelligence
14.2 (25th pct)
Coding
14.5 (38th pct)
Math
3.0 (2th pct)
MMLU Pro
73.2 (39th pct)
GPQA
51.5 (31th pct)
LiveCodeBench
30.5 (29th pct)
AIME
21.3 (35th pct)
Aider
66.2 (66th pct)
Benchmark data from Artificial Analysis and HuggingFace Open LLM Leaderboard

Try Llama 3.1 405B Instruct

Use our Calculator

Estimate your costs based on expected token usage.

Open Cost Calculator

Try in Playground

Test Llama 3.1 405B Instruct directly in your browser.

Open Playground

Frequently Asked Questions

Llama 3.1 405B Instruct costs $0.004000 per 1,000 input tokens and $0.004000 per 1,000 output tokens.
Llama 3.1 405B Instruct is available from 2 provider(s). Hyperbolic offers the best price.
We compare Llama 3.1 405B Instruct with 10 similarly-priced models. See the benchmark scatter plot above to compare quality vs cost.

All Meta-llama Models

See pricing for all Meta-llama models.

View all Meta-llama models →