Price Per TokenPrice Per Token
Nvidia

Llama 3.1 Nemotron 70B Instruct API Pricing 2026

Compare pricing, benchmarks, and providers for Llama 3.1 Nemotron 70B Instruct. Find the best value for your use case.

OpenClaw

Best LLMs for OpenClaw Vote for which model works best with OpenClaw

112 out of our 301 tracked models have had a price change in February.

Get our weekly newsletter on pricing changes, new releases, and tools.

Last updated: February 15, 2026 at 08:07 AM

Overview

Llama 3.1 Nemotron 70B Instruct was released on October 15, 2024. Pricing starts at $1.20 per million input tokens and $1.20 per million output tokens. The model supports a context window of up to 131K tokens. API access is available through Nvidia.

Context Window
131K
tokens
Pricing
Input$1.20
Output$1.20
per 1M tokens
Speed
Output31 tok/s
TTFT0.37s
median latency
Capabilities
Tools

Pricing Comparison

Compare Llama 3.1 Nemotron 70B Instruct with 10 similar models by price.

Current Pricing (per 1M tokens)

11 models

Provider
Model
Input $/M
Output $/M
Coding
MMLU
GPQA
Context
Actions
$1.200
$1.200
16.9
69.0
46.5
131,072
Try
$1.000
$5.000
51.1
80.0
64.6
200,000
Try
$1.000
$3.000
54.6
72.9
53.6
131,072
Try
$1.000
$1.000
29.5
68.9
47.1
127,072
Try
$1.000
$5.000
1,000,000
Try
$1.000
$1.000
131,072
Try
$1.100
$4.400
200,000
Try
$1.100
$4.400
85.9
83.2
78.4
200,000
Try
$1.100
$4.400
73.4
80.2
77.3
200,000
Try
$1.100
$4.400
71.7
79.1
74.8
200,000
Try
$1.200
$6.000
262,144
Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

Llama 3.1 Nemotron 70B Instruct is available from multiple providers with different pricing and availability.

1
Providers
$1.200
Cheapest Input/M
$1.200
Cheapest Output/M
DeepInfra
Best Price

Provider Pricing

1 models

Provider
Input $/M
Output $/M
Uptime
D
DeepInfra
Best Price
$1.200
$1.200
100.0%
Provider pricing data sourced from OpenRouter and Helicone

Community Rankings

How does Llama 3.1 Nemotron 70B Instruct perform? Vote based on your experience.

Cost vs. Quality

Compare Llama 3.1 Nemotron 70B Instruct's benchmark performance against all models.

X-axis:
Y-axis:
Loading chart...
Llama 3.1 Nemotron 70B Instruct
Other models

Detailed Benchmark Scores

Intelligence
13.4 (22th pct)
Coding
10.8 (20th pct)
Math
11.0 (10th pct)
MMLU Pro
69.0 (29th pct)
GPQA
46.5 (24th pct)
LiveCodeBench
16.9 (10th pct)
AIME
24.7 (41th pct)
Aider
54.9 (37th pct)
Benchmark data from Artificial Analysis and HuggingFace Open LLM Leaderboard

Try Llama 3.1 Nemotron 70B Instruct

Use our Calculator

Estimate your costs based on expected token usage.

Open Cost Calculator

Try in Playground

Test Llama 3.1 Nemotron 70B Instruct directly in your browser.

Open Playground

Frequently Asked Questions

Llama 3.1 Nemotron 70B Instruct costs $0.001200 per 1,000 input tokens and $0.001200 per 1,000 output tokens.
Llama 3.1 Nemotron 70B Instruct is available from 1 provider(s). DeepInfra offers the best price.
We compare Llama 3.1 Nemotron 70B Instruct with 10 similarly-priced models. See the benchmark scatter plot above to compare quality vs cost.

All Nvidia Models

See pricing for all Nvidia models.

View all Nvidia models →