Llama 3.1 Nemotron 70B Instruct API Pricing 2026

Compare pricing, benchmarks, and providers for Llama 3.1 Nemotron 70B Instruct. Find the best value for your use case.

Get our weekly newsletter on pricing changes, new releases, and tools.

No bots, no big tech influence — join the new community for AI devs

8 Ways to Use Fewer Tokens

Last updated: June 9, 2026 at 08:22 AM

Overview

Llama 3.1 Nemotron 70B Instruct was released on October 15, 2024. Pricing starts at $0.900 per million input tokens and $0.900 per million output tokens. The model supports a context window of up to 131K tokens. API access is available through Nvidia.

Context Window

131K

tokens

Pricing

Input$0.900

Output$0.900

Cached$0.450

per 1M tokens

Speed

Output277 tok/s

TTFT4.82s

median latency

Capabilities

Tools

Pricing Comparison

Compare Llama 3.1 Nemotron 70B Instruct with 0 similar models by price.

Current Pricing (per 1M tokens)

1 models

Provider	Model	Input $/M	Output $/M	Coding	MMLU	GPQA	Context	Actions
M Nvidia	Llama 3.1 Nemotron 70B Instruct	$0.900	$0.900	16.9	69.0	46.5	131,072	Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

Llama 3.1 Nemotron 70B Instruct is available from multiple providers with different pricing and availability.

No multi-provider data available for this model

This model may only be available from a single provider

Community Rankings

How does Llama 3.1 Nemotron 70B Instruct perform? Vote based on your experience.

Cost vs. Quality

Compare Llama 3.1 Nemotron 70B Instruct's benchmark performance against all models.

X-axis:

Y-axis:

Loading chart...

Llama 3.1 Nemotron 70B Instruct

Other models

Detailed Benchmark Scores

Intelligence

13.4 (25th pct)

Coding

10.8 (22th pct)

Math

11.0 (11th pct)

MMLU Pro

69.0 (36th pct)

GPQA

46.5 (28th pct)

LiveCodeBench

16.9 (15th pct)

AIME

24.7 (45th pct)

Aider

54.9 (37th pct)

Benchmark data from Artificial Analysis and HuggingFace Open LLM Leaderboard

Try Llama 3.1 Nemotron 70B Instruct

Use our Calculator

Estimate your costs based on expected token usage.

Open Cost Calculator

Try in Playground

Test Llama 3.1 Nemotron 70B Instruct directly in your browser.

Open Playground

Compare Llama 3.1 Nemotron 70B Instruct

Frequently Asked Questions

All Nvidia Models

See pricing for all Nvidia models.

View all Nvidia models →

Llama 3.1 Nemotron 70B Instruct API Pricing 2026

Overview

Pricing Comparison

Current Pricing (per 1M tokens)

Compare Providers

Community Rankings

Cost vs. Quality

Detailed Benchmark Scores

Try Llama 3.1 Nemotron 70B Instruct

Use our Calculator

Try in Playground

Compare Llama 3.1 Nemotron 70B Instruct

Similar Price Range

Same Provider

Popular Competitors

Frequently Asked Questions

All Nvidia Models

Tools

Directories

Models & Pricing

Endpoints

Rankings

News