Nvidia API Pricing (Updated 2026)

Compare pricing for all 9 Nvidia models. Prices shown per 1M tokens with historical trends.

Get our weekly newsletter on pricing changes, new releases, and tools.

No bots, no big tech influence — join the new community for AI devs

8 Ways to Use Fewer Tokens

Last updated: May 13, 2026 at 08:24 AM

View latest Nvidia AI news

Overview

Total Models

$0.04

Cheapest Input (per 1M)

262K

Max Context Length

Model Release Timeline

9 releases

Oct 2024

$0.90 / $0.90

Sep 2025

$0.04 / $0.16

Sep 2025

$0.04 / $0.16

Oct 2025

$0.10 / $0.40

Oct 2025

$0.10 / $0.40

Oct 2025

$0.20 / $0.20

Oct 2025

$0.20 / $0.20

Dec 2025

$0.05 / $0.20

Dec 2025

$0.05 / $0.20

Spacing represents time between releases

Oct 2024→Dec 2025

All Nvidia Models

Compare pricing for all 9 Nvidia models.

Model	Context	Input	Output	Coding	MMLU	GPQA	Cache Read	Cache Write	Actions
Llama 3.1 Nemotron Ultra 253B v1	128K	$0.000	$0.000	—	—	—	-	-	Try
Llama 3.1 Nemotron Ultra 253B v1 Thinking	128K	$0.000	$0.000	64.1	82.5	72.8	-	-	Try
Nemotron Nano 9B V2	131K	$0.040	$0.160	70.1	73.9	55.7	$0.1	-	Try
Nemotron Nano 9B V2 Thinking	131K	$0.040	$0.160	72.4	74.2	57.0	$0.1	-	Try
Nemotron 3 Nano 30B A3B	262K	$0.050	$0.200	36.0	57.9	39.9	$0.45	-	Try
Nemotron 3 Nano 30B A3B Thinking	262K	$0.050	$0.200	74.1	79.4	75.7	$0.45	-	Try
Nemotron 3 Super 120B A12B	-	$0.100	$0.500	—	—	—	$0.45	-	Try
Nemotron-3 Super 120B A12B	-	$0.100	$0.500	—	—	80.0	-	-	Try
Llama 3.3 Nemotron Super 49B V1.5	131K	$0.100	$0.400	28.0	69.8	51.7	-	-	Try
Llama 3.3 Nemotron Super 49B V1.5 Thinking	131K	$0.100	$0.400	27.7	78.5	64.3	-	-	Try
Nemotron Nano 12B 2 VL	131K	$0.200	$0.200	34.5	64.9	43.9	$0.1	-	Try
Nemotron Nano 12B 2 VL Thinking	131K	$0.200	$0.200	69.4	75.9	57.2	$0.1	-	Try
Nemotron Nano 12B V2	128K	$0.200	$0.200	—	—	—	$0.1	-	Try
Llama 3.1 Nemotron 70B Instruct	131K	$0.900	$0.900	16.9	69.0	46.5	$0.45	-	Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts under 200k tokens.

Showing lowest price across all providers. Benchmarks from Artificial Analysis and HuggingFace Open LLM Leaderboard.

Cost vs. Quality

Compare Nvidia models by benchmark performance and cost.

X-axis:

Y-axis:

Loading chart...

Nvidia Models

Community Ratings

Rate how well Nvidia models perform across popular AI coding tools.

Historical Pricing Trends

Price Changes Over Time

Show % Change

No historical pricing data available for this provider.

Try Nvidia Models

Cost Calculator

Estimate your costs for any Nvidia model based on expected token usage.

Open Cost Calculator

Try in Playground

Test Nvidia models directly in your browser.

Open Playground

Nvidia API Pricing (Updated 2026)

Overview

Model Release Timeline

All Nvidia Models

Cost vs. Quality

Community Ratings

Historical Pricing Trends

Try Nvidia Models

Cost Calculator

Try in Playground

Frequently Asked Questions

Tools

Directories

Models & Pricing

Endpoints

Rankings

News