Price Per TokenPrice Per Token
Nvidia

Nvidia API Pricing (Updated 2026)

Compare pricing for all 9 Nvidia models. Prices shown per 1M tokens with historical trends.

Get our weekly newsletter on pricing changes, new releases, and tools.

OpenClaw

Deploy OpenClaw in Under 1 Minute We handle hosting, scaling, and maintenance

8 Ways to Use Fewer Tokens
Last updated: April 2, 2026 at 08:38 AM

Overview

9
Total Models
$0.04
Cheapest Input (per 1M)
262K
Max Context Length

Model Release Timeline

9 releases
Oct 2024
$0.90 / $0.90
Sep 2025
$0.04 / $0.16
Sep 2025
$0.04 / $0.16
Oct 2025
$0.10 / $0.40
Oct 2025
$0.10 / $0.40
Oct 2025
$0.20 / $0.20
Oct 2025
$0.20 / $0.20
Dec 2025
$0.05 / $0.20
Dec 2025
$0.05 / $0.20
Spacing represents time between releases
Oct 2024Dec 2025

All Nvidia Models

Compare pricing for all 9 Nvidia models.

Model
Context
Input
Output
Coding
MMLU
GPQA
Cache Read
Cache Write
Actions
131K
$0.040
$0.160
70.1
73.9
55.7
$0.1
-
Try
131K
$0.040
$0.160
72.4
74.2
57.0
$0.1
-
Try
262K
$0.050
$0.200
36.0
57.9
39.9
$0.45
-
Try
262K
$0.050
$0.200
74.1
79.4
75.7
$0.45
-
Try
-
$0.100
$0.500
$0.1
-
Try
-
$0.100
$0.500
80.0
-
-
Try
131K
$0.100
$0.400
29.0
69.2
48.1
-
-
Try
131K
$0.100
$0.400
73.7
81.4
74.8
-
-
Try
131K
$0.200
$0.200
34.5
64.9
43.9
$0.1
-
Try
131K
$0.200
$0.200
69.4
75.9
57.2
$0.1
-
Try
128K
$0.200
$0.200
$0.1
-
Try
128K
$0.600
$1.800
-
-
Try
128K
$0.600
$1.800
64.1
82.5
72.8
-
-
Try
131K
$0.900
$0.900
16.9
69.0
46.5
$0.45
-
Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts under 200k tokens.

Showing lowest price across all providers. Benchmarks from Artificial Analysis and HuggingFace Open LLM Leaderboard.

Cost vs. Quality

Compare Nvidia models by benchmark performance and cost.

X-axis:
Y-axis:
Loading chart...
Nvidia Models

Community Ratings

Rate how well Nvidia models perform across popular AI coding tools.

Historical Pricing Trends

Price Changes Over Time

No historical pricing data available for this provider.

Try Nvidia Models

Cost Calculator

Estimate your costs for any Nvidia model based on expected token usage.

Open Cost Calculator

Try in Playground

Test Nvidia models directly in your browser.

Open Playground

Frequently Asked Questions

Currently, Nemotron Nano 9B V2 is the most affordable at $0.04 per 1M input tokens.
Count your input tokens (prompt) and expected output tokens (response). Use the formula: (input_tokens/1M × input_price) + (output_tokens/1M × output_price). Our pricing calculator can help automate this.
Nvidia pricing varies by model. Input costs range from $0.04 to $0.90 per 1M tokens. See the table above for specific model pricing.