Llama 3.1 Nemotron Ultra 253B v1 API Pricing 2026
Compare pricing, benchmarks, and providers for Llama 3.1 Nemotron Ultra 253B v1. Find the best value for your use case.
Get our weekly newsletter on pricing changes, new releases, and tools.

Deploy OpenClaw in Under 1 Minute— We handle hosting, scaling, and maintenance
Overview
Llama 3.1 Nemotron Ultra 253B v1. Pricing starts at $0.600 per million input tokens and $1.80 per million output tokens. The model supports a context window of up to 128K tokens. API access is available through Nvidia.
Pricing Comparison
Compare Llama 3.1 Nemotron Ultra 253B v1 with 0 similar models by price.
Current Pricing (per 1M tokens)
1 models
Provider | Model | Input $/M | Output $/M | Coding | MMLU | GPQA | Context | Actions |
|---|---|---|---|---|---|---|---|---|
$0.600 | $1.800 | — | — | — | 128,000 |
* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.
Compare Providers
Llama 3.1 Nemotron Ultra 253B v1 is available from multiple providers with different pricing and availability.
No multi-provider data available for this model
This model may only be available from a single provider
Community Rankings
How does Llama 3.1 Nemotron Ultra 253B v1 perform? Vote based on your experience.
Cost vs. Quality
Compare Llama 3.1 Nemotron Ultra 253B v1's benchmark performance against all models.
Try Llama 3.1 Nemotron Ultra 253B v1
Use our Calculator
Estimate your costs based on expected token usage.
Open Cost CalculatorTry in Playground
Test Llama 3.1 Nemotron Ultra 253B v1 directly in your browser.
Open PlaygroundCompare Llama 3.1 Nemotron Ultra 253B v1
Similar Price Range
Same Provider
Popular Competitors
Frequently Asked Questions
All Nvidia Models
See pricing for all Nvidia models.
View all Nvidia models →