Price Per TokenPrice Per Token
Nvidia

Llama 3.1 Nemotron Ultra 253B v1 API Pricing 2026

Compare pricing, benchmarks, and providers for Llama 3.1 Nemotron Ultra 253B v1. Find the best value for your use case.

OpenClaw

Best LLMs for OpenClaw Vote for which model works best with OpenClaw

112 out of our 301 tracked models have had a price change in February.

Get our weekly newsletter on pricing changes, new releases, and tools.

Last updated: February 15, 2026 at 08:07 AM

Overview

Llama 3.1 Nemotron Ultra 253B v1 was released on April 8, 2025. Pricing starts at $0.600 per million input tokens and $1.80 per million output tokens. The model supports a context window of up to 131K tokens. API access is available through Nvidia.

Context Window
131K
tokens
Pricing
Input$0.600
Output$1.80
per 1M tokens
Speed
Output-
TTFT-
median latency
Capabilities
Tools

Pricing Comparison

Compare Llama 3.1 Nemotron Ultra 253B v1 with 10 similar models by price.

Current Pricing (per 1M tokens)

11 models

Provider
Model
Input $/M
Output $/M
Coding
MMLU
GPQA
Context
Actions
$0.600
$1.800
131,072
Try
$0.500
$3.000
79.7
88.2
81.2
1,048,576
Try
$0.500
$1.500
262,144
Try
$0.500
$2.400
55.6
82.4
76.6
131,072
Try
$0.500
$1.500
46.2
29.7
16,385
Try
$0.510
$0.740
19.8
57.4
37.9
8,192
Try
$0.540
$0.540
6.6
38.7
29.2
32,768
Try
$0.550
$0.800
32,768
Try
$0.600
$2.400
128,000
Try
$0.600
$6.000
1,040,000
Try
$0.600
$1.800
35.2
75.1
57.3
65,536
Try

* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.

Compare Providers

Llama 3.1 Nemotron Ultra 253B v1 is available from multiple providers with different pricing and availability.

1
Providers
$0.600
Cheapest Input/M
$1.800
Cheapest Output/M
Nebius
Best Price

Provider Pricing

1 models

Provider
Input $/M
Output $/M
Uptime
N
Nebius
Best Price
$0.600
$1.800
100.0%
Provider pricing data sourced from OpenRouter and Helicone

Community Rankings

How does Llama 3.1 Nemotron Ultra 253B v1 perform? Vote based on your experience.

Cost vs. Quality

Compare Llama 3.1 Nemotron Ultra 253B v1's benchmark performance against all models.

X-axis:
Y-axis:
Loading chart...
Llama 3.1 Nemotron Ultra 253B v1 (Thinking)
Other models

Try Llama 3.1 Nemotron Ultra 253B v1

Use our Calculator

Estimate your costs based on expected token usage.

Open Cost Calculator

Try in Playground

Test Llama 3.1 Nemotron Ultra 253B v1 directly in your browser.

Open Playground

Frequently Asked Questions

Llama 3.1 Nemotron Ultra 253B v1 costs $0.000600 per 1,000 input tokens and $0.001800 per 1,000 output tokens.
Llama 3.1 Nemotron Ultra 253B v1 is available from 1 provider(s). Nebius offers the best price.
We compare Llama 3.1 Nemotron Ultra 253B v1 with 10 similarly-priced models. See the benchmark scatter plot above to compare quality vs cost.

All Nvidia Models

See pricing for all Nvidia models.

View all Nvidia models →