Llama 3.1 Nemotron 70B Instruct API Pricing 2026
Compare pricing, benchmarks, and providers for Llama 3.1 Nemotron 70B Instruct. Find the best value for your use case.
Overview
Llama 3.1 Nemotron 70B Instruct was released on October 15, 2024. Pricing starts at $0.900 per million input tokens and $0.900 per million output tokens. The model supports a context window of up to 131K tokens. API access is available through Nvidia.
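As a rough illustration of how the per-million-token prices above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the constants are hypothetical names chosen for this example, and the prices are simply the ones listed above. Actual billing may differ by provider.

```python
from decimal import Decimal

# Prices taken from the overview above, in USD per one million tokens.
INPUT_PRICE_PER_M = Decimal("0.900")
OUTPUT_PRICE_PER_M = Decimal("0.900")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper, not a provider API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Each side is billed proportionally to its share of one million tokens.
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: 2,000 input tokens and 500 output tokens.
    print(estimate_cost(2_000, 500))
```

Under these prices, a request with 2,000 input tokens and 500 output tokens works out to $0.00225. `Decimal` is used instead of floats here only to keep small currency amounts exact.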
Pricing Comparison
No similar models are currently listed for price comparison with Llama 3.1 Nemotron 70B Instruct.
Current Pricing (per 1M tokens)
1 model
| Provider | Model | Input $/M | Output $/M | Coding | MMLU | GPQA | Context (tokens) |
|---|---|---|---|---|---|---|---|
| Nvidia | Llama 3.1 Nemotron 70B Instruct | $0.900 | $0.900 | 16.9 | 69.0 | 46.5 | 131,072 |
* Some models use tiered pricing based on prompt length. Displayed prices are for prompts ≤ 200k tokens.
Compare Providers
No multi-provider data is available for Llama 3.1 Nemotron 70B Instruct. The model may only be available from a single provider.
Cost vs. Quality
Compare Llama 3.1 Nemotron 70B Instruct's benchmark performance against all models.
Detailed Benchmark Scores
Try Llama 3.1 Nemotron 70B Instruct
Use our Calculator
Estimate your costs based on expected token usage.
Try in Playground
Test Llama 3.1 Nemotron 70B Instruct directly in your browser.
Compare Llama 3.1 Nemotron 70B Instruct
Similar Price Range
Same Provider
Popular Competitors
Frequently Asked Questions