Price Per TokenPrice Per Token
Nvidia
Nvidia
vs
Sao10k

Nemotron Nano 9B V2 vs Llama 3 8B Lunaris

A detailed comparison of pricing, benchmarks, and capabilities

Get our weekly newsletter on pricing changes, new releases, and tools.

Join the Price Per Token Community

Key Takeaways

Nemotron Nano 9B V2 wins:

  • Larger context window
  • Faster response time
  • Better at coding
  • Better at math
  • Supports tool calls

Llama 3 8B Lunaris wins:

  • Cheaper output tokens
  • Higher intelligence benchmark
Price Advantage
Llama 3 8B Lunaris
Benchmark Advantage
Nemotron Nano 9B V2
Context Window
Nemotron Nano 9B V2
Speed
Nemotron Nano 9B V2

Pricing Comparison

Benchmark Comparison

Context & Performance

Capabilities

Feature Comparison

FeatureNemotron Nano 9B V2Llama 3 8B Lunaris
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyNemotron Nano 9B V2Llama 3 8B Lunaris
LicenseOpen SourceOpen Source
AuthorNvidiaSao10k
ReleasedSep 2025Aug 2024

Nemotron Nano 9B V2 Modalities

Input
text
Output
text

Llama 3 8B Lunaris Modalities

Input
text
Output
text

Frequently Asked Questions

Nemotron Nano 9B V2 has cheaper input pricing at $0.04/M tokens. Llama 3 8B Lunaris has cheaper output pricing at $0.05/M tokens.
Nemotron Nano 9B V2 has a 131,072 token context window, while Llama 3 8B Lunaris has a 8,192 token context window.
Nemotron Nano 9B V2 does not support vision. Llama 3 8B Lunaris does not support vision.