Price Per Token
Meta-llama vs Xai

Llama 3.2 90B Vision Instruct vs Grok 4

A detailed comparison of pricing, benchmarks, and capabilities

Key Takeaways

Llama 3.2 90B Vision Instruct wins:

  • Cheaper input tokens
  • Cheaper output tokens

Grok 4 wins:

  • Larger context window
  • Faster response time
  • Higher intelligence benchmark
  • Better at coding
  • Better at math
  • Has reasoning mode

Price Advantage: Llama 3.2 90B Vision Instruct
Benchmark Advantage: Grok 4
Context Window: Grok 4
Speed: Grok 4

Pricing Comparison

Benchmark Comparison

Context & Performance

Capabilities

Feature Comparison

Feature | Llama 3.2 90B Vision Instruct | Grok 4
Vision (Image Input) | Yes | Yes
Tool/Function Calls
Reasoning Mode | No | Yes
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

Property | Llama 3.2 90B Vision Instruct | Grok 4
License | Open Source | Proprietary
Author | Meta-llama | Xai
Released | Unknown | Jul 2025

Llama 3.2 90B Vision Instruct Modalities

Input
Output

Grok 4 Modalities

Input
image, text
Output
text

Frequently Asked Questions

Llama 3.2 90B Vision Instruct is cheaper for both input and output tokens, at $0.90/M tokens each.
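To make the per-million-token pricing concrete, here is a minimal Python sketch of how a single request's cost could be estimated. The $0.90/M figures come from the comparison above; the function name `request_cost` and the example token counts are hypothetical, chosen only for illustration. Grok 4's prices are not stated in this section, so they are left out.

```python
# Prices in USD per one million tokens, taken from the comparison above.
LLAMA_INPUT_PRICE_PER_M = 0.90
LLAMA_OUTPUT_PRICE_PER_M = 0.90


def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Return the USD cost of one request given per-million-token prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
cost = request_cost(10_000, 2_000,
                    LLAMA_INPUT_PRICE_PER_M, LLAMA_OUTPUT_PRICE_PER_M)
print(f"${cost:.4f}")  # prints "$0.0108"
```

The same function can be reused for any other model by passing that model's input and output prices.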
Llama 3.2 90B Vision Instruct has a 128,000 token context window, while Grok 4 has a 256,000 token context window.
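As a rough illustration of what the context window difference means in practice, the sketch below checks which model could hold a given request. The window sizes are the ones quoted above; the helper names `fits_in_context` and `models_that_fit` are hypothetical, and the check assumes that prompt tokens and reserved output tokens both count against the window, which may differ by provider.

```python
# Context window sizes in tokens, as quoted in the comparison above.
CONTEXT_WINDOWS = {
    "Llama 3.2 90B Vision Instruct": 128_000,
    "Grok 4": 256_000,
}


def fits_in_context(model: str, prompt_tokens: int,
                    max_output_tokens: int = 0) -> bool:
    """Return True if prompt plus reserved output fits in the model's window.

    Assumption: output tokens share the same window as the prompt.
    """
    return prompt_tokens + max_output_tokens <= CONTEXT_WINDOWS[model]


def models_that_fit(prompt_tokens: int, max_output_tokens: int = 0) -> list[str]:
    """List the models whose context window can hold the request."""
    return [model for model in CONTEXT_WINDOWS
            if fits_in_context(model, prompt_tokens, max_output_tokens)]


# Hypothetical request: a 150,000-token prompt with 4,000 tokens reserved for output.
print(models_that_fit(150_000, 4_000))  # prints ['Grok 4']
```

Token counts depend on each model's tokenizer, so the same text may produce different counts for the two models.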
Both Llama 3.2 90B Vision Instruct and Grok 4 support vision (image input).