Price Per TokenPrice Per Token
Deepseek
Deepseek
vs
Microsoft
Microsoft

DeepSeek V3.1 Terminus vs WizardLM-2 8x22B

A detailed comparison of pricing, benchmarks, and capabilities

108 out of our 483 tracked models have had a price change in March.

Get our weekly newsletter on pricing changes, new releases, and tools.

Key Takeaways

DeepSeek V3.1 Terminus wins:

  • Cheaper input tokens
  • Larger context window
  • Faster response time
  • Better at coding
  • Better at math
  • Has reasoning mode

WizardLM-2 8x22B wins:

  • Cheaper output tokens
  • Higher intelligence benchmark
Price Advantage
DeepSeek V3.1 Terminus
Benchmark Advantage
DeepSeek V3.1 Terminus
Context Window
DeepSeek V3.1 Terminus
Speed
DeepSeek V3.1 Terminus

Pricing Comparison

Price Comparison

MetricDeepSeek V3.1 TerminusWizardLM-2 8x22BWinner
Input (per 1M tokens)$0.21$0.62 DeepSeek V3.1 Terminus
Output (per 1M tokens)$0.79$0.62 WizardLM-2 8x22B
Cache Read (per 1M)$0.12N/A DeepSeek V3.1 Terminus
Using a 3:1 input/output ratio, DeepSeek V3.1 Terminus is 43% cheaper overall.

DeepSeek V3.1 Terminus Providers

No provider data available

WizardLM-2 8x22B Providers

No provider data available

Benchmark Comparison

8
Benchmarks Compared
2
DeepSeek V3.1 Terminus Wins
1
WizardLM-2 8x22B Wins

Benchmark Scores

BenchmarkDeepSeek V3.1 TerminusWizardLM-2 8x22BWinner
Intelligence Index
Overall intelligence score
28.533.1
Coding Index
Code generation & understanding
31.9--
Math Index
Mathematical reasoning
53.7--
MMLU Pro
Academic knowledge
83.640.0
GPQA
Graduate-level science
75.117.6
LiveCodeBench
Competitive programming
52.9--
Aider
Real-world code editing
-44.4-
BBH
Big-Bench Hard
-48.6-
DeepSeek V3.1 Terminus significantly outperforms in coding benchmarks.

Cost vs Quality

X-axis:
Y-axis:
Loading chart...
DeepSeek V3.1 Terminus
Other models

Context & Performance

Context Window

DeepSeek V3.1 Terminus
163,840
tokens
WizardLM-2 8x22B
65,535
tokens
DeepSeek V3.1 Terminus has a 60% larger context window.

Speed Performance

MetricDeepSeek V3.1 TerminusWizardLM-2 8x22BWinner
Tokens/second0.0 tok/sN/A
Time to First Token0.00sN/A

Capabilities

Feature Comparison

FeatureDeepSeek V3.1 TerminusWizardLM-2 8x22B
Vision (Image Input)
Tool/Function Calls
Reasoning Mode
Audio Input
Audio Output
PDF Input
Prompt Caching
Web Search

License & Release

PropertyDeepSeek V3.1 TerminusWizardLM-2 8x22B
LicenseOpen SourceOpen Source
AuthorDeepseekMicrosoft
ReleasedSep 2025Apr 2024

DeepSeek V3.1 Terminus Modalities

Input
text
Output
text

WizardLM-2 8x22B Modalities

Input
text
Output
text

Related Comparisons

Compare DeepSeek V3.1 Terminus with:

Compare WizardLM-2 8x22B with:

Frequently Asked Questions

DeepSeek V3.1 Terminus has cheaper input pricing at $0.21/M tokens. WizardLM-2 8x22B has cheaper output pricing at $0.62/M tokens.
DeepSeek V3.1 Terminus scores higher on coding benchmarks with a score of 31.9, compared to WizardLM-2 8x22B's score of N/A.
DeepSeek V3.1 Terminus has a 163,840 token context window, while WizardLM-2 8x22B has a 65,535 token context window.
DeepSeek V3.1 Terminus does not support vision. WizardLM-2 8x22B does not support vision.