
Cerebras Free Tier
Cerebras offers a generous free tier of up to 1M tokens per day, with no credit card required. It is known for some of the fastest inference speeds available.
Free Tier Details
| Item | Value |
|---|---|
| Free Tier | Up to 1M tokens/day |
| Context Length (Free) | 8,192 tokens (up to 128K on request) |
| Models Available | Llama 3.3 70B, Llama 4 Scout, DeepSeek R1 |
| Credit Card Required | No |
Cerebras runs inference on custom wafer-scale chips, delivering a claimed 20x speedup over GPU-based providers. The free tier is ideal for prototyping and weekend projects.
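To stay inside the free-tier limits listed above, a prototype can track its own usage before sending each request. The following is a minimal sketch, not an official client: the `DailyBudget` class and its method names are hypothetical, and it assumes that prompt and output tokens both count toward the 8,192-token context and the 1M daily allowance, which this page does not confirm.

```python
from dataclasses import dataclass

# Limits taken from the free-tier details above.
DAILY_TOKEN_LIMIT = 1_000_000  # "up to 1M tokens/day"
FREE_CONTEXT_TOKENS = 8_192    # default free-tier context length


@dataclass
class DailyBudget:
    """Hypothetical helper that tracks tokens used so far today."""

    used: int = 0

    def can_send(self, prompt_tokens: int, max_output_tokens: int) -> bool:
        # Assumption: prompt and output tokens share the context window.
        total = prompt_tokens + max_output_tokens
        if total > FREE_CONTEXT_TOKENS:
            return False
        # Assumption: both input and output count toward the daily limit.
        return self.used + total <= DAILY_TOKEN_LIMIT

    def record(self, tokens_used: int) -> None:
        # Call this with the token count reported for a finished request.
        self.used += tokens_used


budget = DailyBudget()
if budget.can_send(prompt_tokens=6_000, max_output_tokens=2_000):
    # ... send the request here, then record what it actually consumed ...
    budget.record(8_000)
```

Resetting `used` to zero at the start of each day is left to the caller, since the page does not say when the daily window rolls over.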
Cheapest Paid Models on Cerebras
When you need more than the free tier, these are the most affordable options.
| Model | Context (tokens) | Input price / 1M tokens | Output price / 1M tokens |
|---|---|---|---|
| Llama 3.1 8B Instruct | 16K | $0.10 | $0.10 |
| Qwen3 235B A22B Instruct 2507 | 262K | $0.60 | $1.20 |
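Per-million-token prices convert into a bill with simple arithmetic: multiply each token count by its price and divide by one million. The sketch below hard-codes the two rows from the table above. The `estimate_cost` function and the dictionary name are illustrative, and the workload figures are made up for the example.

```python
# (input price, output price) in USD per 1M tokens, copied from the table above.
PRICES_PER_MILLION = {
    "Llama 3.1 8B Instruct": (0.10, 0.10),
    "Qwen3 235B A22B Instruct 2507": (0.60, 1.20),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one workload on the given model."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 2M input tokens and 500K output tokens.
for name in PRICES_PER_MILLION:
    cost = estimate_cost(name, 2_000_000, 500_000)
    print(f"{name}: ${cost:.2f}")
```

For this workload the smaller model comes to $0.25 and the larger one to $1.80, so output-heavy jobs widen the gap because the larger model charges more for output than for input.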
Built by @aellman
2026 68 Ventures, LLC. All rights reserved.