Qwen: Qwen3 8B
byQwen
Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.
Pricing
Input
$0.03 / 1M tokens
Output
$0.11 / 1M tokens
Specifications
Context Window128K tokens
Max Output20K tokens
Modalitytext
Input Typestext
Output Typestext
Strategic Analysis 🔒
Unlock vCAIO insights to make better model decisions:
- Governance Risk Rating (Low / Medium / High)
- Quality Tier Classification
- Best Use Cases & Tags
- Strategic Verdict from vCAIO
- AI-Verified Fit Scoring
Not sure if this model fits your use case?
Describe your task and get AI-verified recommendations in seconds.
Other Qwen Models
Pricing last updated: Invalid Date