BrianOnAI logoBrianOnAI

DeepSeek: R1 Distill Llama 70B

byDeepSeek

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across multiple benchmarks, including: - AIME 2024 pass@1: 70.0 - MATH-500 pass@1: 94.5 - CodeForces Rating: 1633 The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.

Pricing

Input
$0.03 / 1M tokens
Output
$0.11 / 1M tokens

Specifications

Context Window131K tokens
Max Output131K tokens
Modalitytext
Input Typestext
Output Typestext

Strategic Analysis 🔒

Unlock vCAIO insights to make better model decisions:

  • Governance Risk Rating (Low / Medium / High)
  • Quality Tier Classification
  • Best Use Cases & Tags
  • Strategic Verdict from vCAIO
  • AI-Verified Fit Scoring

Not sure if this model fits your use case?

Describe your task and get AI-verified recommendations in seconds.

Try Model Advisor

Pricing last updated: Invalid Date