DeepSeek R1 (0528) vs QwQ-32B vs Magistral Medium 1.2

Best cell per row highlighted. Null means undisclosed — never counted as zero.

DeepSeek
DeepSeek R1 (0528)GA
8.0
AI Panel
Alibaba Cloud
QwQ-32BGA
6.8
AI Panel
Mistral AI
Magistral Medium 1.2GA
7.2
AI Panel

Identity & lifecycle

ProviderDeepSeekAlibaba CloudMistral AI
Family / tierDeepSeek R1 · ReasoningQwQ · ReasoningMagistral · Reasoning
StatusGAGAGA
Released2025-05-272025-03-042025-09-17
Knowledge cutoff2025-042024-092025-06

Architecture & context

Context window128K131K131K
Max output tokens64K33K41K
Input modalitiestexttexttext, image
Reasoning modealwaysalwaysalways
Open weightsYesYesNo

Pricing (per Mtok)

Input$0.55$0.12$2
Output$2.19$0.18$5
Cached input$0.14
Batch input$1
Free tierYesYesYes

Speed

Output speed (tok/s)38.9
Time to first token (s)1.7
Latency tierslowslowslow

Trust & deployment

Trains on inputsYesNoNo
LicenseMITApache-2.0Proprietary
CloudsGCPAzure AI Foundry
ComplianceSOC2, ISO27001, GDPR

AI Panel scoring

Unified score86.87.2
Decision Maker877
Domain Strategist8.576.5
Finance Lead977
Domain Practitioner8.57.57.5
Power User867.5
Skeptic7.56.56.5
Value score9.287

Benchmarks

MMLU93.4
MMLU-Pro85
GPQA Diamond8165.276.26
AIME 202587.583.48
MATH-50090.6
SWE-bench Verified57.6
LiveCodeBench73.363.4
Aider Polyglot71.6
MMMU70
IFEval83.9
TAU-bench63.9
SimpleQA27.8
Humanity's Last Exam17.7
LMArena Elo1382
Artificial Analysis Index6827

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.