QwQ-32B vs Magistral Medium 1.2 vs Grok 4.20

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Alibaba Cloud
QwQ-32BGA
6.8
AI Panel
Mistral AI
Magistral Medium 1.2GA
7.2
AI Panel
xAI
Grok 4.20GA
6.9
AI Panel

Identity & lifecycle

ProviderAlibaba CloudMistral AIxAI
Family / tierQwQ · ReasoningMagistral · ReasoningGrok 4 · Reasoning
StatusGAGAGA
Released2025-03-042025-09-172026-03-09
Knowledge cutoff2024-092025-062024-11

Architecture & context

Context window131K131K1M
Max output tokens33K41K
Input modalitiestexttext, imagetext, image
Reasoning modealwaysalwaysoptional
Open weightsYesNoNo

Pricing (per Mtok)

Input$0.12$2$1.25
Output$0.18$5$2.5
Cached input$0.2
Batch input$1
Free tierYesYesYes

Speed

Output speed (tok/s)38.9171.4
Time to first token (s)1.713.24
Latency tierslowslowmedium

Trust & deployment

Trains on inputsNoNoYes
LicenseApache-2.0ProprietaryProprietary
CloudsGCPAzure AI Foundry
ComplianceSOC2, ISO27001, GDPR

AI Panel scoring

Unified score6.87.26.9
Decision Maker777
Domain Strategist76.56.5
Finance Lead776.5
Domain Practitioner7.57.57.5
Power User67.56.5
Skeptic6.56.56
Value score877

Benchmarks

GPQA Diamond65.276.2678.5
AIME 202583.48
MATH-50090.687.3
LiveCodeBench63.4
MMMU70
IFEval83.981
TAU-bench93
LMArena Elo1491
Artificial Analysis Index2749

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.