DeepSeek R1 (0528) vs DeepSeek V4-Flash vs DeepSeek V4-Pro

The best cell in each row is highlighted. Null means undisclosed and is never counted as zero.

DeepSeek R1 (0528) · GA · AI Panel 8.0
DeepSeek V4-Flash · preview · AI Panel 8.7
DeepSeek V4-Pro · preview · AI Panel 8.5

Identity & lifecycle

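| Attribute | DeepSeek R1 (0528) | DeepSeek V4-Flash | DeepSeek V4-Pro |
| --- | --- | --- | --- |
| Provider | DeepSeek | DeepSeek | DeepSeek |
| Family / tier | DeepSeek R1 · Reasoning | DeepSeek V4 · Flash | DeepSeek V4 · Pro |
| Status | GA | preview | preview |
| Released | 2025-05-27 | 2026-04-23 | 2026-04-23 |
| Knowledge cutoff | 2025-04 | 2026-02 | 2026-02 |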

Architecture & context

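| Attribute | DeepSeek R1 (0528) | DeepSeek V4-Flash | DeepSeek V4-Pro |
| --- | --- | --- | --- |
| Context window | 128K | 1M | 1M |
| Max output tokens | 64K | 384K | 384K |
| Input modalities | text | text | text |
| Reasoning mode | always | optional | optional |
| Open weights | Yes | Yes | Yes |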

Pricing (per Mtok)

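| Attribute | DeepSeek R1 (0528) | DeepSeek V4-Flash | DeepSeek V4-Pro |
| --- | --- | --- | --- |
| Input | $0.55 | $0.14 | $0.435 |
| Output | $2.19 | $0.28 | $0.87 |
| Cached input | $0.14 | $0.0028 | $0.0036 |
| Batch input | | | |
| Free tier | Yes | Yes | Yes |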
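
The per-Mtok prices above can be turned into a rough per-request estimate. The sketch below is illustrative only: the `PRICES` dictionary and the `estimate_cost` helper are hypothetical names, the figures are copied from the pricing rows, and it assumes that cached tokens are the portion of the input billed at the cached rate instead of the regular input rate. Actual billing rules may differ.

```python
# Prices in USD per million tokens, copied from the pricing rows above.
PRICES = {
    "DeepSeek R1 (0528)": {"input": 0.55, "output": 2.19, "cached_input": 0.14},
    "DeepSeek V4-Flash": {"input": 0.14, "output": 0.28, "cached_input": 0.0028},
    "DeepSeek V4-Pro": {"input": 0.435, "output": 0.87, "cached_input": 0.0036},
}


def estimate_cost(model, input_tokens, output_tokens, cached_tokens=0):
    """Estimate the USD cost of one request.

    Assumption: cached_tokens is the part of input_tokens served from cache,
    billed at the cached rate instead of the regular input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    price = PRICES[model]
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * price["input"]
        + cached_tokens * price["cached_input"]
        + output_tokens * price["output"]
    )
    return total / 1_000_000  # prices are quoted per million tokens


if __name__ == "__main__":
    # Example workload: 1M input tokens (none cached) and 200K output tokens.
    for name in PRICES:
        print(f"{name}: ${estimate_cost(name, 1_000_000, 200_000):.4f}")
```

Because output is priced well above input for all three models, output-heavy workloads are where the gap between them is largest.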

Speed

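| Attribute | DeepSeek R1 (0528) | DeepSeek V4-Flash | DeepSeek V4-Pro |
| --- | --- | --- | --- |
| Output speed (tok/s) | | | |
| Time to first token (s) | | | |
| Latency tier | slow | fast | medium |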

Trust & deployment

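| Attribute | DeepSeek R1 (0528) | DeepSeek V4-Flash | DeepSeek V4-Pro |
| --- | --- | --- | --- |
| Trains on inputs | Yes | Yes | Yes |
| License | MIT | MIT | MIT |
| Clouds | | | |
| Compliance | | | |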

AI Panel scoring

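| Metric | DeepSeek R1 (0528) | DeepSeek V4-Flash | DeepSeek V4-Pro |
| --- | --- | --- | --- |
| Unified score | 8 | 8.7 | 8.5 |
| Decision Maker | 8 | 8.5 | 8 |
| Domain Strategist | 8.5 | 8.5 | 8.5 |
| Finance Lead | 9 | 9.7 | 9.5 |
| Domain Practitioner | 8.5 | 8.5 | 8.5 |
| Power User | 8 | 8 | 7.5 |
| Skeptic | 7.5 | 7.5 | 7.5 |
| Value score | 9.2 | 9.9 | 9.7 |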

Benchmarks

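| Benchmark | DeepSeek R1 (0528) | DeepSeek V4-Flash | DeepSeek V4-Pro |
| --- | --- | --- | --- |
| MMLU-Pro | 85 | 86.2 | 87.5 |
| GPQA Diamond | 81 | 88.1 | 90.1 |
| SWE-bench Verified | 57.6 | 79 | 80.6 |
| LiveCodeBench | 73.3 | 91.6 | 93.5 |
| SimpleQA | 27.8 | 34.1 | 57.9 |
| Humanity's Last Exam | 17.7 | 34.8 | 37.7 |

Rows with fewer than three disclosed values (the flattened layout does not show which model each value belongs to):

MMLU: 93.4
AIME 2025: 87.5
HumanEval: 76.8
Aider Polyglot: 71.6
TAU-bench: 63.9
MRCR Long Context: 78.7, 83.5
LMArena Elo: 1382
LMArena Coding Elo: 1287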
Artificial Analysis Index: 68, 52

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.
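
The rule that a null is undisclosed and never counted as zero can be made concrete with a small helper that picks the best cell in a row. This is a minimal sketch, not the page's actual highlighting logic: `best_in_row` and `MODELS` are hypothetical names, `None` stands in for an undisclosed cell, and ties are resolved in favour of the first model listed.

```python
MODELS = ("DeepSeek R1 (0528)", "DeepSeek V4-Flash", "DeepSeek V4-Pro")


def best_in_row(values, higher_is_better=True):
    """Return the index of the best disclosed value, or None if all are null.

    None means "undisclosed": it is skipped entirely, never treated as 0.
    On ties, the earliest column wins.
    """
    disclosed = [(value, index) for index, value in enumerate(values) if value is not None]
    if not disclosed:
        return None
    pick = max if higher_is_better else min
    return pick(disclosed, key=lambda pair: pair[0])[1]


if __name__ == "__main__":
    rows = [
        ("MMLU-Pro", (85, 86.2, 87.5), True),          # benchmark: higher is better
        ("Input price", (0.55, 0.14, 0.435), False),   # price: lower is better
        ("Batch input", (None, None, None), False),    # fully undisclosed row
    ]
    for label, values, higher_is_better in rows:
        winner = best_in_row(values, higher_is_better)
        name = MODELS[winner] if winner is not None else "no disclosed value"
        print(f"{label}: {name}")
```

Skipping nulls matters most for lower-is-better rows such as prices, where treating a missing value as zero would wrongly make it the winner.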