DeepSeek R1 (0528) vs DeepSeek V4-Flash

Best cell per row highlighted. Null means undisclosed — never counted as zero.

DeepSeek R1 (0528) · DeepSeek · GA · AI Panel 8.0
DeepSeek V4-Flash · DeepSeek · preview · AI Panel 8.7

Identity & lifecycle

Provider: DeepSeek | DeepSeek
Family / tier: DeepSeek R1 · Reasoning | DeepSeek V4 · Flash
Status: GA | preview
Released: 2025-05-27 | 2026-04-23
Knowledge cutoff: 2025-04 | 2026-02

Architecture & context

Context window: 128K | 1M
Max output tokens: 64K | 384K
Input modalities: text | text
Reasoning mode: always | optional
Open weights: Yes | Yes

Pricing (per Mtok)

Input: $0.55 | $0.14
Output: $2.19 | $0.28
Cached input: $0.14 | $0.0028
Batch input
Free tier: Yes | Yes
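
As a rough illustration of what the per-Mtok prices above mean for a single workload, the following sketch estimates cost from token counts. The function name `estimate_cost` and the `PRICES` dictionary are hypothetical, and the sketch assumes cached input tokens are billed at the cached rate instead of the regular input rate; the actual billing rules may differ.

```python
# Prices in USD per million tokens, copied from the pricing rows above.
PRICES = {
    "DeepSeek R1 (0528)": {"input": 0.55, "output": 2.19, "cached_input": 0.14},
    "DeepSeek V4-Flash": {"input": 0.14, "output": 0.28, "cached_input": 0.0028},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int,
                  cached_tokens: int = 0) -> float:
    """Estimate the USD cost of a workload (hypothetical helper).

    Assumption: cached tokens are a subset of input tokens and are
    billed at the cached-input rate rather than the regular input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    price = PRICES[model]
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * price["input"]
        + cached_tokens * price["cached_input"]
        + output_tokens * price["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    for name in PRICES:
        cost = estimate_cost(name, input_tokens=1_000_000, output_tokens=1_000_000)
        print(f"{name}: ${cost:.2f}")
```

For one million uncached input tokens plus one million output tokens, this works out to roughly $2.74 for R1 (0528) and $0.42 for V4-Flash.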

Speed

Output speed (tok/s)
Time to first token (s)
Latency tier: slow | fast

Trust & deployment

Trains on inputs: Yes | Yes
License: MIT | MIT
Clouds
Compliance

AI Panel scoring

Unified score: 8 | 8.7
Decision Maker: 8 | 8.5
Domain Strategist: 8.5 | 8.5
Finance Lead: 9 | 9.7
Domain Practitioner: 8.5 | 8.5
Power User: 8 | 8
Skeptic: 7.5 | 7.5
Value score: 9.2 | 9.9

Benchmarks

MMLU: 93.4
MMLU-Pro: 85 | 86.2
GPQA Diamond: 81 | 88.1
AIME 2025: 87.5
SWE-bench Verified: 57.6 | 79
LiveCodeBench: 73.3 | 91.6
Aider Polyglot: 71.6
TAU-bench: 63.9
MRCR Long Context: 78.7
SimpleQA: 27.8 | 34.1
Humanity's Last Exam: 17.7 | 34.8
LMArena Elo: 1382
Artificial Analysis Index: 68

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.
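
The highlighting rule stated at the top (best cell per row, with null meaning undisclosed and never counted as zero) can be sketched as follows. This is a minimal illustration, not the page's actual implementation: the function name `best_cell` and the `higher_is_better` flag are hypothetical, and ties are assumed to resolve to the first column.

```python
from typing import Optional, Sequence


def best_cell(values: Sequence[Optional[float]],
              higher_is_better: bool = True) -> Optional[int]:
    """Return the column index of the best disclosed value in a row.

    None marks an undisclosed cell. It is skipped entirely rather than
    treated as 0, so an undisclosed cell can never win or lose a row.
    Returns None when no cell in the row is disclosed.
    """
    disclosed = [(i, v) for i, v in enumerate(values) if v is not None]
    if not disclosed:
        return None
    pick = max if higher_is_better else min
    # max/min return the first of several equal items, so ties go to
    # the earliest column (an assumption of this sketch).
    return pick(disclosed, key=lambda pair: pair[1])[0]


if __name__ == "__main__":
    # GPQA Diamond row from the benchmarks above: higher is better.
    print(best_cell([81, 88.1]))
    # Input price row: lower is better.
    print(best_cell([0.55, 0.14], higher_is_better=False))
    # Hypothetical row with one undisclosed cell and a genuine zero.
    print(best_cell([0.0, None]))
```

Keeping `None` distinct from `0.0` is the point of the design: a genuine zero is a disclosed result and can be compared, while an undisclosed cell is simply left out of the comparison.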