DeepSeek R1 (0528) vs DeepSeek V4-Flash vs DeepSeek V4-Pro

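The best cell in each row is highlighted. A dash (—) means the value is undisclosed; it is never counted as zero. In every table below, columns follow the order in the title: DeepSeek R1 (0528), DeepSeek V4-Flash, DeepSeek V4-Pro.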

DeepSeek · DeepSeek R1 (0528) · GA · AI Panel 8.0
DeepSeek · DeepSeek V4-Flash · preview · AI Panel 8.7
DeepSeek · DeepSeek V4-Pro · preview · AI Panel 8.5

Identity & lifecycle

Provider	DeepSeek	DeepSeek	DeepSeek
Family / tier	DeepSeek R1 · Reasoning	DeepSeek V4 · Flash	DeepSeek V4 · Pro
Status	GA	preview	preview
Released	2025-05-27	2026-04-23	2026-04-23
Knowledge cutoff	2025-04	2026-02	2026-02

Architecture & context

Context window	128K	1M	1M
Max output tokens	64K	384K	384K
Input modalities	text	text	text
Reasoning mode	always	optional	optional
Open weights	Yes	Yes	Yes

Pricing (USD per million tokens)

Input	$0.55	$0.14	$0.435
Output	$2.19	$0.28	$0.87
Cached input	$0.14	$0.0028	$0.0036
Batch input	—	—	—
Free tier	Yes	Yes	Yes
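
To make the per-million-token prices concrete, here is a minimal sketch that estimates the cost of a workload from the rates listed above. The function name `estimate_cost`, the `cached_fraction` parameter, and the example workload size are hypothetical. The sketch also assumes that cached input tokens are billed at the cached rate instead of the regular input rate, which the table does not state.

```python
# Prices in USD per million tokens, copied from the pricing table above.
PRICES = {
    "DeepSeek R1 (0528)": {"input": 0.55, "output": 2.19, "cached_input": 0.14},
    "DeepSeek V4-Flash": {"input": 0.14, "output": 0.28, "cached_input": 0.0028},
    "DeepSeek V4-Pro": {"input": 0.435, "output": 0.87, "cached_input": 0.0036},
}


def estimate_cost(model, input_tokens, output_tokens, cached_fraction=0.0):
    """Return an estimated cost in USD.

    cached_fraction is the share of input tokens assumed to be billed at
    the cached-input rate (an assumption, not a documented billing rule).
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")
    price = PRICES[model]
    cached_tokens = input_tokens * cached_fraction
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * price["input"]
        + cached_tokens * price["cached_input"]
        + output_tokens * price["output"]
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10M input tokens (half cached), 2M output tokens.
    for model in PRICES:
        cost = estimate_cost(model, 10_000_000, 2_000_000, cached_fraction=0.5)
        print(f"{model}: ${cost:.2f}")
```

Because R1 always reasons while the V4 models make reasoning optional, real output-token counts for the same task may differ between models, so equal token counts across models are a simplification.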

Speed

Output speed (tok/s)	—	—	—
Time to first token (s)	—	—	—
Latency tier	slow	fast	medium

Trust & deployment

Trains on inputs	Yes	Yes	Yes
License	MIT	MIT	MIT
Clouds	—	—	—
Compliance	—	—	—

AI Panel scoring

Unified score	8	8.7	8.5
Decision Maker	8	8.5	8
Domain Strategist	8.5	8.5	8.5
Finance Lead	9	9.7	9.5
Domain Practitioner	8.5	8.5	8.5
Power User	8	8	7.5
Skeptic	7.5	7.5	7.5
Value score	9.2	9.9	9.7

Benchmarks

MMLU	93.4	—	—
MMLU-Pro	85	86.2	87.5
GPQA Diamond	81	88.1	90.1
AIME 2025	87.5	—	—
SWE-bench Verified	57.6	79	80.6
HumanEval	—	—	76.8
LiveCodeBench	73.3	91.6	93.5
Aider Polyglot	71.6	—	—
TAU-bench	63.9	—	—
MRCR Long Context	—	78.7	83.5
SimpleQA	27.8	34.1	57.9
Humanity's Last Exam	17.7	34.8	37.7
LMArena Elo	1382	—	—
LMArena Coding Elo	—	—	1287
Artificial Analysis Index	68	—	52

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result; they are left empty rather than padded.
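
The rule that an undisclosed value is never counted as zero can be sketched as follows. This is an illustrative sketch, not the page's actual highlighting logic: the helper `best_in_row` and its `higher_is_better` flag are hypothetical, and only a few benchmark rows are copied in. Undisclosed cells are represented as `None` and skipped, so they can neither win nor lose a row.

```python
MODELS = ["DeepSeek R1 (0528)", "DeepSeek V4-Flash", "DeepSeek V4-Pro"]

# A few rows from the benchmark table; None marks an undisclosed cell.
BENCHMARKS = {
    "MMLU": [93.4, None, None],
    "MMLU-Pro": [85, 86.2, 87.5],
    "SWE-bench Verified": [57.6, 79, 80.6],
    "HumanEval": [None, None, 76.8],
    "Artificial Analysis Index": [68, None, 52],
}


def best_in_row(values, higher_is_better=True):
    """Return the index of the best disclosed value, or None if nothing is disclosed.

    None is skipped rather than treated as 0, so a missing score is never
    ranked below a real one, and a missing price is never ranked as cheapest.
    """
    disclosed = [(value, index) for index, value in enumerate(values) if value is not None]
    if not disclosed:
        return None
    pick = max if higher_is_better else min
    return pick(disclosed, key=lambda pair: pair[0])[1]


if __name__ == "__main__":
    for name, row in BENCHMARKS.items():
        winner = best_in_row(row)
        label = MODELS[winner] if winner is not None else "no disclosed result"
        print(f"{name}: {label}")
```

A row with a single disclosed value still reports that value as best, so a "best" cell in a sparse row says little about the other models.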