DeepSeek R1 (0528) vs DeepSeek V4-Flash

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Add model:

DeepSeek

DeepSeek R1 (0528)GA

8.0

AI Panel

DeepSeek

DeepSeek V4-Flashpreview

8.7

AI Panel

Identity & lifecycle

Provider	DeepSeek	DeepSeek
Family / tier	DeepSeek R1 · Reasoning	DeepSeek V4 · Flash
Status	GA	preview
Released	2025-05-27	2026-04-23
Knowledge cutoff	2025-04	2026-02

Architecture & context

Context window	128K	1M
Max output tokens	64K	384K
Input modalities	text	text
Reasoning mode	always	optional
Open weights	Yes	Yes

Pricing (per Mtok)

Input	$0.55	$0.14
Output	$2.19	$0.28
Cached input	$0.14	$0.0028
Batch input	—	—
Free tier	Yes	Yes

Speed

Output speed (tok/s)	—	—
Time to first token (s)	—	—
Latency tier	slow	fast

Trust & deployment

Trains on inputs	Yes	Yes
License	MIT	MIT
Clouds	—	—
Compliance	—	—

AI Panel scoring

Unified score	8	8.7
Decision Maker	8	8.5
Domain Strategist	8.5	8.5
Finance Lead	9	9.7
Domain Practitioner	8.5	8.5
Power User	8	8
Skeptic	7.5	7.5
Value score	9.2	9.9

Benchmarks

MMLU	93.4	—
MMLU-Pro	85	86.2
GPQA Diamond	81	88.1
AIME 2025	87.5	—
SWE-bench Verified	57.6	79
LiveCodeBench	73.3	91.6
Aider Polyglot	71.6	—
TAU-bench	63.9	—
MRCR Long Context	—	78.7
SimpleQA	27.8	34.1
Humanity's Last Exam	17.7	34.8
LMArena Elo	1382	—
Artificial Analysis Index	68	—

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.