DeepSeek V3.1 vs DeepSeek V4-Pro

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Add model:

DeepSeek

DeepSeek V3.1GA

7.7

AI Panel

DeepSeek

DeepSeek V4-Propreview

8.5

AI Panel

Identity & lifecycle

Provider	DeepSeek	DeepSeek
Family / tier	DeepSeek V3 · Large	DeepSeek V4 · Pro
Status	GA	preview
Released	2025-08-20	2026-04-23
Knowledge cutoff	2025-06	2026-02

Architecture & context

Context window	128K	1M
Max output tokens	8K	384K
Input modalities	text	text
Reasoning mode	optional	optional
Open weights	Yes	Yes

Pricing (per Mtok)

Input	$0.21	$0.435
Output	$0.79	$0.87
Cached input	$0.021	$0.0036
Batch input	—	—
Free tier	Yes	Yes

Speed

Output speed (tok/s)	—	—
Time to first token (s)	—	—
Latency tier	medium	medium

Trust & deployment

Trains on inputs	Yes	Yes
License	MIT	MIT
Clouds	—	—
Compliance	—	—

AI Panel scoring

Unified score	7.7	8.5
Decision Maker	7.5	8
Domain Strategist	7.5	8.5
Finance Lead	8.5	9.5
Domain Practitioner	8	8.5
Power User	7.5	7.5
Skeptic	7.5	7.5
Value score	8.8	9.7

Benchmarks

MMLU-Pro	83.7	87.5
GPQA Diamond	74	90.1
AIME 2025	88	—
SWE-bench Verified	66	80.6
HumanEval	—	76.8
LiveCodeBench	80	93.5
Aider Polyglot	76.3	—
MRCR Long Context	—	83.5
SimpleQA	—	57.9
Humanity's Last Exam	—	37.7
LMArena Coding Elo	—	1287
Artificial Analysis Index	—	52

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.