DeepSeek V3.1 vs DeepSeek V4-Flash vs DeepSeek V4-Pro

Best cell per row highlighted. Null means undisclosed — never counted as zero.

DeepSeek
DeepSeek V3.1GA
7.7
AI Panel
DeepSeek
DeepSeek V4-Flashpreview
8.7
AI Panel
DeepSeek
DeepSeek V4-Propreview
8.5
AI Panel

Identity & lifecycle

ProviderDeepSeekDeepSeekDeepSeek
Family / tierDeepSeek V3 · LargeDeepSeek V4 · FlashDeepSeek V4 · Pro
StatusGApreviewpreview
Released2025-08-202026-04-232026-04-23
Knowledge cutoff2025-062026-022026-02

Architecture & context

Context window128K1M1M
Max output tokens8K384K384K
Input modalitiestexttexttext
Reasoning modeoptionaloptionaloptional
Open weightsYesYesYes

Pricing (per Mtok)

Input$0.21$0.14$0.435
Output$0.79$0.28$0.87
Cached input$0.021$0.0028$0.0036
Batch input
Free tierYesYesYes

Speed

Output speed (tok/s)
Time to first token (s)
Latency tiermediumfastmedium

Trust & deployment

Trains on inputsYesYesYes
LicenseMITMITMIT
Clouds
Compliance

AI Panel scoring

Unified score7.78.78.5
Decision Maker7.58.58
Domain Strategist7.58.58.5
Finance Lead8.59.79.5
Domain Practitioner88.58.5
Power User7.587.5
Skeptic7.57.57.5
Value score8.89.99.7

Benchmarks

MMLU-Pro83.786.287.5
GPQA Diamond7488.190.1
AIME 202588
SWE-bench Verified667980.6
HumanEval76.8
LiveCodeBench8091.693.5
Aider Polyglot76.3
MRCR Long Context78.783.5
SimpleQA34.157.9
Humanity's Last Exam34.837.7
LMArena Coding Elo1287
Artificial Analysis Index52

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.