Best cell per row highlighted. Null means undisclosed — never counted as zero.
| Provider | DeepSeek | DeepSeek | DeepSeek |
| Family / tier | DeepSeek-VL · VL | DeepSeek V4 · Flash | DeepSeek V4 · Pro |
| Status | GA | preview | preview |
| Released | 2024-12-12 | 2026-04-23 | 2026-04-23 |
| Knowledge cutoff | 2024-10 | 2026-02 | 2026-02 |
| Context window | 4K | 1M | 1M |
| Max output tokens | 4K | 384K | 384K |
| Input modalities | text, image | text | text |
| Reasoning mode | none | optional | optional |
| Open weights | Yes | Yes | Yes |
| Input | — | $0.14 | $0.435 |
| Output | — | $0.28 | $0.87 |
| Cached input | — | $0.0028 | $0.0036 |
| Batch input | — | — | — |
| Free tier | Yes | Yes | Yes |
| Output speed (tok/s) | — | — | — |
| Time to first token (s) | — | — | — |
| Latency tier | fast | fast | medium |
| Trains on inputs | No | Yes | Yes |
| License | custom-deepseek-model-license | MIT | MIT |
| Clouds | — | — | — |
| Compliance | — | — | — |
| Unified score | 7.2 | 8.7 | 8.5 |
| Decision Maker | 7.5 | 8.5 | 8 |
| Domain Strategist | 7 | 8.5 | 8.5 |
| Finance Lead | 8.5 | 9.7 | 9.5 |
| Domain Practitioner | 7.5 | 8.5 | 8.5 |
| Power User | 6.5 | 8 | 7.5 |
| Skeptic | 7 | 7.5 | 7.5 |
| Value score | 8.8 | 9.9 | 9.7 |
| MMLU-Pro | — | 86.2 | 87.5 |
| GPQA Diamond | — | 88.1 | 90.1 |
| SWE-bench Verified | — | 79 | 80.6 |
| HumanEval | — | — | 76.8 |
| LiveCodeBench | — | 91.6 | 93.5 |
| MMMU | 51.1 | — | — |
| MRCR Long Context | — | 78.7 | 83.5 |
| SimpleQA | — | 34.1 | 57.9 |
| Humanity's Last Exam | — | 34.8 | 37.7 |
| LMArena Coding Elo | — | — | 1287 |
| Artificial Analysis Index | — | — | 52 |
Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.