Best cell per row highlighted. Null means undisclosed — never counted as zero.
| Provider | DeepSeek | DeepSeek |
| Family / tier | DeepSeek R1 · Reasoning | DeepSeek V4 · Flash |
| Status | GA | preview |
| Released | 2025-05-27 | 2026-04-23 |
| Knowledge cutoff | 2025-04 | 2026-02 |
| Context window | 128K | 1M |
| Max output tokens | 64K | 384K |
| Input modalities | text | text |
| Reasoning mode | always | optional |
| Open weights | Yes | Yes |
| Input | $0.55 | $0.14 |
| Output | $2.19 | $0.28 |
| Cached input | $0.14 | $0.0028 |
| Batch input | — | — |
| Free tier | Yes | Yes |
| Output speed (tok/s) | — | — |
| Time to first token (s) | — | — |
| Latency tier | slow | fast |
| Trains on inputs | Yes | Yes |
| License | MIT | MIT |
| Clouds | — | — |
| Compliance | — | — |
| Unified score | 8 | 8.7 |
| Decision Maker | 8 | 8.5 |
| Domain Strategist | 8.5 | 8.5 |
| Finance Lead | 9 | 9.7 |
| Domain Practitioner | 8.5 | 8.5 |
| Power User | 8 | 8 |
| Skeptic | 7.5 | 7.5 |
| Value score | 9.2 | 9.9 |
| MMLU | 93.4 | — |
| MMLU-Pro | 85 | 86.2 |
| GPQA Diamond | 81 | 88.1 |
| AIME 2025 | 87.5 | — |
| SWE-bench Verified | 57.6 | 79 |
| LiveCodeBench | 73.3 | 91.6 |
| Aider Polyglot | 71.6 | — |
| TAU-bench | 63.9 | — |
| MRCR Long Context | — | 78.7 |
| SimpleQA | 27.8 | 34.1 |
| Humanity's Last Exam | 17.7 | 34.8 |
| LMArena Elo | 1382 | — |
| Artificial Analysis Index | 68 | — |
Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.