Best cell per row highlighted. Null means undisclosed — never counted as zero.
| Provider | DeepSeek | DeepSeek |
| Family / tier | DeepSeek V4 · Pro | DeepSeek V3 · Large |
| Status | preview | GA |
| Released | 2026-04-23 | 2025-11-30 |
| Knowledge cutoff | 2026-02 | 2025-07 |
| Context window | 1M | 128K |
| Max output tokens | 384K | 64K |
| Input modalities | text | text |
| Reasoning mode | optional | optional |
| Open weights | Yes | Yes |
| Input | $0.435 | $0.252 |
| Output | $0.87 | $0.378 |
| Cached input | $0.0036 | $0.025 |
| Batch input | — | — |
| Free tier | Yes | Yes |
| Output speed (tok/s) | — | — |
| Time to first token (s) | — | — |
| Latency tier | medium | medium |
| Trains on inputs | Yes | Yes |
| License | MIT | MIT |
| Clouds | — | — |
| Compliance | — | — |
| Unified score | 8.5 | 8.2 |
| Decision Maker | 8 | 8 |
| Domain Strategist | 8.5 | 8 |
| Finance Lead | 9.5 | 9 |
| Domain Practitioner | 8.5 | 8.5 |
| Power User | 7.5 | 7.5 |
| Skeptic | 7.5 | 7.5 |
| Value score | 9.7 | 9.3 |
| MMLU-Pro | 87.5 | 85 |
| GPQA Diamond | 90.1 | — |
| AIME 2025 | — | 93.1 |
| SWE-bench Verified | 80.6 | 67.8 |
| HumanEval | 76.8 | — |
| LiveCodeBench | 93.5 | 74.1 |
| MRCR Long Context | 83.5 | — |
| SimpleQA | 57.9 | — |
| Humanity's Last Exam | 37.7 | 30.6 |
| LMArena Coding Elo | 1287 | — |
| Artificial Analysis Index | 52 | — |
Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.