Llama 3.1 405B vs Llama 4 Scout

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Add model:

Meta

Llama 3.1 405BGA

6.3

AI Panel

Meta

Llama 4 ScoutGA

7.5

AI Panel

Identity & lifecycle

Provider	Meta	Meta
Family / tier	Llama 3 · Large	Llama 4 · Scout
Status	GA	GA
Released	2024-07-22	2025-04-04
Knowledge cutoff	2023-12	2024-08

Architecture & context

Context window	128K	10M
Max output tokens	4K	8K
Input modalities	text	text, image
Reasoning mode	none	none
Open weights	Yes	Yes

Pricing (per Mtok)

Input	$3	$0.1
Output	$3	$0.34
Cached input	—	—
Batch input	—	—
Free tier	No	No

Speed

Output speed (tok/s)	29	106.1
Time to first token (s)	0.7	0.56
Latency tier	slow	fast

Trust & deployment

Trains on inputs	No	No
License	Llama-3-Community	Llama-4-Community
Clouds	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx
Compliance	—	—

AI Panel scoring

Unified score	6.3	7.5
Decision Maker	6.5	8.5
Domain Strategist	6	7.5
Finance Lead	5	9
Domain Practitioner	6.5	7.5
Power User	6	6.5
Skeptic	6	5.5
Value score	4.5	9.5

Benchmarks

MMLU	88.6	79.6
MMLU-Pro	73.3	74.3
GPQA Diamond	51.1	57.2
MATH-500	73.8	50.3
HumanEval	89	82
LiveCodeBench	—	32.8
MMMU	—	69.4
IFEval	88.6	—
BBH	81.3	—
LMArena Elo	1267	—
Artificial Analysis Index	17	14

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.