Llama 3.1 70B vs Llama 4 Scout

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Add model:

Meta

Llama 3.1 70BGA

6.7

AI Panel

Meta

Llama 4 ScoutGA

7.5

AI Panel

Identity & lifecycle

Provider	Meta	Meta
Family / tier	Llama 3 · Large	Llama 4 · Scout
Status	GA	GA
Released	2024-07-22	2025-04-04
Knowledge cutoff	2023-12	2024-08

Architecture & context

Context window	128K	10M
Max output tokens	4K	8K
Input modalities	text	text, image
Reasoning mode	none	none
Open weights	Yes	Yes

Pricing (per Mtok)

Input	$0.4	$0.1
Output	$0.59	$0.34
Cached input	—	—
Batch input	—	—
Free tier	No	No

Speed

Output speed (tok/s)	81.8	106.1
Time to first token (s)	0.4	0.56
Latency tier	fast	fast

Trust & deployment

Trains on inputs	No	No
License	Llama-3-Community	Llama-4-Community
Clouds	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx
Compliance	—	—

AI Panel scoring

Unified score	6.7	7.5
Decision Maker	6.5	8.5
Domain Strategist	6	7.5
Finance Lead	7	9
Domain Practitioner	7	7.5
Power User	6.5	6.5
Skeptic	6.5	5.5
Value score	7.5	9.5

Benchmarks

MMLU	86	79.6
MMLU-Pro	66.4	74.3
GPQA Diamond	46.7	57.2
MATH-500	68	50.3
HumanEval	80.5	82
LiveCodeBench	—	32.8
MMMU	—	69.4
IFEval	87.5	—
BBH	73	—
LMArena Elo	1247	—
Artificial Analysis Index	13	14

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.