Llama 3.3 70B vs Llama 4 Scout

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Add model:

Meta

Llama 3.3 70BGA

7.4

AI Panel

Meta

Llama 4 ScoutGA

7.5

AI Panel

Identity & lifecycle

Provider	Meta	Meta
Family / tier	Llama 3 · Large	Llama 4 · Scout
Status	GA	GA
Released	2024-12-05	2025-04-04
Knowledge cutoff	2023-12	2024-08

Architecture & context

Context window	128K	10M
Max output tokens	4K	8K
Input modalities	text	text, image
Reasoning mode	none	none
Open weights	Yes	Yes

Pricing (per Mtok)

Input	$0.12	$0.1
Output	$0.4	$0.34
Cached input	—	—
Batch input	—	—
Free tier	No	No

Speed

Output speed (tok/s)	81.8	106.1
Time to first token (s)	0.4	0.56
Latency tier	fast	fast

Trust & deployment

Trains on inputs	No	No
License	Llama-3-Community	Llama-4-Community
Clouds	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx
Compliance	—	—

AI Panel scoring

Unified score	7.4	7.5
Decision Maker	8	8.5
Domain Strategist	7	7.5
Finance Lead	8	9
Domain Practitioner	8	7.5
Power User	7	6.5
Skeptic	6.5	5.5
Value score	8.5	9.5

Benchmarks

MMLU	86	79.6
MMLU-Pro	68.9	74.3
GPQA Diamond	50.5	57.2
MATH-500	77	50.3
HumanEval	88.4	82
LiveCodeBench	—	32.8
MMMU	—	69.4
IFEval	92.1	—
LMArena Elo	1257	—
Artificial Analysis Index	14	14

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.