Llama 4 Scout vs Llama 3.3 70B

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Add model:

Meta

Llama 4 ScoutGA

7.5

AI Panel

Meta

Llama 3.3 70BGA

7.4

AI Panel

Identity & lifecycle

Provider	Meta	Meta
Family / tier	Llama 4 · Scout	Llama 3 · Large
Status	GA	GA
Released	2025-04-04	2024-12-05
Knowledge cutoff	2024-08	2023-12

Architecture & context

Context window	10M	128K
Max output tokens	8K	4K
Input modalities	text, image	text
Reasoning mode	none	none
Open weights	Yes	Yes

Pricing (per Mtok)

Input	$0.1	$0.12
Output	$0.34	$0.4
Cached input	—	—
Batch input	—	—
Free tier	No	No

Speed

Output speed (tok/s)	106.1	81.8
Time to first token (s)	0.56	0.4
Latency tier	fast	fast

Trust & deployment

Trains on inputs	No	No
License	Llama-4-Community	Llama-3-Community
Clouds	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx
Compliance	—	—

AI Panel scoring

Unified score	7.5	7.4
Decision Maker	8.5	8
Domain Strategist	7.5	7
Finance Lead	9	8
Domain Practitioner	7.5	8
Power User	6.5	7
Skeptic	5.5	6.5
Value score	9.5	8.5

Benchmarks

MMLU	79.6	86
MMLU-Pro	74.3	68.9
GPQA Diamond	57.2	50.5
MATH-500	50.3	77
HumanEval	82	88.4
LiveCodeBench	32.8	—
MMMU	69.4	—
IFEval	—	92.1
LMArena Elo	—	1257
Artificial Analysis Index	14	14

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.