Llama 4 Scout vs Llama 4 Maverick vs Llama 3.3 70B

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Add model:

Meta

Llama 4 ScoutGA

7.5

AI Panel

Meta

Llama 4 MaverickGA

7.7

AI Panel

Meta

Llama 3.3 70BGA

7.4

AI Panel

Identity & lifecycle

Provider	Meta	Meta	Meta
Family / tier	Llama 4 · Scout	Llama 4 · Maverick	Llama 3 · Large
Status	GA	GA	GA
Released	2025-04-04	2025-04-04	2024-12-05
Knowledge cutoff	2024-08	2024-08	2023-12

Architecture & context

Context window	10M	1M	128K
Max output tokens	8K	8K	4K
Input modalities	text, image	text, image	text
Reasoning mode	none	none	none
Open weights	Yes	Yes	Yes

Pricing (per Mtok)

Input	$0.1	$0.2	$0.12
Output	$0.34	$0.85	$0.4
Cached input	—	—	—
Batch input	—	—	—
Free tier	No	No	No

Speed

Output speed (tok/s)	106.1	104.3	81.8
Time to first token (s)	0.56	0.66	0.4
Latency tier	fast	fast	fast

Trust & deployment

Trains on inputs	No	No	No
License	Llama-4-Community	Llama-4-Community	Llama-3-Community
Clouds	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx	Bedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx
Compliance	—	—	—

AI Panel scoring

Unified score	7.5	7.7	7.4
Decision Maker	8.5	8.5	8
Domain Strategist	7.5	7.5	7
Finance Lead	9	9	8
Domain Practitioner	7.5	7.5	8
Power User	6.5	6.5	7
Skeptic	5.5	5.5	6.5
Value score	9.5	9	8.5

Benchmarks

MMLU	79.6	85.5	86
MMLU-Pro	74.3	80.5	68.9
GPQA Diamond	57.2	69.8	50.5
MATH-500	50.3	61.2	77
HumanEval	82	85.8	88.4
LiveCodeBench	32.8	43.4	—
Aider Polyglot	—	15.6	—
MMMU	69.4	73.4	—
IFEval	—	—	92.1
LMArena Elo	—	1271	1257
Artificial Analysis Index	14	18	14

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.