Llama 3.1 70B vs Llama 4 Maverick vs Llama 4 Scout

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Meta
Llama 3.1 70BGA
6.7
AI Panel
Meta
Llama 4 MaverickGA
7.7
AI Panel
Meta
Llama 4 ScoutGA
7.5
AI Panel

Identity & lifecycle

ProviderMetaMetaMeta
Family / tierLlama 3 · LargeLlama 4 · MaverickLlama 4 · Scout
StatusGAGAGA
Released2024-07-222025-04-042025-04-04
Knowledge cutoff2023-122024-082024-08

Architecture & context

Context window128K1M10M
Max output tokens4K8K8K
Input modalitiestexttext, imagetext, image
Reasoning modenonenonenone
Open weightsYesYesYes

Pricing (per Mtok)

Input$0.4$0.2$0.1
Output$0.59$0.85$0.34
Cached input
Batch input
Free tierNoNoNo

Speed

Output speed (tok/s)81.8104.3106.1
Time to first token (s)0.40.660.56
Latency tierfastfastfast

Trust & deployment

Trains on inputsNoNoNo
LicenseLlama-3-CommunityLlama-4-CommunityLlama-4-Community
CloudsBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonxBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonxBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx
Compliance

AI Panel scoring

Unified score6.77.77.5
Decision Maker6.58.58.5
Domain Strategist67.57.5
Finance Lead799
Domain Practitioner77.57.5
Power User6.56.56.5
Skeptic6.55.55.5
Value score7.599.5

Benchmarks

MMLU8685.579.6
MMLU-Pro66.480.574.3
GPQA Diamond46.769.857.2
MATH-5006861.250.3
HumanEval80.585.882
LiveCodeBench43.432.8
Aider Polyglot15.6
MMMU73.469.4
IFEval87.5
BBH73
LMArena Elo12471271
Artificial Analysis Index131814

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.