Llama 3.1 8B vs Llama 4 Maverick

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Meta
Llama 3.1 8BGA
7.4
AI Panel
Meta
Llama 4 MaverickGA
7.7
AI Panel

Identity & lifecycle

ProviderMetaMeta
Family / tierLlama 3 · SmallLlama 4 · Maverick
StatusGAGA
Released2024-07-222025-04-04
Knowledge cutoff2023-122024-08

Architecture & context

Context window128K1M
Max output tokens4K8K
Input modalitiestexttext, image
Reasoning modenonenone
Open weightsYesYes

Pricing (per Mtok)

Input$0.05$0.2
Output$0.08$0.85
Cached input
Batch input
Free tierYesNo

Speed

Output speed (tok/s)159.4104.3
Time to first token (s)0.30.66
Latency tierfastfast

Trust & deployment

Trains on inputsNoNo
LicenseLlama-3-CommunityLlama-4-Community
CloudsBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonxBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx
Compliance

AI Panel scoring

Unified score7.47.7
Decision Maker88.5
Domain Strategist7.57.5
Finance Lead9.59
Domain Practitioner8.57.5
Power User66.5
Skeptic6.55.5
Value score9.59

Benchmarks

MMLU69.485.5
MMLU-Pro48.380.5
GPQA Diamond30.469.8
MATH-50051.961.2
HumanEval72.685.8
LiveCodeBench43.4
Aider Polyglot15.6
MMMU73.4
IFEval80.4
BBH64.2
LMArena Elo11761271
Artificial Analysis Index1218

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.