Llama 4 Scout vs Llama 4 Maverick vs Llama 3.3 70B

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Meta
Llama 4 ScoutGA
7.5
AI Panel
Meta
Llama 4 MaverickGA
7.7
AI Panel
Meta
Llama 3.3 70BGA
7.4
AI Panel

Identity & lifecycle

ProviderMetaMetaMeta
Family / tierLlama 4 · ScoutLlama 4 · MaverickLlama 3 · Large
StatusGAGAGA
Released2025-04-042025-04-042024-12-05
Knowledge cutoff2024-082024-082023-12

Architecture & context

Context window10M1M128K
Max output tokens8K8K4K
Input modalitiestext, imagetext, imagetext
Reasoning modenonenonenone
Open weightsYesYesYes

Pricing (per Mtok)

Input$0.1$0.2$0.12
Output$0.34$0.85$0.4
Cached input
Batch input
Free tierNoNoNo

Speed

Output speed (tok/s)106.1104.381.8
Time to first token (s)0.560.660.4
Latency tierfastfastfast

Trust & deployment

Trains on inputsNoNoNo
LicenseLlama-4-CommunityLlama-4-CommunityLlama-3-Community
CloudsBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonxBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonxBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx
Compliance

AI Panel scoring

Unified score7.57.77.4
Decision Maker8.58.58
Domain Strategist7.57.57
Finance Lead998
Domain Practitioner7.57.58
Power User6.56.57
Skeptic5.55.56.5
Value score9.598.5

Benchmarks

MMLU79.685.586
MMLU-Pro74.380.568.9
GPQA Diamond57.269.850.5
MATH-50050.361.277
HumanEval8285.888.4
LiveCodeBench32.843.4
Aider Polyglot15.6
MMMU69.473.4
IFEval92.1
LMArena Elo12711257
Artificial Analysis Index141814

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.