Llama 4 Maverick vs Llama 3.3 70B

Best cell per row highlighted. Null means undisclosed — never counted as zero.

Meta
Llama 4 MaverickGA
7.7
AI Panel
Meta
Llama 3.3 70BGA
7.4
AI Panel

Identity & lifecycle

ProviderMetaMeta
Family / tierLlama 4 · MaverickLlama 3 · Large
StatusGAGA
Released2025-04-042024-12-05
Knowledge cutoff2024-082023-12

Architecture & context

Context window1M128K
Max output tokens8K4K
Input modalitiestext, imagetext
Reasoning modenonenone
Open weightsYesYes

Pricing (per Mtok)

Input$0.2$0.12
Output$0.85$0.4
Cached input
Batch input
Free tierNoNo

Speed

Output speed (tok/s)104.381.8
Time to first token (s)0.660.4
Latency tierfastfast

Trust & deployment

Trains on inputsNoNo
LicenseLlama-4-CommunityLlama-3-Community
CloudsBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonxBedrock, Vertex AI, Azure AI Foundry, GCP, OCI, IBM watsonx
Compliance

AI Panel scoring

Unified score7.77.4
Decision Maker8.58
Domain Strategist7.57
Finance Lead98
Domain Practitioner7.58
Power User6.57
Skeptic5.56.5
Value score98.5

Benchmarks

MMLU85.586
MMLU-Pro80.568.9
GPQA Diamond69.850.5
MATH-50061.277
HumanEval85.888.4
LiveCodeBench43.4
Aider Polyglot15.6
MMMU73.4
IFEval92.1
LMArena Elo12711257
Artificial Analysis Index1814

Scores link to their sources. Missing cells mean the vendor hasn't disclosed a result — honesty over padding.