GPT-5 nano

by OpenAI · GPT-5 family · best for OpenAI cost floor for plumbing-grade work

Cost-OptimizedEdge / On-Device

6.4

AI Panel Score

Value 9.0/10

GPT-5 nano is the smallest and cheapest model in the GPT-5 family, released 2025-08-07 — the ultra-cheap floor of the lineup at $0.05/$0.40 per 1M tokens. It remains GA and is still the cheapest OpenAI text model as of 2026-05-28, even after GPT-5.4 nano arrived at a higher price with materially better benchmarks. The one-sentence buyer's take: a legitimate plumbing model for the very highest volumes and lowest stakes — bulk classification, tagging, and triage where cost is the only metric — but wrong for anything user-facing.

Compare this model All GPT-5 versions

What's new

GPT-5 nano was the original nano-tier model and established the ultra-cheap end of the GPT-5 lineup at $0.05/$0.40 with a 400K context and 128K max output. It was designed for summarization, classification, and other speed-and-cost-priority workloads. As a 2025 model it predates the GPT-5.4 generation's native computer use, apply_patch, and skills. Its enduring relevance today is purely economic: it is still the cheapest OpenAI text model, ~4x cheaper on input than GPT-5.4 nano.

AI Panel Review

Six personas, six verdicts — the same panel that reviews every product on TopReviewed.

Decision Maker6/10

“One differentiator left — it's the cheapest model in the lineup, period. For absorbed-error operational work, it still pencils out.”

GPT-5 nano has one differentiator: it is the cheapest model in the lineup, period. At $0.05/$0.40 it competes with embedding-pipeline costs for some classification jobs. The architectural question is whether the quality gap to GPT-5.4 nano matters for the specific workload. For genuine high-volume operational work where misclassification costs are absorbed downstream, GPT-5 nano can still pencil out. For anything user-facing, the quality gap pushes the right answer to GPT-5.4 nano or higher. Fine-tuning is a secondary reason to stay.

Strategic Fit 6Vendor Risk 6Roadmap Confidence 6

Pros

cheapest in lineup
fine-tuning

Cons

low quality ceiling
stale

Right for: absorbed-error operational work

Avoid if: output reaches users

Domain Strategist6/10

“A pure price-floor play — its only market is the volume tier where a 4x cost gap beats a benchmark gap, and that tier is shrinking.”

Strategically, GPT-5 nano is a pure price-floor play. Its market is the narrow band where the 4x input-cost advantage over GPT-5.4 nano outweighs the quality gap — massive, low-stakes, absorbed-error pipelines. That band is real but shrinking as GPT-5.4 nano's cost falls and as competitors (Gemini Flash Lite-tier) race the same curve. Differentiation is price alone. Market timing has passed for anything but throughput-floor work. Expect it to persist as infrastructure plumbing, not as a strategic asset.

Competitive Positioning 6Differentiation 5Market Timing 5

Pros

lowest price
fine-tuning

Cons

price-only wedge
shrinking band

Right for: throughput-floor pipelines

Avoid if: quality affects outcomes

Finance Lead9/10

“Where the unit economics get surreal — millions of decisions for single-digit dollars when the pipeline is prefix-cached.”

This is where the cost conversation gets surreal in the right way. $0.05 in / $0.40 out, $0.005 cached. For a heavily prefix-cached classification pipeline, you can run millions of decisions for single-digit dollars; Batch halves both sides again. The financial argument to stay on GPT-5 nano specifically over GPT-5.4 nano is genuine for the highest-volume workloads — at scale the 4x input difference compounds. The discipline question: is your pipeline getting nano-grade work, or is nano producing errors that cost more downstream than the savings? Track end-to-end ROI, not per-token spend.

Cost Efficiency 10Pricing Transparency 9Value per Dollar 9

Pros

cost floor
near-zero at scale
fine-tuning

Cons

downstream error cost risk

Right for: massive prefix-cached classification

Avoid if: errors are expensive downstream

Domain Practitioner6/10

“Same API, same SDK — but most teams shouldn't optimize down here; handling lower-quality output usually costs more than the savings.”

Same Responses API, same SDK behavior, same reasoning dial. The price drop versus GPT-5.4 nano is meaningful for very-high-volume pipelines, but the quality drop is also meaningful. Most developers shouldn't optimize down to GPT-5 nano unless throughput economics demand it — the engineering time to handle lower-quality output usually exceeds the dollar savings. Where it works: explicit pipelines that already produce low-stakes outputs (tags, categories, short summaries). Fine-tuning is supported, which can lift quality on a narrow task.

API Ergonomics 8Tool/Agent Support 6Reliability 7

Pros

familiar API
cheapest
fine-tuning

Cons

quality-handling overhead
low ceiling

Right for: low-stakes explicit pipelines

Avoid if: output quality needs engineering to fix

Power User6/10

“Plumbing, not personality — fast where its output reaches a surface, but upgrade for any direct user interaction.”

End users never see GPT-5 nano labeled — it powers infrastructure-grade backend tasks. Where its output reaches a user surface, the experience is fast but limited. Refusal rate is in line with the family. The mental model: GPT-5 nano is plumbing, not personality. For any user-facing direct interaction, upgrade. Its legitimate role is the silent, high-volume layer beneath the product, not the conversation the user has.

Output Quality 5Speed 9Everyday Usefulness 6

Pros

very fast
fine for plumbing

Cons

low quality at user surfaces

Right for: backend infrastructure tasks

Avoid if: it answers users directly

Skeptic6/10

“No published benchmarks at all — the entire pitch is price. That's honest, but it means 'good enough' is a claim you must test yourself.”

The adversarial read: GPT-5 nano has no published per-benchmark scores, so every quality claim is yours to verify. The pitch is entirely price, which is honest but means "good enough for classification" is an assertion, not a demonstrated fact — and on anything past simple tagging the low ceiling shows fast. The 2024-05 cutoff is the lineup's oldest. The genuine value case (throughput-floor, absorbed-error work plus fine-tuning) is real, but the temptation to use it one rung above where it belongs is exactly how teams ship silent quality regressions. Cheapest is not free of risk.

Claim Accuracy 6Weakness Severity 6Hype vs Reality 6

Pros

honest price-only pitch
real floor value

Cons

zero published benchmarks
easy to misuse

Right for: skeptics who measure end-to-end ROI

Avoid if: you assume "cheap" equals "fine."

Strengths

Cheapest OpenAI text model: $0.05/$0.40 per 1M tokens.
90% cached-input discount drops effective input cost to $0.005/1M.
400K context and 128K output despite the price.
Stable API; mature SDK support.
Supports fine-tuning (GPT-5.4 nano does not).

Limitations

Materially weaker than GPT-5.4 nano on every benchmark where both have published scores.
2024-05 knowledge cutoff — oldest in the in-scope lineup.
No computer use, apply_patch, or skills.
Output quality has a low ceiling; pushing reasoning effort up rarely matches GPT-5.4 nano.
Not user-facing in ChatGPT.

Best use cases

Highest-volume, lowest-stakes classification and extraction where cost is the only metric.
First-stage triage in tiered routing where misclassification cost is low.
Bulk content tagging, query rewriting, and metadata generation.
Background tasks where Batch + cached input drives effective cost to near-zero.
Fine-tuned nano-tier deployments (GPT-5.4 nano does not offer fine-tuning).

Deep dive

The full research notes behind this review — verified against primary sources.

Architecture Capabilities Benchmark analysis Speed & latency Pricing analysis Deployment & access Safety & privacy Ecosystem & tooling

Architecture

OpenAI discloses no parameter counts or structure for GPT-5 nano — all null. It is a unified reasoning model with text + image input and text-only output, o200k_base tokenizer, and a 400K context window. No further architectural detail is public; OpenAI positions nano for speed and cost rather than disclosed capability.

Capabilities

GPT-5 nano is the family's cost floor and capability floor, justifying its 6.0 coding and 6.2 reasoning scores — it handles bulk summarization, classification, extraction, query rewriting, and other operational tasks, but has a low ceiling on anything requiring depth. Instruction-following (7.6) and function-calling (7.6) are reliable for simple routing. Long-context (7.0) reflects the 400K window. Multilingual (7.0), vision (6.8), and document/OCR (6.6) handle image input at a basic level. Agentic (5.8) is weak — it is not an agent anchor. Safety calibration (8.0) is solid. Real-time data (4.8) is the weakest axis: a 2024-05 cutoff plus a low reasoning ceiling means it should not be relied on for current or nuanced answers. Pushing reasoning effort up rarely matches GPT-5.4 nano. No published per-benchmark scores exist at this tier, which is why research_confidence is medium.

Benchmark analysis

OpenAI positions GPT-5 nano for speed and cost rather than benchmark performance, and independent third-party benchmark coverage at this tier is limited. No verifiable per-benchmark score is recorded; all benchmark keys are null. The defensible comparison is qualitative: GPT-5 nano is materially weaker than GPT-5.4 nano on every benchmark where both have published figures, at roughly one-fourth the cost.

Speed & latency

GPT-5 nano is among the fastest models in the family — estimated steady-state output around 200–250 tokens/sec at low/default reasoning with sub-half-second time-to-first-token. latency_tier is fast; speed is the model's main user-visible characteristic. Default Tier-1 rate limits (500 RPM / 200K TPM) cap raw throughput until tier upgrades. For bulk work, Batch removes latency entirely.

Pricing analysis

Surface	Cost	Notes
API input	$0.05 / 1M tok	cheapest OpenAI text model
API output	$0.40 / 1M tok
Cached input	$0.005 / 1M tok	90% discount
Batch (in/out)	$0.025 / $0.20	50% off, 24h SLA
Direct UI	n/a	not user-selectable
Free tier	none (API)
Fine-tuning	available	GPT-5 nano supports fine-tuning
Rate limits (Tier 1)	500 RPM / 200K TPM	climb tiers for more

Reasoning-token note: keep nano at low/default effort. At $0.005 cached input, prefix-heavy jobs round to near-zero.

Deployment & access

API-only via the Responses API; not open-weights, license Proprietary, not self-hostable. Cloud-managed via Azure OpenAI and Azure AI Foundry; OpenRouter proxies it. Data residency covers US and EU. It supports fine-tuning (GPT-5.4 nano does not). The tool surface predates computer use, apply_patch, and skills.

Safety & privacy

Governed by OpenAI's Preparedness Framework. No training on API inputs by default; opt-out and zero-retention available for enterprise. Compliance covers SOC2, GDPR, CCPA, and HIPAA (BAA). Content moderation is built in. Refusal calibration is in line with the original GPT-5 generation.

Ecosystem & tooling

First-party SDKs in Python, TypeScript, Java, Go, and .NET, plus the OpenAI Agents SDK. Framework integrations span LangChain, LlamaIndex, Vercel AI SDK, and Pydantic AI. It powers content-tagging and classification backends and the leaf tier of cost-sensitive pipelines. Popularity tier: growing (as cost-floor plumbing), invisible at product surfaces.

Buyer questions

Why use GPT-5 nano over GPT-5.4 nano?

Only one reason: the ~4x lower input cost matters at your volume and your work is low-stakes enough to absorb the quality gap. Otherwise use GPT-5.4 nano.

Does it have published benchmarks?

No — OpenAI positions it on speed and cost. Validate quality on your own task before relying on it.

What's the throughput limit?

Tier-1 defaults are 500 RPM / 200K TPM. Climb the usage tiers for more.

How cheap is it really?

$0.005 cached input and $0.025/$0.20 on Batch mean millions of cached decisions cost single-digit dollars.

Can I fine-tune it?

Yes — and GPT-5.4 nano does not offer fine-tuning, so this is a real reason to choose it for a narrow tuned task.

Is my data used for training?

No, not by API default; enterprise opt-out and zero-retention exist.

Comparable models

GPT-5.4 nano — OpenAI

~4x more expensive but materially better on every published benchmark; the default if quality matters at all.

GPT-5 mini — OpenAI

same generation, ~5x the input price, higher capability ceiling.

Claude Haiku 4.5 — Anthropic

comparable role at higher price, often preferred for writing tone.

Gemini Flash Lite — Google

direct peer at the cost floor.

Sources

Primary references used to verify this review.

Model specs

Input price: $0.05 / Mtok
Output price: $0.40 / Mtok
Cached input: $0.01 / Mtok
Batch (in/out): $0.03 / $0.20
Context window: 400K tokens
Max output: 128K tokens
Knowledge cutoff: 2024-05
Released: 2025-08-06
Modalities: text, image → text
Output speed: ~220 tok/s
License: Proprietary
Clouds: Azure OpenAI, Azure AI Foundry

Does not train on API inputs by default

Other GPT-5 versions

Last verified 2026-05-27