GPT-5 Nano: token counter & pricing
OpenAI · exact (uses official tokenizer) · pricing as of 2026-05-31.
- Provider
- OpenAI
- API model ID
gpt-5-nano- Context window
- 400,000 tokens
- Input price
- $0.05 per 1M tokens
- Output price
- $0.40 per 1M tokens
- Tokenizer accuracy
- exact (uses official tokenizer)
- Pricing as of
- 2026-05-31
Open the counter to count tokens for GPT-5 Nano in real time.
What is GPT-5 Nano?
GPT-5 Nano is the smallest, cheapest, fastest member of OpenAI's GPT-5 family. Designed for high-volume cost-sensitive workloads where you want GPT-5's tokenizer and ecosystem at near-throwaway prices.
At $0.05 per 1M input tokens, GPT-5 Nano is the cheapest exact-tokenizer model on this counter, cheaper than Gemini 2.5 Flash-Lite ($0.10), Gemini 3.1 Flash-Lite Preview ($0.25), GPT-4.1 Nano ($0.10), and any open-source model.
How tokens are counted here
Like the rest of the GPT-5 family, GPT-5 Nano encodes text with o200k_base. This page runs js-tiktoken in your browser, so the count is exact and your text never leaves your device. Exactness matters more at Nano's end of the market than anywhere else: at tens of millions of calls per month, the roughly 3% error typical of character-ratio estimators stops being a rounding artifact and becomes a real line on the invoice. When the whole point of the model is squeezing cost, measure with the real encoder, not a heuristic.
When to use GPT-5 Nano
- High-volume classification, extraction, labeling. This is the use case the price is designed for.
- First-pass routing in agent pipelines before escalating hard cases to GPT-5 / GPT-5.5.
- Real-time UX where lower latency matters and the task is simple.
- Budget-extreme batch processing, millions of small calls per day at near-trivial cost.
When not to use it:
- Multi-step reasoning. The capability gap to GPT-5 / 5.5 is real on hard prompts.
- Code generation on non-trivial problems. Use GPT-5 Mini or higher.
- Anywhere quality has measurably mattered in your evals.
Pricing notes
$0.05 input / $0.40 output per 1M tokens. Cached input is $0.005/M, effectively free.
For comparison, a 1,000-token prompt with a 200-token reply on GPT-5 Nano:
- Per call: $0.000050 + $0.000080 = $0.000130 (about 1/30¢)
- Per million calls: $130
The same workload on GPT-5.5 costs $4,300. Per-million-call savings: $4,170.
What GPT-5 Nano costs in production
Take the canonical Nano workload: high-volume classification. Each call sends 500 input tokens (the text plus a labeling instruction) and returns a 50-token structured answer, at 1 million calls per month. That is 500M input tokens and 50M output tokens.
- Input: 500M tokens at $0.05/M = $25
- Output: 50M tokens at $0.40/M = $20
- Monthly total: $45
Nothing exact-tokenized undercuts that. The nearest budget rival, Gemini 2.5 Flash-Lite ($0.10 / $0.40), runs $50 + $20 = $70 per month on the same traffic. A pricier small model like Claude Haiku 4.5 ($1.00 / $5.00) costs $500 + $250 = $750 per month, nearly 17x Nano. At this scale the question is never whether Nano is cheap enough; it is whether its accuracy clears your bar.
Migrating from GPT-4.1 Nano
Swap the apiId from gpt-4.1-nano to gpt-5-nano. Both models use o200k_base, so token counts carry over exactly; no prompt re-measurement needed. Input price halves, $0.10 down to $0.05 per 1M tokens, while output stays at $0.40, so input-heavy classification gets cheaper and output-heavy work costs the same. One regression to check: context shrinks from 1M tokens to 400K. If you were feeding GPT-4.1 Nano whole long documents, add a length check before cutover or those calls will start erroring.
GPT-5 Nano vs the obvious alternative
Gemini 2.5 Flash-Lite is the model most teams shortlist against Nano. On the classification scenario above, Flash-Lite's bill is $70 per month to Nano's $45, entirely down to the 2x input rate ($0.10 vs $0.05). Flash-Lite answers back with a 1M context window and multimodal input, neither of which matters for short text labeling. For that core workload, Nano wins on price; pick Flash-Lite when the inputs are images or very long.
Common questions
Is GPT-5 Nano good enough for production?
For the workloads it's designed for, yes. Run a labeled eval set; if it hits your accuracy bar, the savings are massive.
What's the difference between GPT-5 Nano and GPT-5.4 Nano?
Same tokenizer and ballpark capability tier. GPT-5 Nano ($0.05/$0.40) is cheaper. GPT-5.4 Nano ($0.20/$1.25) is incrementally better on hard reasoning. The gap is narrow; default to GPT-5 Nano unless you measure the upgrade winning.
How does GPT-5 Nano compare to Gemini 2.5 Flash-Lite?
Both target the same workloads. GPT-5 Nano: $0.05/$0.40. Gemini 2.5 Flash-Lite: $0.10/$0.40. Nano is half the input price and tied on output. Gemini Flash-Lite has a 1M context window and multimodal input, both advantages over Nano's text-only 400K context. Choose by what your workload actually needs.
Compare GPT-5 Nano to other models
- GPT-5.5 (OpenAI, $5.00/$30.00)
- GPT-5.5 Pro (OpenAI, $30.00/$180.00)
- GPT-5.4 (OpenAI, $2.50/$15.00)
- GPT-5.4 Mini (OpenAI, $0.75/$4.50)
- GPT-5.4 Nano (OpenAI, $0.20/$1.25)
- GPT-5.4 Pro (OpenAI, $30.00/$180.00)
- GPT-5.3 (OpenAI, $1.75/$14.00)
- GPT-5.2 (OpenAI, $1.75/$14.00)
- GPT-5.2 Pro (OpenAI, $21.00/$168.00)
- GPT-5.1 (OpenAI, $1.25/$10.00)
- GPT-5 (OpenAI, $1.25/$10.00)
- GPT-5 Mini (OpenAI, $0.25/$2.00)
- GPT-5 Pro (OpenAI, $15.00/$120.00)
- GPT-4.1 (OpenAI, $2.00/$8.00)
- GPT-4.1 Mini (OpenAI, $0.40/$1.60)
- GPT-4.1 Nano (OpenAI, $0.10/$0.40)
- o3 (OpenAI, $2.00/$8.00)
- o3-mini (OpenAI, $1.10/$4.40)
- o3-pro (OpenAI, $20.00/$80.00)
- o4-mini (OpenAI, $1.10/$4.40)
- GPT-4o (OpenAI, $2.50/$10.00)
- GPT-4o mini (OpenAI, $0.15/$0.60)
- GPT-4 Turbo (OpenAI, $10.00/$30.00)
- Gemini 2.5 Flash-Lite (Google, $0.10/$0.40)
- Llama 3.1 8B (Meta, $0.18/$0.18)
- Gemini 3.1 Flash-Lite (Google, $0.25/$1.50)