Skip to content

GPT-4o mini API Pricing

GPT-4o mini costs $0.15 per million input tokens and $0.60 per million output tokens at list price. Here is the cost at volume and how it ranks against 6 other models (2 of 7 by blended price).

input$0.15per 1M tokens
output$0.60per 1M tokens
blended$0.263:1 input:output mix
one chat request$0.00061.5k in / 600 out, no tools
agent workload / month$2742,000 req/day, 3 tool calls, RAG on
free_toolPrice GPT-4o mini on your own workloadSet your volume, tokens, tool calls, and RAG in the AI Agent Cost Calculator, prefilled with GPT-4o mini, and compare it against every other model.

alternatives

GPT-4o mini vs every other model on price

Models ranked by blended price against GPT-4o mini
modelblended / 1Mvs GPT-4o mini
Gemini 1.5 FlashGoogle (Vertex)$0.1350% cheaper
Claude OpusAnthropic$30.00114× the price
Claude SonnetAnthropic$6.0023× the price
GPT-4oOpenAI$4.3817× the price
Gemini 1.5 ProGoogle (Vertex)$2.198.3× the price
Claude HaikuAnthropic$1.606.1× the price

cheaper than GPT-4o mini · public list prices as of 2026-06 · estimates, not quotes

when_it_fits

When GPT-4o mini is the right call

OpenAI's small, low-cost model for high-volume, latency-sensitive work.

On blended price, GPT-4o mini is mid-pack: 1 compared here are cheaper and 5 cost more. Whether its rate is worth paying comes down to fit. A model that resolves a task in fewer attempts or shorter prompts can come out ahead of a cheaper one that needs retries, so price it on your own evaluation set and token mix rather than the per-million rate alone.

faq

Questions & answers

How much does GPT-4o mini cost?
GPT-4o mini is $0.15 per million input tokens and $0.60 per million output tokens, at public list price (2026-06). At a typical 3:1 input-to-output mix that blends to about $0.26 per million tokens.
What is a cheaper alternative to GPT-4o mini?
The cheapest option compared here is Gemini 1.5 Flash at about $0.13 per million blended, roughly 50% less than GPT-4o mini. Whether it fits depends on your task: a cheaper model that needs retries or longer prompts can cost more in practice.
How much does GPT-4o mini cost to run an AI agent per month?
On a representative agent workload (2,000 requests a day, 3 tool calls, RAG on), GPT-4o mini comes to about $274 a month. That includes the agentic context tax, where every tool call re-sends the conversation and you pay for it again. Your real figure depends on volume and tokens, so run it in the calculator.

A model's list price is not its bill.

Prompt caching, context trimming, and the right tier per task usually move an LLM bill more than the model choice does. Book a call, or leave your email and I'll reach out.

Book a call

No spam. You'll get a reply from me.

Prefer proof first? See how this plays out in real case studies →