Gemini 1.5 Flash API Pricing

Gemini 1.5 Flash costs $0.08 per million input tokens and $0.30 per million output tokens at list price. Below is what that comes to at volume and how it compares with 6 other models: it ranks 1st of 7 by blended price, the cheapest of the set.

input: $0.08 per 1M tokens
output: $0.30 per 1M tokens
blended: $0.13 (3:1 input:output mix)
one chat request: $0.0003 (1.5k in / 600 out, no tools)
agent workload / month: $138 (2,000 req/day, 3 tool calls, RAG on)

Price Gemini 1.5 Flash on your own workload: set your volume, tokens, tool calls, and RAG in the AI Agent Cost Calculator, prefilled with Gemini 1.5 Flash, and compare it against every other model.

Gemini 1.5 Flash vs every other model on price

Models ranked by blended price against Gemini 1.5 Flash

model            provider         blended / 1M   vs Gemini 1.5 Flash
Claude Opus      Anthropic        $30.00         229× the price
Claude Sonnet    Anthropic        $6.00          46× the price
GPT-4o           OpenAI           $4.38          33× the price
Gemini 1.5 Pro   Google (Vertex)  $2.19          17× the price
Claude Haiku     Anthropic        $1.60          12× the price
GPT-4o mini      OpenAI           $0.26          2.0× the price

Public list prices as of 2026-06. Estimates, not quotes.

When Gemini 1.5 Flash is the right call

Google's fastest, cheapest tier on Vertex, aimed at high-volume tasks.

On blended price, Gemini 1.5 Flash is the cheapest model compared here. Whether that low rate produces the lowest bill comes down to fit: a model that resolves a task in fewer attempts or with shorter prompts can come out ahead of a cheaper one that needs retries. Price it on your own evaluation set and token mix rather than on the per-million rate alone.
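The retry point can be made concrete with a small sketch. It assumes attempts are independent and repeated until one succeeds, so the expected number of attempts is 1 / success rate. The two models, their per-attempt costs, and their success rates are hypothetical placeholders, not measurements of any model on this page.

```python
def cost_per_resolved_task(cost_per_attempt: float, success_rate: float) -> float:
    """Expected spend to get one successful result.

    Assumes independent attempts retried until success, so the
    expected attempt count is 1 / success_rate.
    """
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in (0, 1]")
    return cost_per_attempt / success_rate


# Hypothetical numbers for illustration only.
cheap_model = cost_per_resolved_task(cost_per_attempt=0.0003, success_rate=0.40)
pricier_model = cost_per_resolved_task(cost_per_attempt=0.0006, success_rate=0.90)

print(f"cheap model:   ${cheap_model:.6f} per resolved task")
print(f"pricier model: ${pricier_model:.6f} per resolved task")
```

With these made-up inputs the model that costs twice as much per attempt ends up cheaper per resolved task. Swap in success rates from your own evaluation set to see which way it goes for your workload.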

Questions & answers

How much does Gemini 1.5 Flash cost?
Gemini 1.5 Flash costs $0.08 per million input tokens and $0.30 per million output tokens at public list price (as of 2026-06). At a typical 3:1 input-to-output mix, that blends to about $0.13 per million tokens.
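The blended figure and the per-request figure are both weighted sums of the two rates. The sketch below uses the rounded rates quoted on this page; the function names are illustrative. With those rounded inputs the 3:1 blend works out to about $0.135, slightly above the $0.13 shown, which may mean the page computes from unrounded list prices.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_parts: float = 3, output_parts: float = 1) -> float:
    """Weighted average price per 1M tokens for an input:output mix."""
    total_parts = input_parts + output_parts
    return (input_per_m * input_parts + output_per_m * output_parts) / total_parts


def request_cost(input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float) -> float:
    """Cost in dollars of one request at per-million-token rates."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


INPUT_PER_M = 0.08   # rounded rate as quoted on this page
OUTPUT_PER_M = 0.30

blended = blended_price(INPUT_PER_M, OUTPUT_PER_M)
chat = request_cost(1_500, 600, INPUT_PER_M, OUTPUT_PER_M)

print(f"blended (3:1): ${blended:.3f} per 1M tokens")
print(f"one chat request (1.5k in / 600 out): ${chat:.4f}")
```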
What is a cheaper alternative to Gemini 1.5 Flash?
Gemini 1.5 Flash is already the cheapest model compared here, so the question is usually whether a pricier model fits your task well enough to be worth the higher rate.
How much does Gemini 1.5 Flash cost to run an AI agent per month?
On a representative agent workload (2,000 requests a day, 3 tool calls, RAG on), Gemini 1.5 Flash comes to about $138 a month. That includes the agentic context tax, where every tool call re-sends the conversation and you pay for it again. Your real figure depends on volume and tokens, so run it in the calculator.
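The context tax described above can be sketched as a loop in which each tool round triggers another model call that re-sends everything accumulated so far. The token counts below (base prompt, RAG context, tool result size, output per call) and the 30-day month are assumptions for illustration, not the calculator's inputs, so the result is not expected to reproduce the $138 figure.

```python
def agent_request_cost(base_input: int, rag_tokens: int, tool_calls: int,
                       tool_result_tokens: int, output_per_call: int,
                       input_per_m: float, output_per_m: float) -> float:
    """Dollar cost of one agent request with re-sent context.

    Assumes one model call per tool round plus a final answer call,
    and that each call re-sends the full accumulated context.
    """
    context = base_input + rag_tokens
    total_in = 0
    total_out = 0
    for _ in range(tool_calls + 1):
        total_in += context            # the whole context is billed again
        total_out += output_per_call
        # The model's output and the tool result join the context
        # that the next call has to re-send.
        context += output_per_call + tool_result_tokens
    return (total_in * input_per_m + total_out * output_per_m) / 1_000_000


# Hypothetical workload shape; only the rates and request volume come from this page.
per_request = agent_request_cost(
    base_input=1_500, rag_tokens=2_000, tool_calls=3,
    tool_result_tokens=500, output_per_call=600,
    input_per_m=0.08, output_per_m=0.30,
)
monthly = per_request * 2_000 * 30  # 2,000 requests/day, assumed 30-day month

print(f"per agent request: ${per_request:.6f}")
print(f"per month:         ${monthly:.2f}")
```

Setting `tool_calls=0` reduces this to a single plain request, which makes it easy to see how much of the bill comes from re-sent context rather than from new tokens.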

A model's list price is not its bill.

Prompt caching, context trimming, and the right tier per task usually move an LLM bill more than the model choice does.