Gemini 1.5 Flash API Pricing
Gemini 1.5 Flash costs $0.08 per million input tokens and $0.30 per million output tokens at list price. Here is the cost at volume and how it compares with 6 other models: by blended price it ranks 1st of 7, the cheapest of the group.
Gemini 1.5 Flash vs every other model on price
| Model | Provider | Blended price / 1M tokens | vs Gemini 1.5 Flash |
|---|---|---|---|
| Claude Opus | Anthropic | $30.00 | 229× the price |
| Claude Sonnet | Anthropic | $6.00 | 46× the price |
| GPT-4o | OpenAI | $4.38 | 33× the price |
| Gemini 1.5 Pro | Google (Vertex) | $2.19 | 17× the price |
| Claude Haiku | Anthropic | $1.60 | 12× the price |
| GPT-4o mini | OpenAI | $0.26 | 2.0× the price |
No model in this comparison is cheaper than Gemini 1.5 Flash. Public list prices as of 2026-06; estimates, not quotes.
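The blended figures can be reproduced with a weighted average of the input and output rates. The sketch below assumes the 3:1 input-to-output mix this page uses for its blend. The function names are illustrative, and the $0.075 input rate is an assumption: it is the unrounded value that makes the multiples in the table come out, whereas the page itself displays $0.08.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output mix."""
    total_parts = input_parts + output_parts
    return (input_parts * input_per_m + output_parts * output_per_m) / total_parts


def price_multiple(other_blended: float, baseline_blended: float) -> float:
    """How many times more expensive another model is than the baseline."""
    return other_blended / baseline_blended


# Blend from the rounded rates shown on this page.
flash_rounded = blended_price(0.08, 0.30)

# Blend from an ASSUMED unrounded input rate of $0.075 (not stated on the page).
flash_assumed = blended_price(0.075, 0.30)

# Blended prices of the other models, copied from the table.
others = {
    "Claude Opus": 30.00,
    "Claude Sonnet": 6.00,
    "GPT-4o": 4.38,
    "Gemini 1.5 Pro": 2.19,
    "Claude Haiku": 1.60,
    "GPT-4o mini": 0.26,
}

for name, blended in others.items():
    multiple = price_multiple(blended, flash_assumed)
    print(f"{name}: {multiple:.1f}x the blended price of Gemini 1.5 Flash")
```

With the rounded $0.08 input rate the blend is slightly higher, so the multiples come out a little lower than the table's. Treat either set as an estimate.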
When Gemini 1.5 Flash is the right call
Google's fastest, cheapest tier on Vertex, aimed at high-volume tasks.
On blended price, Gemini 1.5 Flash is the cheapest model compared here. Whether the lowest rate produces the lowest bill comes down to fit: a model that resolves a task in fewer attempts or with shorter prompts can come out ahead of a cheaper one that needs retries. Price it on your own evaluation set and token mix rather than on the per-million rate alone.
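One way to make the retry argument concrete is to compare cost per resolved task rather than cost per attempt. The sketch below assumes each attempt succeeds independently with a fixed probability, so the expected number of attempts is 1 / success rate. The token counts, the success rates, and the second model's rates are hypothetical placeholders, not measurements.

```python
def cost_per_attempt(input_tokens: int, output_tokens: int,
                     input_per_m: float, output_per_m: float) -> float:
    """Dollar cost of a single model call at per-million-token rates."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


def cost_per_resolved_task(attempt_cost: float, success_rate: float) -> float:
    """Expected cost until the first success, assuming independent retries."""
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in (0, 1]")
    return attempt_cost / success_rate


# Gemini 1.5 Flash at the rates quoted on this page; token counts are hypothetical.
flash_attempt = cost_per_attempt(3_000, 1_000, input_per_m=0.08, output_per_m=0.30)

# A hypothetical pricier model; its rates are illustrative only.
pricier_attempt = cost_per_attempt(3_000, 1_000, input_per_m=0.15, output_per_m=0.60)

# Hypothetical success rates you would measure on your own evaluation set.
flash_task = cost_per_resolved_task(flash_attempt, success_rate=0.4)
pricier_task = cost_per_resolved_task(pricier_attempt, success_rate=0.9)

print(f"Flash:   ${flash_task:.6f} per resolved task")
print(f"Pricier: ${pricier_task:.6f} per resolved task")
```

Under these made-up numbers the pricier model wins per resolved task even though it costs roughly twice as much per attempt. With a higher Flash success rate the ordering flips, which is why the inputs should come from your own evaluation set.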
Questions & answers
- How much does Gemini 1.5 Flash cost?
- Gemini 1.5 Flash costs $0.08 per million input tokens and $0.30 per million output tokens at public list price (as of 2026-06). At a typical 3:1 input-to-output mix, that blends to about $0.13 per million tokens.
- What is a cheaper alternative to Gemini 1.5 Flash?
- Gemini 1.5 Flash is already the cheapest model compared here, so the question is usually whether a pricier model fits your task well enough to be worth the higher rate.
- How much does Gemini 1.5 Flash cost to run an AI agent per month?
- On a representative agent workload (2,000 requests a day, 3 tool calls per request, RAG enabled), Gemini 1.5 Flash comes to about $138 a month. That includes the agentic context tax: every tool call re-sends the conversation so far, and you pay for those input tokens again. Your real figure depends on request volume and tokens per request, so run it in the calculator.
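The agentic context tax mentioned in the last answer can be sketched as a loop in which each model call re-sends everything accumulated so far. The token counts below are hypothetical, and the page does not publish the ones behind its $138 estimate, so this illustrates the mechanism rather than reproducing that figure. It also assumes one model call per tool call plus a final answer call, and a 30-day month.

```python
def agent_request_tokens(base_context: int, tool_calls: int,
                         output_per_call: int, tool_result_tokens: int) -> tuple[int, int]:
    """Total (input, output) tokens for one agent request.

    Every model call re-sends the whole conversation so far, so the input
    grows with each tool call.
    """
    model_calls = tool_calls + 1          # one call per tool use, plus the final answer
    context = base_context                # system prompt + user message + RAG passages
    input_total = 0
    for _ in range(model_calls):
        input_total += context            # pay for the full context again
        context += output_per_call + tool_result_tokens
    output_total = model_calls * output_per_call
    return input_total, output_total


def monthly_cost(requests_per_day: int, input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float, days: int = 30) -> float:
    """Monthly dollar cost at per-million-token rates."""
    per_request = (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000
    return per_request * requests_per_day * days


# Hypothetical token counts; only the request volume, tool-call count,
# and rates come from this page.
inp, out = agent_request_tokens(base_context=2_000, tool_calls=3,
                                output_per_call=200, tool_result_tokens=500)
with_tax = monthly_cost(2_000, inp, out, input_per_m=0.08, output_per_m=0.30)

# The same request priced as if the base context were sent only once.
no_resend = monthly_cost(2_000, 2_000, out, input_per_m=0.08, output_per_m=0.30)

print(f"input tokens per request: {inp}, output tokens per request: {out}")
print(f"monthly cost with re-sent context: ${with_tax:.2f}")
print(f"monthly cost if context were sent once: ${no_resend:.2f}")
```

The gap between the two totals is the context tax. It grows with the number of tool calls and with the size of each tool result.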
A model's list price is not its bill.
Prompt caching, context trimming, and the right tier per task usually move an LLM bill more than the model choice does.