Claude Opus API Pricing
Claude Opus costs $15.00 per million input tokens and $75.00 per million output tokens at list price. Below is what that comes to at volume and how it compares with six other models: by blended price it ranks 7th of 7, the most expensive.
Claude Opus vs every other model on price
| model | provider | blended price / 1M tokens | vs Claude Opus |
|---|---|---|---|
| Gemini 1.5 Flash | Google (Vertex) | $0.13 | 99% cheaper |
| GPT-4o mini | OpenAI | $0.26 | 99% cheaper |
| Claude Haiku | Anthropic | $1.60 | 95% cheaper |
| Gemini 1.5 Pro | Google (Vertex) | $2.19 | 93% cheaper |
| GPT-4o | OpenAI | $4.38 | 85% cheaper |
| Claude Sonnet | Anthropic | $6.00 | 80% cheaper |
All six models are cheaper than Claude Opus. Public list prices as of 2026-06; estimates, not quotes.
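The blended figures can be sanity-checked with a few lines of arithmetic. The sketch below assumes that "blended" means a token-weighted average at the 3:1 input-to-output mix this page cites in its FAQ; the function names are illustrative, and the page does not state how it rounds the percentage column, so small differences from the table are possible.

```python
def blended_price(input_per_m, output_per_m, input_parts=3, output_parts=1):
    """Token-weighted average price per million tokens.

    Assumes a fixed input:output mix (3:1 by default, as cited in the FAQ).
    """
    total_parts = input_parts + output_parts
    return (input_per_m * input_parts + output_per_m * output_parts) / total_parts


def percent_cheaper(other_blended, baseline_blended):
    """How much cheaper `other_blended` is than `baseline_blended`, in percent."""
    return (1 - other_blended / baseline_blended) * 100


# Claude Opus list prices from this page: $15.00 input, $75.00 output.
opus = blended_price(15.00, 75.00)
print(f"Claude Opus blended: ${opus:.2f} per 1M tokens")

# Blended price for Claude Sonnet as listed in the table above.
sonnet = 6.00
print(f"Claude Sonnet: {percent_cheaper(sonnet, opus):.0f}% cheaper")
```

Changing `input_parts` and `output_parts` to match your own traffic is the quickest way to see how far the ranking depends on the assumed mix.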
When Claude Opus is the right call
Anthropic's most capable and most expensive tier, for the hardest reasoning.
On blended price, Claude Opus is the most expensive model compared here. Whether its rate is worth paying comes down to fit. A model that resolves a task in fewer attempts or shorter prompts can come out ahead of a cheaper one that needs retries, so price it on your own evaluation set and token mix rather than the per-million rate alone.
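One way to make that comparison concrete is to price each model per resolved task rather than per token. The sketch below is a simplification under stated assumptions: attempts are retried independently until one succeeds, so the expected number of attempts is 1 divided by the success rate. The token counts and success rates are hypothetical placeholders, not measurements; only the blended prices come from this page.

```python
def cost_per_resolved_task(tokens_per_attempt, blended_per_m, success_rate):
    """Expected spend to resolve one task.

    Assumes independent retries until success, so the expected number
    of attempts is 1 / success_rate.
    """
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in (0, 1]")
    cost_per_attempt = tokens_per_attempt * blended_per_m / 1_000_000
    return cost_per_attempt / success_rate


# Hypothetical inputs: replace with figures from your own evaluation set.
opus = cost_per_resolved_task(tokens_per_attempt=4_000, blended_per_m=30.00, success_rate=0.9)
sonnet = cost_per_resolved_task(tokens_per_attempt=12_000, blended_per_m=6.00, success_rate=0.5)

print(f"Claude Opus:   ${opus:.3f} per resolved task")
print(f"Claude Sonnet: ${sonnet:.3f} per resolved task")
```

With these made-up inputs the cheaper model needs three times the tokens and succeeds half the time, which is enough to push its cost per resolved task (about $0.144) above the more expensive model's (about $0.133). Real workloads may point the other way, which is why the inputs should come from measurement.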
Questions & answers
- How much does Claude Opus cost?
- Claude Opus is $15.00 per million input tokens and $75.00 per million output tokens at public list price (as of 2026-06). At a typical 3:1 input-to-output mix, that blends to about $30.00 per million tokens.
- What is a cheaper alternative to Claude Opus?
- The cheapest option compared here is Gemini 1.5 Flash at about $0.13 per million blended, roughly 99% less than Claude Opus. Whether it fits depends on your task: a cheaper model that needs retries or longer prompts can cost more in practice.
- How much does Claude Opus cost to run an AI agent per month?
- On a representative agent workload (2,000 requests a day, 3 tool calls, RAG on), Claude Opus comes to about $29,342 a month. That figure includes the agentic context tax: every tool call re-sends the conversation so far, and you pay for those input tokens again. Your real figure depends on request volume and token counts, so run your own numbers in the calculator.
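The agentic context tax in the last answer can be sketched in a few lines. This is a simplified model, not the page's calculator: it assumes each tool call triggers one more model invocation that re-sends the whole conversation, and the token counts (`base_context`, `growth_per_call`, `output_per_step`) are hypothetical. Because the page does not state the inputs behind its $29,342 figure, this sketch is not expected to reproduce it.

```python
def agent_request_tokens(base_context, growth_per_call, output_per_step, tool_calls):
    """Total input and output tokens for one agent request.

    Assumes tool_calls + 1 model invocations, each re-sending the full
    conversation, which grows by `growth_per_call` tokens per tool call.
    """
    steps = tool_calls + 1
    input_tokens = sum(base_context + i * growth_per_call for i in range(steps))
    output_tokens = steps * output_per_step
    return input_tokens, output_tokens


def monthly_cost(requests_per_day, input_tokens, output_tokens,
                 input_per_m=15.00, output_per_m=75.00, days=30):
    """Monthly spend at Claude Opus list prices (defaults taken from this page)."""
    per_request = (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000
    return requests_per_day * days * per_request


# 2,000 requests a day and 3 tool calls come from the FAQ; token counts are hypothetical.
inp, out = agent_request_tokens(base_context=3_000, growth_per_call=800,
                                output_per_step=300, tool_calls=3)
print(f"Input tokens per request:  {inp}")
print(f"Output tokens per request: {out}")
print(f"Estimated monthly cost:    ${monthly_cost(2_000, inp, out):,.0f}")
```

In this model the base context is billed once per invocation, so input tokens grow faster than the number of tool calls. That is why trimming context or caching the shared prefix can matter as much as the per-million rate.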
A model's list price is not its bill.
Prompt caching, context trimming, and the right tier per task usually move an LLM bill more than the model choice does.