AI Tools

Anthropic Claude API Pricing 2026: Token Rates, Plans, and Full Cost Breakdown

Complete Anthropic Claude API pricing for 2026 -- exact token rates for Claude Opus 4.7, Sonnet 4.6, and Haiku 4.5, prompt caching savings, Batch API discounts, and all subscription plan costs from Free to Enterprise.

Victor Ogonyo

·2026-05-25·15 min read

If you are looking for Anthropic Claude API pricing in 2026, this guide covers everything: exact token rates for all three Claude model families, the 2026 price cuts that changed the value proposition, prompt caching savings of up to 90%, Batch API discounts, and every consumer subscription plan from Free to Enterprise.

Whether you are comparing Claude AI pricing for a subscription or evaluating the Claude API for a production application, the numbers below are current as of May 2026.

Claude Pricing at a Glance

Product	Price	Best For
Claude Free	$0/month	Casual use, testing
Claude Pro	$20/month	Power users, professionals
Claude Max 5x	$100/month	Heavy daily users
Claude Max 20x	$200/month	Extremely high-volume users
Team Standard	$25/seat/month (min 5)	Business teams
Team Premium	$100/seat/month (min 5)	Engineering teams with Claude Code
Enterprise	Custom pricing	Large organisations
Claude API	Pay per token	Developers building applications

Anthropic Claude API Pricing 2026: Exact Token Rates

The Anthropic API (formerly Claude API) gives developers pay-per-token access to all Claude models. Pricing is per million tokens, with input and output billed at different rates.

Claude API Rates (Per Million Tokens)

Model	Input	Output	Cache Read	Cache Write
Claude Haiku 4.5	$1.00	$5.00	$0.10	$1.25
Claude Sonnet 4.6	$3.00	$15.00	$0.30	$3.75
Claude Opus 4.7	$5.00	$25.00	$0.50	$6.25

What Each Claude API Model Is For

Claude Haiku 4.5 at $1.00/$5.00 per million tokens is Claude's fastest and most cost-efficient model. Best suited for high-volume tasks where reasoning depth is less critical: summarisation, classification, structured data extraction, customer support routing, and simple Q&A. At $1.00/M input, running 1,000 short customer support interactions costs approximately $2–$10 total.

Claude Sonnet 4.6 at $3.00/$15.00 is the optimal balance model for most production applications. It handles complex writing, multi-step reasoning, code generation, and detailed analysis — tasks where Haiku falls short — at a lower cost than Opus. Most teams building AI products default to Sonnet 4.6 as their primary model.

Claude Opus 4.7 at $5.00/$25.00 is Anthropic's most capable model and the right choice for genuinely hard problems: complex multi-step reasoning, advanced code generation, nuanced analysis with many constraints, and tasks where maximum intelligence is the bottleneck rather than throughput or cost. Following the February 2026 price reduction, Opus 4.7 is now accessible for production use cases where it was previously too expensive.

Practical Cost Examples

Processing a 10-page document (5,000 tokens) with Sonnet 4.6: $0.015
Generating a 500-word article (700 output tokens) with Opus 4.7: $0.0175
Running 1,000 customer support responses with Haiku 4.5 (2M total tokens): $2–$10
Processing a 100,000-token codebase with Sonnet 4.6: $0.30 input cost

Claude API Prompt Caching: Up to 90% Savings

Prompt caching is the most impactful cost reduction available on the Anthropic Claude API. If your application sends the same large system prompt, document, or knowledge base on every request, you pay the full input rate only once — subsequent uses of the cached content cost 10% of the input rate.

How prompt caching works:

First request: full input token rate + cache write surcharge
Subsequent requests using cached content: cache read rate (10% of input price)
Cache TTL: 5 minutes, reset with each use

Cache read rates vs full input rates:

Model	Full Input	Cache Read	Savings
Haiku 4.5	$1.00/M	$0.10/M	90%
Sonnet 4.6	$3.00/M	$0.30/M	90%
Opus 4.7	$5.00/M	$0.50/M	90%

Real-world example: A customer service application with a 10,000-token system prompt running 500 requests per day:

Without caching: 10,000 × 500 × $3.00/M = $15.00/day in system prompt tokens alone
With caching: $3.75 (one-time write) + 499 × $0.30/M × 10,000 = $1.50/day
Result: 90% reduction, saving $13.50/day or ~$405/month

For any API application with a repeated large context, prompt caching is non-negotiable.

Claude Batch API: 50% Discount on Non-Urgent Requests

The Anthropic Batch API processes requests asynchronously and returns results within 24 hours. In exchange for the wait, you receive a 50% discount on both input and output tokens.

Batch API Rates (Per Million Tokens)

Model	Batch Input	Batch Output
Haiku 4.5	$0.50	$2.50
Sonnet 4.6	$1.50	$7.50
Opus 4.7	$2.50	$12.50

Best use cases for Batch API:

Processing large document libraries or archives
Nightly data analysis pipelines
Generating product descriptions or metadata at scale
Running evaluation datasets for AI research and testing
Any workload where results are not needed in real time

At $1.50/$7.50 for Sonnet 4.6 via Batch API, it becomes cheaper than GPT-4o ($2.50/$10.00) at list price for equivalent-tier workloads.

Claude Subscription Plans: Full Breakdown

Claude Free

Claude's free tier gives access to Claude Sonnet with rolling usage limits. No credit card required, no trial expiry — free indefinitely.

Free plan includes:

Claude Sonnet access (not Opus 4.7)
Rolling daily usage limits (significantly lower than paid plans)
Basic Projects functionality
No Claude Code
No extended thinking
No priority access during peak hours

The free plan covers occasional use, short conversations, and evaluating Claude before subscribing. For regular daily professional use, limits become a bottleneck.

Claude Pro — $20/Month

Pro is the right plan for most individual power users and professionals.

Claude Pro includes, over free:

5x the usage volume of the free tier (rolling)
Access to all Claude models including Claude Opus 4.7
Claude Code — file creation, code execution, and full agentic task capability
Unlimited Projects
Google Workspace integration
Remote MCP (Model Context Protocol) connectors
Extended thinking mode
Priority access during high-traffic periods
Voice mode

At $20/month, Claude Pro matches ChatGPT Plus ($20/month) in price and includes Claude Code — a significant capability advantage for developers that ChatGPT Plus does not directly replicate.

Limitation: Anthropic does not publish exact message counts. Limits vary by model — heavy Opus 4.7 usage depletes your allowance faster than Sonnet or Haiku.

Claude Max — $100/Month and $200/Month

Max plans are for users who regularly hit Pro limits.

Max 5x — $100/month:

5x the usage of Pro (25x free tier)
All Pro features
Same models, same features — more headroom

Max 20x — $200/month:

20x the usage of Pro (100x free tier)
All Pro and Max 5x features
Designed for users running Claude as their primary work tool for multiple hours daily, or developers running extended Claude Code agentic sessions

Both Max plans support 100K+ token context windows. As of March 2026, the 1M token context window is included at no surcharge for Max plan users — previously a premium add-on.

Claude Team Plans

Team plans add collaboration, admin controls, and higher per-seat limits.

Team Standard — $25/seat/month (minimum 5 seats):

All Pro features per seat
Admin dashboard and usage controls
SSO (Single Sign-On) support
Domain capture for automatic team onboarding
Slack, Microsoft 365, and Google Workspace integrations
Centralised billing
Does not include Claude Code

Team Premium — $100/seat/month (minimum 5 seats):

All Team Standard features
Full Claude Code access per seat
6.25x Max 5x headroom per session
Priority support

Minimum 5-seat requirement: Team Standard costs $125/month minimum; Team Premium costs $500/month minimum.

Claude Enterprise

Enterprise is custom-priced via direct contract with Anthropic.

Enterprise adds over Team:

500K token context window (no surcharge as of March 2026)
1M token context window available
HIPAA readiness for healthcare organisations
Custom compliance and security controls
Dedicated account management
Custom rate limits negotiated per organisation
Advanced audit logging and data residency options

Enterprise targets healthcare companies (HIPAA), financial services (data residency), and large organisations running Claude at significant API volume with contractual data handling requirements.

The 2026 Claude Price Cuts

Two changes in early 2026 substantially improved Anthropic Claude API pricing:

Opus 4.7 price reduction (67% cut, February 2026): Opus 4.7 was previously priced at $15 input / $75 output per million tokens. The February 2026 cut to $5/$25 made frontier-model quality economically viable for production applications. At the old price, Opus usage at any real volume was cost-prohibitive for most companies.

1M token context window at no surcharge (March 2026): The 1M token context window was previously a premium add-on. As of March 2026, it is included at no extra cost for Enterprise and Max plan users — making Claude significantly more capable for processing entire codebases, legal document libraries, or research corpora in a single prompt.

Claude API vs GPT-4o vs Grok API: Price Comparison

Low-Cost Models

Model	Input (per 1M)	Output (per 1M)
GPT-4o Mini	$0.15	$0.60
Grok 4.1 Fast	$0.20	$0.50
Grok 3 Mini	$0.30	$0.50
Claude Haiku 4.5	$1.00	$5.00

Claude Haiku 4.5 is more expensive than GPT-4o Mini and Grok's fast-tier models at list price, but is significantly more capable for tasks requiring nuanced language understanding.

Frontier Models

Model	Input (per 1M)	Output (per 1M)
Grok 4.3	$1.25	$2.50
GPT-4o	$2.50	$10.00
Claude Sonnet 4.6	$3.00	$15.00
Claude Opus 4.7	$5.00	$25.00

At list prices, Claude Sonnet 4.6 and Opus 4.7 are more expensive than GPT-4o and Grok 4.3. However:

Prompt caching on Sonnet 4.6 brings the effective input cost to $0.30/M for repeated context — cheaper than GPT-4o at $2.50/M
Batch API on Sonnet 4.6 at $1.50/$7.50 is cheaper than GPT-4o at list price
Claude's benchmark performance on reasoning, writing, and coding tasks frequently exceeds GPT-4o, making the cost-per-quality comparison more favourable than raw token prices suggest

The best model for a given application depends on the task, not just the price.

Which Claude Plan Should You Choose?

Choose Claude Free if:

You use AI a few times per week for non-critical tasks
You are evaluating Claude before committing to a subscription
Your tasks are short and do not require Opus-level reasoning

Choose Claude Pro ($20/month) if:

You use Claude daily for professional work
You need Claude Code for software development or agentic tasks
You need Opus 4.7 access for complex reasoning
You regularly work with long documents or large codebases

Choose Claude Max 5x ($100/month) if:

You regularly hit Pro usage limits mid-day
You run long Claude Code agentic sessions
You need 100K+ context windows consistently

Choose Claude Max 20x ($200/month) if:

Claude is your primary work tool for 6+ hours daily
You run complex multi-step agentic tasks every day
You are a developer or researcher at the frontier of AI-assisted work

Choose Claude Team ($25–$100/seat/month) if:

You are buying for a team of 5+ and need admin controls and SSO
You need centralised billing and usage visibility
Your team uses Claude as a core work tool

Choose the Claude API if:

You are building an application that uses Claude programmatically
You need per-token control over model selection and cost
Your usage is variable and token pricing beats a flat subscription

Tips to Reduce Claude API Costs

Match model to task. Haiku 4.5 handles summarisation, classification, and simple Q&A. Sonnet 4.6 covers complex writing, reasoning, and code. Reserve Opus 4.7 for genuinely hard problems. Defaulting to Opus on every request is the most common and expensive mistake.

Implement prompt caching. Any application with a repeated system prompt or knowledge base should use prompt caching — the 90% savings on cached tokens is the highest-ROI optimisation available.

Use Batch API for overnight workloads. The 50% discount compounds significantly at scale. Any pipeline that does not need real-time results should go through the Batch API.

Write concise system prompts. Every token in your system prompt is billed on every non-cached request. Cutting a 2,000-token system prompt to 500 tokens reduces that cost by 75% at scale.

Frequently Asked Questions

What is Claude API pricing in 2026? Claude Haiku 4.5 costs $1.00/$5.00 per million input/output tokens. Claude Sonnet 4.6 costs $3.00/$15.00. Claude Opus 4.7 costs $5.00/$25.00. Prompt caching reduces effective input costs by up to 90%.

How much does Claude Pro cost? Claude Pro costs $20/month. It includes access to Claude Opus 4.7, Claude Code, extended thinking, and 5x the message volume of the free tier.

Is Claude API cheaper than OpenAI? At list prices, Claude Sonnet 4.6 ($3.00/$15.00) is more expensive than GPT-4o ($2.50/$10.00). With prompt caching, the effective Sonnet 4.6 cost for repeated-context applications drops to $0.30/M input — below GPT-4o list price. Grok 4.3 ($1.25/$2.50) is cheaper than both at list price.

What is prompt caching and how much does it save? Prompt caching lets you reuse a cached system prompt or document across multiple API calls. Subsequent uses cost 10% of the standard input rate — a 90% saving on those tokens.

What is the Batch API discount? The Anthropic Batch API processes requests asynchronously (results within 24 hours) at 50% off both input and output token rates. Sonnet 4.6 drops from $3.00/$15.00 to $1.50/$7.50.

What context window does Claude support? Claude Opus 4.7, Sonnet 4.6, and Haiku 4.5 all support 100K+ token context windows via API. Enterprise and Max plan users have access to 500K and 1M token context windows at no surcharge as of March 2026.

How does Claude compare to Grok API for developers? Grok 4.3 ($1.25/$2.50) has lower list prices than Claude Sonnet 4.6 ($3.00/$15.00). Claude's advantage is prompt caching (reduces effective cost to $0.30/M input for repeated context), stronger benchmark performance on complex reasoning tasks, and Claude Code for agentic development workflows.

The Bottom Line

Anthropic Claude API pricing in 2026 is tiered to match different workloads. The February 2026 price cut made Opus 4.7 viable for production. Prompt caching and Batch API discounts make Sonnet 4.6 cost-competitive with GPT-4o for applications with repeated context. The subscription plans — especially Claude Pro at $20/month — are competitively priced and include Claude Code, a meaningful capability advantage for developers.

The core decision: subscription for predictable monthly cost and individual or team use; API for programmatic access with per-token control. For developers, always prototype with the Batch API enabled and prompt caching implemented before comparing costs with competitors.

Building an AI startup? List it on Startup Launch Page and reach investors and early adopters who are actively looking for what you're building.

Building something great?

List your startup on Startup Launch Page -- reach real investors, founders, and early adopters.

Launch your startup →

← Back to Blog