What is the cheapest AI API in 2026?

GPT-oss 20B is the cheapest at $0.08/$0.35 per million input/output tokens. For open-source, Llama 3.1 8B via Together.ai costs $0.10/$0.10. DeepSeek V4 Flash is $0.14/$0.28.

How much does GPT-5 cost per API call?

GPT-5 costs $1.25 per million input tokens and $10.00 per million output tokens. A typical 1000-token request costs about $0.00125 for input and $0.01 for output. GPT-5 mini is 5x cheaper at $0.25/$2.00.

How do I calculate my monthly AI API costs?

Multiply your daily requests by average tokens per request, then by the model's per-token price. Use the APIpulse calculator above to estimate costs across 48 models instantly — just enter your request volume and token usage.

Which AI provider is cheapest for API access?

DeepSeek and Google offer the cheapest models. DeepSeek V4 Flash ($0.14/$0.28) and Gemini 2.5 Flash-Lite ($0.10/$0.40) are the budget leaders. GPT-oss 20B ($0.08/$0.35) has the absolute cheapest input. For mid-tier, Mistral Large 3 ($0.50/$1.50) offers excellent value.

AI API Cost Calculator

Compare pricing across 48 models from 10 providers. Enter your usage below to see exactly what you'd pay — no signup required.

Typical request:

By volume:

API mode:

Provider & Model

Input tokens per request

Output tokens per request

Requests per day

Days per month

Cost Estimate

Cost per request $0.0000

Cost per 1K requests $0.00

Input cost (monthly) $0.00

Output cost (monthly) $0.00

Total tokens/month 0

Monthly Total $0.00

💡 Optimization Tips

🔄 Model Routing

Use cheaper models for 80% of tasks, premium only for complex reasoning…

Route simple queries to GPT-5 mini ($0.25/$2.00) and reserve Opus 4.8 ($5/$25) for complex tasks. Most teams save 60-70% with this strategy. Pro includes a personalized routing guide for your workload.

Try Pro Free for 24h →

📦 Batch Processing

Combine multiple API calls into batch requests to reduce overhead…

Batch API calls reduce per-request overhead by 30-50%. Group similar tasks, use async processing, and implement request queuing. Pro shows exact batch sizes for your usage pattern.

Try Pro Free for 24h →

💾 Response Caching

Cache common API responses to avoid redundant calls…

Implement semantic caching for repeated queries. Even 20% cache hit rate cuts costs significantly. Use embeddings-based similarity matching for fuzzy cache hits. Pro includes a caching implementation guide.

Try Pro Free for 24h →

These are just 3 of 12 strategies. Pro users save an average of 40% on API costs.

Stop Losing Money — $29

Share your cost card →

Based on published API pricing. Actual costs may vary.

See how prices have changed →

1-5 request types · 6-8 volume · C copy · ? shortcuts

How This Calculator Works

Enter your expected usage — tokens per request, requests per day, and days per month — and the calculator shows your estimated cost per request, cost per 1K requests, and monthly total for any of 48 LLM models across 10 providers. Use the "Typical request" presets to quickly see what common workloads cost. It also suggests the cheapest alternative if a cheaper model can handle your workload.

Supported Providers

OpenAI: GPT-5.5, GPT-5.5 Pro, GPT-5.3 Codex, GPT-5, GPT-5 mini, GPT-4o, GPT-4o mini, GPT-oss 120B/20B
Anthropic: Claude Opus 4.7, Claude 4 Opus, Claude Sonnet 4.6, Claude Sonnet 4.6, Claude Haiku 4.5
Google: Gemini 3.5 Flash, Gemini 3.1 Pro, Gemini 3 Flash, Gemini 2.5 Pro, Gemini 2.5 Flash-Lite
DeepSeek: V4 Pro, V4 Flash, V3
Mistral: Large 3, Small 4
Cohere: Command R+, Command R
Meta (Together.ai): Llama 4 Scout, Llama 4 Maverick, Llama 3.1 70B, Llama 3.1 8B
Moonshot: Kimi K2.6
xAI: Grok 4.3, Grok Build 0.1
AI21: Jamba 1.5 Large

Common Use Cases

Our calculator helps you estimate costs for:

Chatbots and virtual assistants — typically 500-2000 input tokens, 200-800 output tokens per turn
Code generation — 1000-5000 input tokens, 500-3000 output tokens per request
Document analysis — 3000-10000 input tokens, 200-1000 output tokens per document
Content generation — 500-2000 input tokens, 1000-4000 output tokens per article

Want to compare two models side by side?

Use the Comparison Tool

Tips to Reduce Your API Costs

The calculator often reveals that a cheaper model can handle your workload. Here are the biggest cost-saving strategies:

Use budget models for simple tasks: GPT-4o mini ($0.15/$0.60) handles 80% of chatbot requests at 1/17th the cost of GPT-4o
Optimize prompts: Shorter prompts = fewer input tokens = lower costs
Set token limits: Don't let models generate 4000 tokens when 200 will do
Cache responses: Identical prompts can be cached to eliminate redundant API calls

Read our full guide: How to Cut Your AI API Bill in Half: 10 Practical Tips

Related Tools

MCP Server — Get live pricing data in Claude Code, Cursor, and other AI tools (free)
🎯 AI API Advisor — Get a personalized model recommendation for your use case and budget
📊 2026 Pricing Benchmark — Download the full pricing report with 37× price gap analysis
Model Recommendation Engine — Answer 4 questions, get your top 3 models with scores and cost estimates
Migration Code Generator — Get copy-paste code to switch providers (Python, Node.js, curl)
Cost Health Check — 5-question assessment with personalized savings grade
Cost Explorer — See all 48 models ranked by cost for your usage
Token Cost Estimator — Quick cost lookup across all 48 models for any token count
Model Comparison — Side-by-side model comparison tool
Pricing Cheat Sheet — Printable reference with all model prices
Model Selector Quiz — Answer 5 questions to find the right model
API Cost Report Card — Grade your spending efficiency, get a shareable report
Pricing Badges — Embeddable SVG badges for 48 models, copy-paste into READMEs and docs
AI Stack Cost Optimizer — Find the cheapest model combination for your multi-feature app
Startup Cost Planner — Budget your AI API spend from pre-seed to Series A
Token Counter — Count tokens instantly and see costs across all 48 models
AI API ROI Calculator — Calculate your return on AI investment and find savings

Get weekly AI pricing updates & cost-saving tips

Join 200+ developers optimizing their AI API spend. No spam, unsubscribe anytime.

Provider Calculators

Cohere Cost Calculator — Command R+ & Command R pricing
Moonshot Cost Calculator — Kimi K2.6 pricing
Together.ai Cost Calculator — Llama 4 & Llama 3.1 pricing

Related Tools

📊 Live Pricing Dashboard AI Feature Cost Estimator AI Project Budget Planner API Cost Report Card

Stop guessing — get exact costs for every model

Pro gives you 42-model comparison, migration code snippets, PDF reports, and personalized optimization tips.

Get Pro — $29 lifetime

✅ 14-day money-back guarantee · ⚡ Instant access · 🔒 One-time payment

AI API Cost Calculator

Cost Estimate

💡 Optimization Tips

Stop overpaying — save $0/mo

How This Calculator Works

Supported Providers

Common Use Cases

Tips to Reduce Your API Costs

Related Tools

Provider Calculators

Related Reading

Related Tools

Stop guessing — get exact costs for every model

AI API Cost Calculator

Cost Estimate

💡 Optimization Tips

Stop overpaying — save $0/mo

Keyboard Shortcuts

How This Calculator Works

Supported Providers

Common Use Cases

Tips to Reduce Your API Costs

Related Tools

Provider Calculators

Related Reading

Related Tools

Stop guessing — get exact costs for every model