AI API Cost Calculator
Compare pricing across 33 models from 10 providers. Enter your usage below to see exactly what you'd pay โ no signup required.
Cost Estimate
Based on published API pricing. Actual costs may vary.
See how prices have changed โKeyboard Shortcuts
How This Calculator Works
Enter your expected usage โ tokens per request, requests per day, and days per month โ and the calculator shows your estimated cost per request, cost per 1K requests, and monthly total for any of 33 LLM models across 10 providers. Use the "Typical request" presets to quickly see what common workloads cost. It also suggests the cheapest alternative if a cheaper model can handle your workload.
Supported Providers
- OpenAI: GPT-5.5, GPT-5.5 Pro, GPT-5.3 Codex, GPT-5, GPT-5 mini, GPT-4o, GPT-4o mini, GPT-oss 120B/20B
- Anthropic: Claude Opus 4.7, Claude 4 Opus, Claude Sonnet 4.6, Claude Sonnet 4, Claude Haiku 4.5
- Google: Gemini 3.1 Pro, Gemini 2.5 Pro, Gemini 2.0 Flash
- DeepSeek: V4 Pro, V4 Flash, V3
- Mistral: Large 3, Small 4
- Cohere: Command R+, Command R
- Meta (Together.ai): Llama 4 Scout, Llama 4 Maverick, Llama 3.1 70B, Llama 3.1 8B
- Moonshot: Kimi K2.6
- xAI: Grok 3, Grok 3 Mini
- AI21: Jamba 1.5 Large
Common Use Cases
Our calculator helps you estimate costs for:
- Chatbots and virtual assistants โ typically 500-2000 input tokens, 200-800 output tokens per turn
- Code generation โ 1000-5000 input tokens, 500-3000 output tokens per request
- Document analysis โ 3000-10000 input tokens, 200-1000 output tokens per document
- Content generation โ 500-2000 input tokens, 1000-4000 output tokens per article
Want to compare two models side by side?
Use the Comparison ToolTips to Reduce Your API Costs
The calculator often reveals that a cheaper model can handle your workload. Here are the biggest cost-saving strategies:
- Use budget models for simple tasks: GPT-4o mini ($0.15/$0.60) handles 80% of chatbot requests at 1/17th the cost of GPT-4o
- Optimize prompts: Shorter prompts = fewer input tokens = lower costs
- Set token limits: Don't let models generate 4000 tokens when 200 will do
- Cache responses: Identical prompts can be cached to eliminate redundant API calls
Read our full guide: How to Cut Your AI API Bill in Half: 10 Practical Tips
Related Tools
- Cost Explorer โ See all 33 models ranked by cost for your usage
- Token Cost Estimator โ Quick cost lookup across all 33 models for any token count
- Model Comparison โ Side-by-side model comparison tool
- Pricing Cheat Sheet โ Printable reference with all model prices
- Model Selector Quiz โ Answer 5 questions to find the right model
Related Reading
- AI API Cost Per Request โ The metric developers actually need for budgeting LLM costs
- The Complete Guide to LLM Cost Optimization โ Strategies to cut your API spend by 40%+
- Cheapest LLM APIs in 2026 โ Full ranking of every model by price
- LLM Pricing Cheat Sheet โ Quick reference for all 33 models
- How to Estimate Your AI API Costs โ Step-by-step cost planning guide
- AI API Cost Comparison Tool โ Compare 33 models side by side to find the cheapest option
- May 2026 Pricing Shakeup โ Latest price changes across providers