Question 1

How do I calculate AI API cost per request?

Accepted Answer

Multiply the number of input tokens by the input price per token, multiply the number of output tokens by the output price per token, then add the two results. For example, a request with 1,000 input tokens and 500 output tokens on GPT-5 ($1.25 input / $10.00 output per 1M tokens) costs: (1000 × $1.25/1M) + (500 × $10.00/1M) = $0.00125 + $0.005 = $0.00625 per request.
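The same formula can be written as a small helper. This is only a minimal sketch: the function name `cost_per_request` and its parameter names are made up for illustration, and prices are assumed to be quoted in USD per 1M tokens, as in the example above.

```python
def cost_per_request(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the USD cost of one request, given prices per 1M tokens."""
    input_cost = input_tokens * input_price_per_m / 1_000_000
    output_cost = output_tokens * output_price_per_m / 1_000_000
    return input_cost + output_cost


# GPT-5 figures from the answer above: 1,000 input + 500 output tokens.
gpt5_cost = cost_per_request(1_000, 500, 1.25, 10.00)
print(f"${gpt5_cost:.5f}")  # $0.00625
```

Keeping the input and output terms separate makes it easy to see which side dominates; here the 500 output tokens account for $0.005 of the $0.00625 total.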

Question 2

What is the cheapest AI API per request?

Accepted Answer

GPT-oss 20B is the cheapest at $0.08/$0.35 per 1M tokens (input/output): a typical request of 1,000 input and 500 output tokens costs $0.000255. DeepSeek V4 Flash ($0.14/$0.28) is next at $0.00028 per request. For comparison, GPT-5.5 Pro costs $0.12 for the same request, roughly 470x as much as GPT-oss 20B.
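To reproduce this kind of ranking, one option is to compute the per-request cost for each model and sort. The sketch below uses only prices quoted in this FAQ; the `PRICES` table and the helper names are illustrative, and the $0.12 figure for GPT-5.5 Pro is taken as given, since its per-token prices are not listed here.

```python
# (input, output) prices in USD per 1M tokens, as quoted in this FAQ.
PRICES = {
    "GPT-oss 20B": (0.08, 0.35),
    "DeepSeek V4 Flash": (0.14, 0.28),
    "Claude Haiku 4.5": (1.00, 5.00),
    "GPT-5": (1.25, 10.00),
    "Claude Sonnet 4.6": (3.00, 15.00),
}


def cost_per_request(input_tokens, output_tokens, prices):
    """Return the USD cost of one request for an (input, output) price pair."""
    input_price, output_price = prices
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Rank models from cheapest to most expensive for a 1,000-in / 500-out request.
ranked = sorted(
    ((name, cost_per_request(1_000, 500, p)) for name, p in PRICES.items()),
    key=lambda item: item[1],
)
for name, cost in ranked:
    print(f"{name:<20} ${cost:.6f}")

# Gap between the quoted GPT-5.5 Pro cost and the cheapest model.
gap = 0.12 / ranked[0][1]
print(f"GPT-5.5 Pro is about {gap:.0f}x the cheapest model")
```

The ranking depends on the input/output mix: a model with cheap input but expensive output can move up or down the list when the request shape changes, so it is worth rerunning with token counts typical of your own workload.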

Question 3

How much does a GPT-5 API request cost?

Accepted Answer

GPT-5 costs $1.25 per 1M input tokens and $10.00 per 1M output tokens. A typical request (1,000 input + 500 output tokens) costs approximately $0.00625. At 100 requests/day, that's about $0.63/day ($0.625 before rounding), or $18.75 over a 30-day month.
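Scaling a per-request cost up to a daily or monthly budget is a straight multiplication. A rough sketch follows, assuming a 30-day month and a constant request volume; the function name `projected_spend` is hypothetical.

```python
def projected_spend(cost_per_request, requests_per_day, days_per_month=30):
    """Return (daily, monthly) USD spend for a steady request volume."""
    daily = cost_per_request * requests_per_day
    monthly = daily * days_per_month
    return daily, monthly


# GPT-5 at $0.00625 per request, 100 requests per day.
daily, monthly = projected_spend(0.00625, 100)
print(f"${daily:.3f}/day, ${monthly:.2f}/month")  # $0.625/day, $18.75/month
```

Real traffic is rarely flat, so treat the result as a baseline estimate and not a forecast.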

Question 4

How much does a Claude API request cost?

Accepted Answer

Claude Sonnet 4.6 costs $3.00 per 1M input tokens and $15.00 per 1M output tokens. A typical request (1,000 input + 500 output tokens) costs approximately $0.0105. Claude Haiku 4.5 is cheaper at $1.00/$5.00 per 1M tokens; the same request costs $0.0035, one third of the Sonnet price.

Question 5

How do I reduce my AI API cost per request?

Accepted Answer

Five strategies: (1) Use cheaper models for simple tasks (DeepSeek V4 Flash at $0.00028/req vs GPT-5 at $0.00625/req). (2) Cache repeated prompts so identical requests are not recomputed. (3) Shorten system prompts to reduce input tokens. (4) Use structured output to get shorter, more precise responses. (5) Batch requests when possible.
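Strategy (2) can be sketched as an application-side cache keyed on the model and prompt. This is one possible reading of "cache repeated prompts" and not any provider's built-in caching feature; `ResponseCache` and `fake_model_call` are hypothetical names, and `fake_model_call` stands in for whatever client call you actually use.

```python
import hashlib


class ResponseCache:
    """Return a stored response when the same model/prompt pair repeats."""

    def __init__(self, call_model):
        self._call_model = call_model  # function(model, prompt) -> response
        self._store = {}
        self.hits = 0
        self.misses = 0

    def complete(self, model, prompt):
        # Hash model and prompt together so keys stay small and uniform.
        key = hashlib.sha256(f"{model}\x00{prompt}".encode("utf-8")).hexdigest()
        if key in self._store:
            self.hits += 1
            return self._store[key]
        self.misses += 1
        response = self._call_model(model, prompt)
        self._store[key] = response
        return response


def fake_model_call(model, prompt):
    # Placeholder for a real (billable) API call.
    return f"[{model}] echo: {prompt}"


cache = ResponseCache(fake_model_call)
first = cache.complete("some-model", "What is 2 + 2?")
second = cache.complete("some-model", "What is 2 + 2?")
print(cache.hits, cache.misses)  # 1 1
```

Only the first call reaches the model, so the second identical request costs nothing. This only helps when prompts repeat exactly and a stale answer is acceptable; an in-memory dict like this one also grows without bound, so a real deployment would need eviction or expiry.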

AI API Cost Per Request
