API Pricing Digest

TL;DR

3 new models — GPT-5.4 mini, GPT-5.4 Pro, and Gemini 3.1 Flash-Lite all launched this week
Claude 4 retired — Anthropic sunsets Claude 4; migration to Claude 4.5 or Sonnet 5 recommended
Pricing pressure continues — OpenAI's GPT-5.4 mini at $0.75/$4.50 per 1M tokens brings premium reasoning to budget tier
Flash sale — APIpulse Pro lifetime access $19 (reg $29) through Jul 12

🆕 New Models This Week

New

GPT-5.4 mini — OpenAI's new value champion

OpenAI launched GPT-5.4 mini at $0.75 input / $4.50 output per 1M tokens. Priced between GPT-4o mini and GPT-4o, it delivers significantly better reasoning than its predecessor while remaining OpenAI's most cost-effective new-gen model. For chatbots, content generation, and data extraction that need stronger reasoning than GPT-4o mini could offer.

New mid-budget option · View pricing & alternatives →

New

GPT-5.4 Pro — OpenAI's premium reasoning model

The new flagship from OpenAI. GPT-5.4 Pro targets complex reasoning, code generation, and agentic workflows at $30.00 input / $180.00 output per 1M tokens. Premium pricing positions it alongside Claude Opus 4.8 and GPT-5.5 Pro for the most demanding workloads.

Premium tier · View pricing & alternatives →

New

Gemini 3.1 Flash-Lite — Google's ultra-cheap option

Google's answer to the price war. Gemini 3.1 Flash-Lite at $0.25 input / $1.50 output per 1M tokens is one of the cheapest multimodal models from a major provider. Perfect for high-volume tasks like classification, routing, and simple Q&A where cost matters most.

▼ Budget tier · View pricing & alternatives →

⚠️ Deprecations & Retirements

Deprecation

Claude 4 officially retired by Anthropic

Anthropic has retired Claude 4 from its API. If you're still pointing at claude-4, you need to migrate. Claude 4.5 is the direct successor (same price, better performance). Sonnet 5 is the new mid-tier champion at $3/$15 per 1M tokens.

Action required if using claude-4 endpoint · See migration guide & alternatives →

📊 Pricing Trends

Trend

The price floor keeps dropping

Twelve months ago, the cheapest API model from a major provider was ~$0.15/1M input tokens. Today, GPT-oss 20B is at $0.08, Mistral Small at $0.10, and Gemini 2.5 Flash-Lite at $0.10. The "good enough for most tasks" price has nearly halved in a year. If you locked in pricing assumptions 6 months ago, you're overpaying.

▼ ~50% YoY on budget models · Full trend analysis →

Trend

Premium models holding steady — for now

While budget models race to the bottom, premium-tier pricing (GPT-5.4 Pro, Claude Opus 4.8, GPT-5.5 Pro) remains at $5-30 input / $25-180 output per 1M tokens. The gap between "cheap" and "best" is now 30-600x. This creates a clear optimization opportunity: route simple tasks to cheap models, reserve premium for complex reasoning.

Learn model routing strategies →

📚 Past Digests

Week of Jul 4, 2026 — GPT-5.4 launch, Claude 4 retirement, Gemini Flash-Lite

This is the first edition. More coming every Friday.

Track every price change automatically

APIpulse Pro monitors 49 models across 10 providers. Get alerts when prices drop — or when your costs are about to increase.

Get Pro for $19 (reg $29) →