Together.ai API Cost Calculator

Estimate your Together.ai spend across Llama 4 Scout, Llama 4 Maverick, Llama 3.1 70B, and Llama 3.1 8B. See cost per request, per 1K requests, and monthly totals. Open-source models with managed inference.

Typical request:
By volume:

Cost Estimate

Input cost per request $0.0000
Output cost per request $0.0000
Total cost per request $0.0000
Cost per 1,000 requests $0.00
Daily cost $0.00
Monthly cost $0.00
Annual cost $0.00

All Together.ai Models — Cost Comparison

See how your costs compare across all available models with your current settings

Cheaper Alternatives from Other Providers

These models from other providers offer similar capabilities at lower prices:

Model Provider Input/1M Output/1M Your Cost/Req Savings vs Selected

Together.ai API Pricing Explained

Together.ai provides managed inference for open-source models, giving you the cost advantages of models like Llama 4 without managing GPU infrastructure. Llama 4 Scout ($0.11/$0.34 per 1M tokens) is the cheapest option with a massive 10M context window. Llama 4 Maverick ($0.20/$0.60) offers improved quality. Llama 3.1 70B ($0.88/$0.88) delivers strong performance for complex tasks.

When to Use Each Model

Together.ai vs Competitors

Together.ai's biggest advantage is open-source model access without infrastructure management. Llama 4 Scout ($0.11/$0.34) is 91% cheaper than GPT-5 for input tokens. For teams that want the flexibility of open-source models with the convenience of a managed API, Together.ai offers the best of both worlds.

How to Reduce Your Together.ai Costs

Together.ai Free Tier

Together.ai offers $5 in free credits for new accounts. This is enough for approximately 45M input tokens on Llama 4 Scout or 5.7M input tokens on Llama 3.1 70B. Great for prototyping and evaluation.

Related Tools

Want to compare Together.ai with other providers?

Compare All Models →