Inferras
Buying guide

Cheapest LLM API Providers

Which LLM API is cheapest for you depends on model fit, input price, output price, source type, and each provider's terms.

2026-05-10 / 5 min read

TLDR

Sort by input price for long-document and classification workloads.

Sort by output price for chat, writing, and code generation workloads.

Check each listing's source and verification status while comparing cheap options.

Who this is for

Teams trying to reduce model API spend.

Buyers building a provider shortlist.

Developers testing cheaper models for specific tasks.

Cheap depends on the workload

A low input price is useful, but it is not the whole story. If your application writes long answers, output price may dominate monthly spend.
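
To see how output price can dominate, here is a minimal Python sketch that splits one month of spend into its input and output parts. The token volumes are hypothetical; the prices are the Gemini 2.5 Flash figures from the listings table on this page, and `monthly_cost` is an illustrative helper, not part of any provider SDK.

```python
def monthly_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return (input_cost, output_cost) in USD for the given token volumes.

    Prices are assumed to be quoted in USD per 1M tokens.
    """
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost, output_cost


# Hypothetical chat workload: 200M input tokens and 100M output tokens per month.
# Prices: $0.15 input and $1.25 output per 1M tokens (Gemini 2.5 Flash listing).
input_cost, output_cost = monthly_cost(200_000_000, 100_000_000, 0.15, 1.25)
total = input_cost + output_cost

print(f"input ${input_cost:.2f}, output ${output_cost:.2f}, total ${total:.2f}")
print(f"output share of spend: {output_cost / total:.0%}")
```

With these assumed volumes the workload sends twice as many input tokens as output tokens, yet output accounts for roughly four fifths of the bill.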

The best low-cost option is the one that handles the task well enough without retries, extra prompts, or hidden operational work.
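
One way to account for retries is to compare cost per successful task instead of cost per call. The sketch below assumes each attempt succeeds independently with a fixed probability and is retried until it succeeds, so the expected number of attempts is 1 / success rate. The option names, prices, and success rates are all hypothetical and would need to come from your own testing.

```python
def cost_per_attempt(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """USD cost of a single call, with prices quoted per 1M tokens."""
    return (input_tokens * input_price_per_m + output_tokens * output_price_per_m) / 1_000_000


def effective_cost(attempt_cost, success_rate):
    """Expected USD cost per successful task, assuming independent retries until success."""
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in (0, 1]")
    return attempt_cost / success_rate


# Hypothetical task: 2,000 input tokens and 500 output tokens per attempt.
# Option A is cheaper per call but fails often; option B costs more but rarely retries.
option_a = effective_cost(cost_per_attempt(2_000, 500, 0.05, 0.08), success_rate=0.30)
option_b = effective_cost(cost_per_attempt(2_000, 500, 0.10, 0.32), success_rate=0.95)

print(f"option A: ${option_a:.6f} per successful task")
print(f"option B: ${option_b:.6f} per successful task")
```

Under these made-up numbers the option with the lower list price ends up costing more per successful task. The model ignores latency and engineering time, which usually make retries even less attractive.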

How to build a shortlist

Start with approved listings, sort by the price that matters for your workload, then check source links and model fit.

Avoid comparing providers only by brand name. Compare the exact model and the exact pricing unit.
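
The shortlist steps above can be sketched in Python as follows. The `Listing` fields, the `unit_tokens` convention, and the sample rows are assumptions made for illustration; they are not the Inferras data schema or real prices. The point is to normalize every price to the same unit before sorting.

```python
from dataclasses import dataclass


@dataclass
class Listing:
    model: str
    provider: str
    approved: bool
    input_price: float   # USD per `unit_tokens` input tokens
    output_price: float  # USD per `unit_tokens` output tokens
    unit_tokens: int     # tokens covered by the quoted price, e.g. 1_000 or 1_000_000


def per_million(price, unit_tokens):
    """Convert a price quoted per `unit_tokens` tokens to USD per 1M tokens."""
    return price * 1_000_000 / unit_tokens


def shortlist(listings, sort_by="input", limit=5):
    """Keep approved listings and sort them by normalized input or output price."""
    if sort_by not in ("input", "output"):
        raise ValueError("sort_by must be 'input' or 'output'")

    def key(listing):
        price = listing.input_price if sort_by == "input" else listing.output_price
        return per_million(price, listing.unit_tokens)

    approved = [listing for listing in listings if listing.approved]
    return sorted(approved, key=key)[:limit]


# Hypothetical rows: model-a is quoted per 1K tokens, the others per 1M tokens.
rows = [
    Listing("model-a", "provider-x", True, 0.0002, 0.0006, 1_000),
    Listing("model-b", "provider-y", True, 0.10, 0.90, 1_000_000),
    Listing("model-c", "provider-z", False, 0.01, 0.02, 1_000_000),
]

for listing in shortlist(rows, sort_by="input"):
    print(listing.model, listing.provider)
```

Without the unit conversion, model-a would look far cheaper than model-b even though its normalized input price is about twice as high.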

Practical examples

For classification, start with low input price.

For support chat, compare output price and rate limits.

For coding, test quality before committing to volume.
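
As a rough illustration of the examples above, the sketch below estimates cost per 1,000 requests for two workload profiles across a few rows from the listings table on this page. The per-request token counts are hypothetical, and the calculation covers price only; it says nothing about quality or rate limits.

```python
# (model, input USD per 1M tokens, output USD per 1M tokens), copied from the listings table.
LISTINGS = [
    ("Groq Llama 3.1 8B Instant 128k", 0.05, 0.08),
    ("Groq GPT OSS 20B 128k", 0.075, 0.30),
    ("Gemini 2.5 Flash", 0.15, 1.25),
    ("OpenAI: GPT-4o-mini (OpenRouter)", 0.15, 0.60),
]

# Hypothetical tokens per request: (input, output).
PROFILES = {
    "classification": (3_000, 10),
    "support chat": (500, 400),
}


def cost_per_1k_requests(input_tokens, output_tokens, input_price, output_price):
    """USD cost of 1,000 requests, with prices quoted per 1M tokens."""
    per_request = (input_tokens * input_price + output_tokens * output_price) / 1_000_000
    return per_request * 1_000


def rank(profile):
    """Return (model, cost) pairs for a profile, cheapest first."""
    input_tokens, output_tokens = PROFILES[profile]
    costs = [
        (name, cost_per_1k_requests(input_tokens, output_tokens, in_price, out_price))
        for name, in_price, out_price in LISTINGS
    ]
    return sorted(costs, key=lambda pair: pair[1])


for profile in PROFILES:
    print(profile)
    for name, cost in rank(profile):
        print(f"  {name}: ${cost:.4f} per 1,000 requests")
```

Gemini 2.5 Flash and GPT-4o-mini share the same listed input price, so they land close together for the classification profile and move well apart for the chat profile, where output tokens carry more of the cost.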

Current low-input listings

The table below uses approved Inferras listings sorted by input price. It does not add prices beyond the live data already in Supabase.

Current low-input-price listings

Approved public-source AI API listings sorted by input price. Use source links to confirm the latest provider pricing.

| Model | Provider | Region | Input / 1M tokens | Output / 1M tokens | Latency | Uptime | Source | Verification | Checked | Updated |
|---|---|---|---|---|---|---|---|---|---|---|
| Groq Llama 3.1 8B Instant 128k | Groq (Verified) | Global | $0.05 | $0.08 | N/A | N/A | official | Source listed | May 10, 2026 | May 10, 2026 |
| Groq GPT OSS 20B 128k | Groq (Verified) | Global | $0.075 | $0.30 | N/A | N/A | official | Source listed | May 10, 2026 | May 10, 2026 |
| DeepInfra Qwen3-32B | DeepInfra (Verified) | Global | $0.08 | $0.28 | N/A | N/A | official | Source listed | May 10, 2026 | May 10, 2026 |
| DeepInfra Llama-3.3-70B-Instruct-Turbo | DeepInfra (Verified) | Global | $0.10 | $0.32 | N/A | N/A | official | Source listed | May 10, 2026 | May 10, 2026 |
| Gemini 2.5 Flash-Lite | Google AI Gemini API (Verified) | Global | $0.10 | $0.40 | N/A | N/A | official | Source listed | May 10, 2026 | May 10, 2026 |
| Groq Llama 4 Scout 17Bx16E 128k | Groq (Verified) | Global | $0.11 | $0.34 | N/A | N/A | official | Source listed | May 10, 2026 | May 10, 2026 |
| Gemini 2.5 Flash | Google AI Gemini API (Verified) | Global | $0.15 | $1.25 | N/A | N/A | official | Source listed | May 10, 2026 | May 10, 2026 |
| OpenAI: GPT-4o-mini | OpenRouter (Verified) | Global | $0.15 | $0.60 | N/A | N/A | marketplace | Source listed | May 10, 2026 | May 10, 2026 |

Verification indicates how confidently the listed price matches its public source.

Prices are collected from public provider pages and may change over time. Use the source links to confirm the latest provider pricing.

FAQ

Does the cheapest LLM API mean the best provider?

No. Low price is only one signal. Review model fit, source links, terms, rate limits, support, and reliability needs.

Should I sort by input price or output price?

Sort by the side that dominates your workload. Long documents favor input price; chat and generation favor output price.

Why are some low-price listings still marked "Source listed"?

That label means a public source is provided, but the exact price may still need manual confirmation before buying.

Where should I compare low-cost alternatives?

Use the cheapest AI API compare page and the main Price Radar for current approved public listings.
