Inferras AI API Price Radar and Provider Directory
Buying guide

Cheapest LLM API Providers

The cheapest LLM API depends on model fit, input price, output price, source type, and the terms behind each provider page.

Low-cost LLM API discovery page.

2026-05-10/5 min read

TLDR

Sort by input price for long-document and classification workloads.

Sort by output price for chat, writing, and code generation workloads.

Keep source verification visible while comparing cheap options.

Who this is for

Teams trying to reduce model API spend.

Buyers building a provider shortlist.

Developers testing cheaper models for specific tasks.

Cheap depends on the workload

A low input price is useful, but it is not the whole story. If your application writes long answers, output price may dominate monthly spend.

The best low-cost option is the one that handles the task well enough without retries, extra prompts, or hidden operational work.

How to build a shortlist

Start with approved listings, sort by the price that matters for your workload, then check source links and model fit.

Avoid comparing providers only by brand name. Compare the exact model and the exact pricing unit.

Practical examples

For classification, start with low input price.

For support chat, compare output price and rate limits.

For coding, test quality before committing to volume.

Current low-input listings

The table below uses approved Inferras listings sorted by input price. It does not add prices beyond the live data already in Supabase.

Current low-input-price listings

Approved public-source AI API listings sorted by input price. Use source links to confirm the latest provider pricing.

ModelProviderProvider categoryRegionInput / 1MOutput / 1MSourceVerificationCheckedUpdatedView
Groq Llama 3.1 8B Instant 128k
GroqOfficial source
Official model ownerGlobal$0.05$0.08
officialSource
Source listed
May 10, 2026May 10, 2026View
Groq GPT OSS 20B 128k
GroqOfficial source
Official model ownerGlobal$0.075$0.30
officialSource
Source listed
May 10, 2026May 10, 2026View
DeepInfra Qwen3-32B
DeepInfraOfficial source
Official model ownerGlobal$0.08$0.28
officialSource
Source listed
May 10, 2026May 10, 2026View
DeepInfra Llama-3.3-70B-Instruct-Turbo
DeepInfraOfficial source
Official model ownerGlobal$0.10$0.32
officialSource
Source listed
May 10, 2026May 10, 2026View
Gemini 2.5 Flash-Lite
Google AI Gemini APIOfficial source
Official model ownerGlobal$0.10$0.40
officialSource
Source listed
May 10, 2026May 10, 2026View
Groq Llama 4 Scout 17Bx16E 128k
GroqOfficial source
Official model ownerGlobal$0.11$0.34
officialSource
Source listed
May 10, 2026May 10, 2026View
OpenAI: GPT-4o-mini
OpenRouterOfficial source
Official model ownerGlobal$0.15$0.60
marketplaceSource
Source listed
May 10, 2026May 10, 2026View
Gemini 2.5 Flash
Google AI Gemini APIOfficial source
Official model ownerGlobal$0.15$1.25
officialSource
Source listed
May 10, 2026May 10, 2026View

Verification indicates how confidently the listed price matches its public source.

Prices are collected from public provider pages and may change over time. Verify pricing, billing units, rate limits, and terms directly on the source page before purchase.

FAQ

cheapest LLM API

Does the cheapest LLM API mean the best provider?

No. Low price is only one signal. Review model fit, source links, terms, rate limits, support, and reliability needs.

Should I sort by input price or output price?

Sort by the side that dominates your workload. Long documents favor input price; chat and generation favor output price.

Why are some low-price listings still marked source listed?

That label means a public source is provided, but the exact price may still need manual confirmation before buying.

Where should I compare low-cost alternatives?

Use the cheapest AI API compare page and the main Price Radar for current approved public listings.

Source references

Related guides

2 likes

Leave a comment

Keep comments under 1000 characters.

Comments

I

May 10, 2026

Good