
LLM API Pricing Comparison - June 2026
Verified June 8: Ministral 3B cheapest at $0.04/MTok, DeepSeek V4 Flash best value at $0.14, Claude Opus 4.8 Fast Mode cut to $10/$50, Mistral Large 3 corrected to $0.50/$1.50.

DeepSeek's first external funding round is nearing completion at a valuation of up to $59B, with Tencent and EV battery giant CATL as the biggest outside investors.

China's government has extended travel restrictions to AI researchers at Alibaba, DeepSeek, and other private firms, requiring official approval before they can leave the country.

DeepSeek makes its 75% V4-Pro discount permanent, leaving output tokens 34x cheaper than GPT-5.5 and resetting market expectations for enterprise API costs.

Q2 2026 AI API pricing review: DeepSeek V4 hits the API, GPT-5.5 launches at $5/1M, and overall token costs are down 60-80% year-over-year - but a hidden tokenizer change at Anthropic quietly raised effective prices.

The state of open-source large language models in 2026 - who leads, how close they are to proprietary models, which licenses allow commercial use, and how to access them.

The best LLM APIs under $1 per million input tokens in 2026 - comparing Gemini Flash, DeepSeek V4 Flash, GPT-4.1 Nano, Mistral Small, Qwen3, and Claude Haiku on price and quality.

A practical comparison of every production LLM with a 1M+ token context window - verified pricing, real retrieval notes, and clear picks for different workloads.

Chinese AI providers now handle over 60% of all tokens routed through OpenRouter, up from less than 2% just a year ago.

Seven Claude alternatives compared on API cost, context window, coding performance, and data privacy - from GPT-5.5 and Gemini to open-weight options like Kimi K2.6 and Llama 4.

Eight ChatGPT alternatives compared on pricing, context limits, and real-world performance - from Claude and Gemini to DeepSeek and self-hosted setups.

Updated May 2026: DeepSeek V4-Flash reasoning now $0.28/MTok output (8x cheaper than R1), o3-pro launched at $20/$80, Grok 4 retires May 15 - verified pricing across 11 models.
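
Price pairs such as $20/$80 are easier to compare once they are turned into a per-request cost. The sketch below assumes the pair means USD per million input tokens and USD per million output tokens; the function name and the token counts are hypothetical and chosen only for illustration.

```python
from decimal import Decimal

TOKENS_PER_MTOK = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: str, output_price: str) -> Decimal:
    """Return the USD cost of one request, given per-MTok prices as strings."""
    cost_in = Decimal(input_tokens) / TOKENS_PER_MTOK * Decimal(input_price)
    cost_out = Decimal(output_tokens) / TOKENS_PER_MTOK * Decimal(output_price)
    return cost_in + cost_out


# Hypothetical workload: 12,000 input tokens and 1,500 output tokens,
# priced at the $20/$80 pair quoted above (assumed input/output per MTok).
cost = request_cost(12_000, 1_500, "20", "80")
print(f"${cost:.4f}")  # prints "$0.3600"
```

Output tokens are a small share of this hypothetical request but account for a third of its cost, which is why the output price often matters more than the headline input price.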