
OpenAI Files for IPO, Eyes $1 Trillion Valuation
OpenAI filed a confidential S-1 with the SEC on June 8, targeting a public debut above $1 trillion as early as September 2026, following Anthropic by one week.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

a16z-backed Orbital wants to run AI inference from low Earth orbit using NVIDIA Blackwell GPUs, targeting 10,000 satellites and 1 GW of compute at full scale.

OpenAI filed a confidential S-1 with the SEC on June 8, targeting a public debut above $1 trillion as early as September 2026, following Anthropic by one week.

PM Mark Carney's AI for All commits $2.3 billion to hit 250,000 new jobs and 60% business adoption by 2034 - but critics call it a wish list without hard delivery mechanisms.

Anthropic's Claude Opus 4.8 scores 69.2% on SWE-bench Pro and ships hundreds of parallel subagents in Claude Code, with pricing unchanged at $5 per million input tokens.

Three papers: strategic attack timing exposes gaps in AI control evaluations, Perplexity's agents slash task time by 87%, and Lean4 formal proofs make agent workflows more reliable.

Anthropic published internal data showing Claude writes 80% of its own codebase - and called for a coordinated global AI pause - four days after filing a $965B IPO.

Senator Bernie Sanders proposes seizing 50% of OpenAI, Anthropic, and xAI stock to fund a federal sovereign wealth fund with government board seats.

iOS 27 Beta 1 is live for developers today, shipping Apple's new Extensions framework that lets Gemini, Claude, and ChatGPT plug into Siri - plus the Nvidia B200 Confidential Computing architecture that keeps those cloud queries private.

MiniMax M3 uses sparse attention to cut long-context inference cost 20x, topping GPT-5.5 on coding benchmarks at a fraction of the price.

A practical beginner's guide to the five most accessible AI side hustles in 2026, with honest earnings estimates and a clear starting point for each.

A beginner's guide to using AI tools like Fathom, Otter.ai, Zoom AI, and Google Meet's Gemini to automatically capture meeting notes and follow-up tasks.

Learn how to use ChatGPT, Perplexity, Gemini, and Amazon's AI assistant to research products, compare prices, and spot fake reviews before you buy.

OpenAI's life sciences reasoning model gets a June update with global access and new NGS plugins - strong benchmarks, but still locked behind a Trusted Access Program with no public pricing.

MiniMax M3 arrives as the first open-weight model to combine frontier coding, 1M-token context, and native multimodality - at a fraction of proprietary pricing - but every benchmark figure is self-reported and the weights weren't even shipped at launch.

Claude Opus 4.8 sets new highs on SWE-bench Pro and long-context tasks while a 4x improvement in code flaw detection may matter more than any benchmark number.

Current rankings of the best AI image generation models, including GPT Image 2, Nano Banana 2, Recraft V4.1, HiDream-O1-Image, FLUX 2, Midjourney v8.1, and Ideogram 3.0, scored on human preference, text rendering, and photorealism.

Rankings of the best AI models and agent frameworks on the GAIA benchmark, which tests real-world multi-step tasks requiring web browsing, tool use, and multi-hop reasoning.

Rankings of AI models by cost efficiency in May 2026, comparing performance per dollar across frontier and budget models. Updated with DeepSeek V4, GPT-5.5, and Kimi K2.6.

Mistral AI's mid-tier open-weight edge model - 8B parameters, 256K context, Apache 2.0 license, built for agentic pipelines and cost-sensitive production workloads.

Mistral's open-weight coding agent model - 123B parameters, 256K context window, 72.2% on SWE-bench Verified, priced at $0.40/M input tokens.

Grok Build 0.1 is xAI's first model built specifically for agentic coding workflows, with a 256K context window, native MCP support, and always-on reasoning at $1/M input tokens.

A data-driven comparison of Alibaba's Qwen3.5-27B and Mistral's Small 3.2 - two Apache 2.0 dense models in the 24-27B range with very different benchmark profiles and deployment strengths.

A data-driven comparison of Alibaba's Qwen3.5-27B and Microsoft's Phi-4 - a 27B hybrid architecture versus a 14B STEM specialist, testing whether raw parameter count or training efficiency wins in practice.

Head-to-head comparison of Qwen3.5-35B-A3B and GLM-4.7-Flash - two Chinese-origin 30B-A3B MoE models with Apache 2.0/MIT licenses that dominate different benchmarks despite near-identical parameter budgets.

David vs Goliath: Qwen3.5-35B-A3B activates 3B parameters and beats Llama 4 Scout's 17B active on MMLU-Pro, GPQA, and coding benchmarks - but Scout's 10M context window and native multimodal support tell a different story.

A data-driven comparison of Alibaba's Qwen3.5-35B-A3B and NVIDIA's Nemotron 3 Nano 30B-A3B - two ~30B MoE models activating ~3B parameters that take fundamentally different architectural approaches to the same problem.

A detailed comparison of Qwen3.5-Flash and DeepSeek V3.2 API pricing, benchmarks, and tradeoffs - flat-rate simplicity versus cache-dependent discounts in the budget AI tier.

A data-driven comparison of Qwen3.5-Flash and Gemini 2.5 Flash-Lite - two models at the exact same $0.10/$0.40 per million token price point with 1M context windows but very different performance profiles.

A detailed comparison of Qwen3.5-Flash and GPT-4o mini covering benchmarks, pricing, context windows, and ecosystem - the new open-source challenger versus OpenAI's entrenched budget API.

Vercel releases the Chat SDK, a TypeScript library installable via 'npm i chat' that lets developers write chatbot logic once and deploy to Slack, Microsoft Teams, Google Chat, Discord, GitHub, and Linear. MIT licensed, AI-provider agnostic, now in public beta.

MiniMax M2.5 is a 230B MoE model (10B active) that scores 80.2% on SWE-Bench Verified while costing 1/10th to 1/20th of frontier competitors like Claude Opus 4.6 and GPT-5.2.

Alibaba releases four Qwen 3.5 medium models - Flash, 35B-A3B, 122B-A10B, and 27B - that match or beat the previous 235B flagship at a fraction of the compute. The 35B model activates just 3 billion parameters and still outperforms Qwen3-235B-A22B.

Qwen3.5-122B-A10B is a 122B-parameter MoE model activating 10B parameters per token, narrowing the gap between medium and frontier models with top scores in GPQA Diamond (86.6), MMMU (83.9), and OCRBench (92.1). Apache 2.0 licensed.