
OpenAI Files for IPO, Eyes $1 Trillion Valuation
OpenAI filed a confidential S-1 with the SEC on June 8, targeting a public debut above $1 trillion as early as September 2026, following Anthropic by one week.

PM Mark Carney's AI for All commits $2.3 billion to hit 250,000 new jobs and 60% business adoption by 2034 - but critics call it a wish list without hard delivery mechanisms.

Anthropic's Claude Opus 4.8 scores 69.2% on SWE-bench Pro and ships hundreds of parallel subagents in Claude Code, with pricing unchanged at $5 per million input tokens.

Three papers: strategic attack timing exposes gaps in AI control evaluations, Perplexity's agents slash task time by 87%, and Lean4 formal proofs make agent workflows more reliable.

Four days after filing for a $965B IPO, Anthropic published internal data showing Claude writes 80% of its own codebase and called for a coordinated global AI pause.

Senator Bernie Sanders proposes seizing 50% of OpenAI, Anthropic, and xAI stock to fund a federal sovereign wealth fund with government board seats.

iOS 27 Beta 1 is live for developers today, shipping Apple's new Extensions framework that lets Gemini, Claude, and ChatGPT plug into Siri - plus the Nvidia B200 Confidential Computing architecture that keeps those cloud queries private.

MiniMax M3 uses sparse attention to cut long-context inference cost 20x, topping GPT-5.5 on coding benchmarks at a fraction of the price.

Blackstone-backed AirTrunk pledges $30 billion and 5GW of AI data center capacity in India by 2030 - more than triple the country's current total installed base.

A beginner's guide to using AI tools like Fathom, Otter.ai, Zoom AI, and Google Meet's Gemini to automatically capture meeting notes and follow-up tasks.

Learn how to use ChatGPT, Perplexity, Gemini, and Amazon's AI assistant to research products, compare prices, and spot fake reviews before you buy.

A practical beginner's guide to using AI tools to write a stronger resume, craft tailored cover letters, and prepare confidently for job interviews.

OpenAI's life sciences reasoning model gets a June update with global access and new NGS plugins - strong benchmarks, but still locked behind a Trusted Access Program with no public pricing.

MiniMax M3 arrives as the first open-weight model to combine frontier coding, 1M-token context, and native multimodality - at a fraction of proprietary pricing - but every benchmark figure is self-reported and the weights weren't even shipped at launch.

Claude Opus 4.8 sets new highs on SWE-bench Pro and long-context tasks while a 4x improvement in code flaw detection may matter more than any benchmark number.

Current rankings of the best AI image generation models, including GPT Image 2, Nano Banana 2, Recraft V4.1, HiDream-O1-Image, FLUX 2, Midjourney v8.1, and Ideogram 3.0, scored on human preference, text rendering, and photorealism.

Rankings of the best AI models and agent frameworks on the GAIA benchmark, which tests real-world multi-step tasks requiring web browsing, tool use, and multi-hop reasoning.

Rankings of AI models by cost efficiency in May 2026, comparing performance per dollar across frontier and budget models. Updated with DeepSeek V4, GPT-5.5, and Kimi K2.6.

Mistral AI's mid-tier open-weight edge model - 8B parameters, 256K context, Apache 2.0 license, built for agentic pipelines and cost-sensitive production workloads.

Mistral's open-weight coding agent model - 123B parameters, 256K context window, 72.2% on SWE-bench Verified, priced at $0.40/M input tokens.

Grok Build 0.1 is xAI's first model built specifically for agentic coding workflows, with a 256K context window, native MCP support, and always-on reasoning at $1/M input tokens.

Comparing Kimi K2.5 and GPT-4o mini - Moonshot AI's trillion-parameter frontier model with agent swarms against OpenAI's most widely deployed budget model.

Benchmark comparison of Kimi K2.5 and GPT-5.3 Codex - Moonshot AI's open-weight trillion-parameter MoE versus OpenAI's premium agentic coding model.

A detailed comparison of Kimi K2.5 and Llama 4 Maverick - two open-weight MoE models with radically different takes on the size, cost, and capability trade-off.

Comparing Kimi K2.5 and Llama 4 Scout - Moonshot AI's benchmark-crushing trillion-parameter model versus Meta's 10-million-token context window specialist.

Kimi K2.5 and MiniMax M2.5 compared side by side - two Chinese MoE models where the smaller, cheaper one actually wins on SWE-bench. A detailed analysis of when each model delivers more value.

Comparison of Kimi K2.5 and Mistral Large 3 - two large open-weight MoE models with 256K context, each representing a different vision for open AI.

Comparing Kimi K2.5 and Mistral Small 3.2 - Moonshot AI's trillion-parameter open-weight frontier model against Mistral's compact, EU-compliant function calling specialist.

Comparing Moonshot AI's trillion-parameter Kimi K2.5 with NVIDIA's Mamba2-MoE hybrid Nemotron 3 Nano 30B-A3B - frontier intelligence versus a model engineered for maximum throughput, 1M context, and 10x lower cost.

A detailed comparison of Moonshot AI's 1T-parameter Kimi K2.5 against Microsoft's 14B Phi-4 - the most extreme size gap in frontier AI, with 71x the parameters but vastly different use cases.

Comparing Kimi K2.5 and Qwen3.5 Flash - Moonshot AI's trillion-parameter frontier model against Alibaba's cheapest and fastest API offering.

Comparing Kimi K2.5's 1T-parameter benchmark dominance against Qwen3.5-122B-A10B's extraordinary parameter efficiency - and why the smaller model is harder to dismiss than the numbers suggest.

Comparing Kimi K2.5's trillion-parameter benchmark dominance against Qwen3.5-27B's single-GPU accessibility - two models from entirely different tiers that both have compelling use cases.