
OpenAI Files for IPO, Eyes $1 Trillion Valuation
OpenAI filed a confidential S-1 with the SEC on June 8, targeting a public debut above $1 trillion as early as September 2026, following Anthropic by one week.

New research reveals MCP error messages triple agent attack success rates, ranks eight models on sycophancy with Claude scoring best, and finds self-evolving agents make 30-42% false edits.

Miasma worm planted config files that auto-execute credential theft when developers open Microsoft Azure repos in Claude Code, Gemini CLI, Cursor, or VS Code.

a16z-backed Orbital wants to run AI inference from low Earth orbit using NVIDIA Blackwell GPUs, targeting 10,000 satellites and 1 GW of compute at full scale.

PM Mark Carney's AI for All commits $2.3 billion to hit 250,000 new jobs and 60% business adoption by 2034 - but critics call it a wish list without hard delivery mechanisms.

Anthropic's Claude Opus 4.8 scores 69.2% on SWE-bench Pro and ships hundreds of parallel subagents in Claude Code, with pricing unchanged at $5 per million input tokens.

Three papers: strategic attack timing exposes gaps in AI control evaluations, Perplexity's agents slash task time by 87%, and Lean4 formal proofs make agent workflows more reliable.

Anthropic published internal data showing Claude writes 80% of its own codebase - and called for a coordinated global AI pause - four days after filing a $965B IPO.

Senator Bernie Sanders proposes seizing 50% of OpenAI, Anthropic, and xAI stock to fund a federal sovereign wealth fund with government board seats.

A practical beginner's guide to the five most accessible AI side hustles in 2026, with honest earnings estimates and a clear starting point for each.

A beginner's guide to using AI tools like Fathom, Otter.ai, Zoom AI, and Google Meet's Gemini to automatically capture meeting notes and follow-up tasks.

Learn how to use ChatGPT, Perplexity, Gemini, and Amazon's AI assistant to research products, compare prices, and spot fake reviews before you buy.

OpenAI's life sciences reasoning model gets a June update with global access and new NGS plugins - strong benchmarks, but still locked behind a Trusted Access Program with no public pricing.

MiniMax M3 arrives as the first open-weight model to combine frontier coding, 1M-token context, and native multimodality - at a fraction of proprietary pricing - but every benchmark figure is self-reported and the weights weren't even shipped at launch.

Claude Opus 4.8 sets new highs on SWE-bench Pro and long-context tasks while a 4x improvement in code flaw detection may matter more than any benchmark number.

Current rankings of the best AI image generation models, including GPT Image 2, Nano Banana 2, Recraft V4.1, HiDream-O1-Image, FLUX 2, Midjourney v8.1, and Ideogram 3.0, scored on human preference, text rendering, and photorealism.

Rankings of the best AI models and agent frameworks on the GAIA benchmark, which tests real-world multi-step tasks requiring web browsing, tool use, and multi-hop reasoning.

Rankings of AI models by cost efficiency in May 2026, comparing performance per dollar across frontier and budget models. Updated with DeepSeek V4, GPT-5.5, and Kimi K2.6.

Microsoft's first in-house coding model, a 137B sparse MoE built natively for GitHub Copilot, beats Claude Haiku 4.5 on SWE-bench Pro by 16 points.

Mistral AI's mid-tier open-weight edge model - 8B parameters, 256K context, Apache 2.0 license, built for agentic pipelines and cost-sensitive production workloads.

Mistral's open-weight coding agent model - 123B parameters, 256K context window, 72.2% on SWE-bench Verified, priced at $0.40/M input tokens.

Amazon exposes a Russian-speaking hacker who used ARXON (an MCP server feeding data to Claude and DeepSeek) and CHECKER2 to breach 600+ FortiGate firewalls across 55 countries in five weeks - no zero-days required.

YC-backed startup Guide Labs releases Steerling-8B under Apache 2.0 - an 8.4B parameter model with a built-in concept module that traces every output token back to its training data.

Anthropic accuses three Chinese AI labs of industrial-scale distillation attacks using 24,000 fraudulent accounts and 16 million exchanges with Claude. MiniMax ran the largest operation at 13 million exchanges. None of the three companies has responded.

OpenAI projects $600 billion in compute spending through 2030, with NVIDIA investing $30 billion and a potential $1 trillion IPO. But inference costs quadrupled in 2025 and the Stargate project faces internal disputes.

IBM suffers its worst single-day drop in years after Anthropic announces Claude Code can automate COBOL modernization. Combined with Trump's 15% global tariff and DOGE contract cancellations, IBM is down 26% in February.

Four days after launch, Gemini 3.1 Pro's benchmark-topping performance is overshadowed by 90-hour lockouts for paying subscribers, quota draining while idle, and tool-calling bugs that break LangChain, n8n, and RooCode. Developers are switching to Claude.

Google's Gemini 3 Deep Think trades speed for depth, delivering record-breaking reasoning benchmarks - but at a steep price.

Defense Secretary Hegseth gives Anthropic CEO Dario Amodei an ultimatum: lift Claude's military restrictions or face blacklisting from the entire US defense supply chain.

From pirated libraries to destroyed books to ancient manuscripts, AI companies have consumed millions of copyrighted works and are now approaching the limits of available human text. Here is what they used, what they stole, and what they are looking for next.

Alibaba's Qwen team releases Qwen3.5-397B-A17B, an open-weight mixture-of-experts model with native multimodal support and a hybrid attention architecture that runs 8x faster than its predecessor. Apache 2.0 licensed.

A Microsoft 365 Copilot bug (CW1226324) let the AI summarize emails with sensitivity labels in Sent Items and Drafts, bypassing DLP policies for two weeks. The NHS was affected. It's the second time in eight months.

OpenAI is finalizing the largest private funding round in history, with Amazon, SoftBank, NVIDIA, and Microsoft committing over $100 billion at a valuation exceeding $850 billion.