
llm-d Joins CNCF - Kubernetes Gets a Native LLM Inference Stack
IBM Research, Red Hat, and Google Cloud donated llm-d to the CNCF at KubeCon EU, giving Kubernetes a production-grade distributed LLM inference framework built on vLLM.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

IBM Research, Red Hat, and Google Cloud donated llm-d to the CNCF at KubeCon EU, giving Kubernetes a production-grade distributed LLM inference framework built on vLLM.

The 15-person startup that launched an H100 GPU into space last November just became the fastest Y Combinator company ever to reach unicorn status.

Mistral AI secures $830M in debt financing from seven banks to build a 13,800-GPU Nvidia GB300 cluster near Paris, targeting 200MW of European compute by 2027.

NVIDIA's DSX Flex library and Emerald AI's Conductor platform let AI factories ramp GPU power up or down in seconds, unlocking faster grid connections and up to 100GW of new U.S. capacity.

Microsoft signs a deal with Crusoe for a new 900 MW AI factory campus in Abilene, Texas, adjacent to the Stargate site Oracle and OpenAI walked away from three weeks ago.

A developer's guide to migrating from AWS Bedrock to Azure OpenAI Service, covering SDK changes, model mapping, pricing differences, and authentication gotchas.

At its Arm Everywhere event in San Francisco, Arm unveiled the AGI CPU - a 136-core data center processor co-developed with Meta and the company's first owned silicon product in its 35-year history.

An exclusive TechCrunch tour of Amazon's Trainium chip lab reveals how AWS is training Claude for Anthropic and now holds an $138B commitment from OpenAI.

Meta signs a five-year, $27 billion AI infrastructure agreement with Nebius Group, marking one of the largest cloud computing contracts ever announced.

AI training crawlers from Anthropic, Google, and OVHcloud knocked a French national research institute's GitLab server offline for hours - nobody at CNRS's Institute of Complex Systems could work.

At BlackRock's Infrastructure Summit, Sam Altman laid out OpenAI's vision of selling AI compute by the token like a metered utility, backed by a $110 billion war chest and the Stargate buildout.

Oracle's Q3 FY2026 results show $553B in contract backlog - up 325% year-over-year - as AI infrastructure demand continues to outpace supply.