
Nebius Buys Eigen AI for $643M to Own Inference
Nebius agrees to acquire 20-person MIT inference startup Eigen AI for $643M, betting that maximizing the tokens served per Nvidia chip is the real moat in the AI infrastructure race.

Bernstein projects Nvidia's share of China's AI chip market will fall from 66% to 8% in 2026 while Huawei targets $12B in revenue, with ByteDance alone committing $5.6B in Ascend 950PR orders.

The Pentagon signed AI agreements with eight tech companies for its most classified military networks, pointedly excluding Anthropic even as courts battle over its blacklist status.

NVIDIA's new open omni model activates 3B of 30B parameters, processes video, audio, and documents in one pass, and delivers up to 9.2x higher throughput than other open omni models.

NVIDIA's first open omni-modal model: 30B total / 3B active hybrid Mamba-MoE that processes text, images, audio, and video in a single inference loop, with 9x higher throughput than comparable open omni models.

David Silver, creator of AlphaGo and AlphaZero, closed a $1.1B seed round for Ineffable Intelligence - a London lab building AI that learns without human data.

Five leading AI drug discovery platforms compared - AlphaFold 3, IsoDDE, Recursion OS, Insilico Pharma.AI, and NVIDIA BioNeMo. Access, capabilities, pricing, and clinical results.

Vast Data closes a $1B Series F at $30B valuation - triple its 2023 price - with NVIDIA, Drive Capital, and Access Industries backing its push to own the data layer for AI infrastructure.

Google's Virgo Network connects 134K TPU chips at 47 petabits per second using a flat two-layer topology that removes the bandwidth degradation cluster operators have engineered around for years.

DeepSeek V4 has slipped three times since February. Jensen Huang called it a horrible outcome for America. Here is what is actually hard about running a trillion-parameter model on Huawei's CANN framework.

Mira Murati's AI lab signs a single-digit-billion deal with Google Cloud for GB300 chip access, its first cloud provider commitment, as frontier labs race to lock in next-gen compute.

Google splits its next TPU generation across Broadcom, MediaTek, Marvell, and Intel to win inference economics, revealed ahead of Cloud Next 2026.