
Nemotron 3 Nano Omni Unifies Vision, Audio, Language
NVIDIA's new open omni model activates 3B of 30B parameters, processes video, audio, and documents in one pass, and delivers up to 9.2x higher throughput than other open omni models.

AI Infrastructure & Open Source Reporter
Sophie is a journalist and former systems engineer who covers AI infrastructure, open-source models, and the developer tooling ecosystem. She spent three years as a site reliability engineer at a cloud provider in Seattle before moving into tech journalism, which gives her writing an unusual level of technical depth: she understands distributed systems, GPU clusters, and inference optimization from the inside.
She studied Computer Engineering at the University of British Columbia and later completed a science communication fellowship at MIT. Her engineering background means she can read a model card, spot a misleading benchmark, and explain why quantization matters, all in the same paragraph.
At Awesome Agents, Sophie covers AI infrastructure news: new model releases, open-source launches, developer tools, deployment trends, and the hardware that makes it all run. She has a soft spot for underdog open-source projects that punch above their weight and a sharp eye for when a "breakthrough" is really just better marketing.
Based in Seattle, WA.

Mistral releases Medium 3.5, a 128B open-weights model that scores 77.6% on SWE-Bench Verified, and pairs it with asynchronous cloud coding agents in Vibe that open pull requests on GitHub while you are away.

CVE-2026-25874 (CVSS 9.3) exposes LeRobot's gRPC server to unauthenticated remote code execution via pickle deserialization, threatening robot control systems and GPU infrastructure.

Microsoft drops exclusive OpenAI IP rights and ends its revenue-share payments as OpenAI gains freedom to deploy on Google Cloud, AWS, or any provider.

China's NDRC ordered Meta to reverse its $2B Manus acquisition and barred the startup's founders from leaving China, ending the 'Singapore washing' strategy that let Chinese AI firms dodge Beijing oversight.

Maine Governor Janet Mills killed LD 307, the first proposed statewide AI data center moratorium in the US, to protect a $550 million project in Jay.

Anthropic's Project Deal experiment found that agents running stronger models consistently closed better transactions, and users represented by weaker agents had no idea.

Google commits up to $40 billion to Anthropic alongside five gigawatts of cloud compute, making it the largest single infrastructure bet in AI history.

Anthropic's April 23 post-mortem confirms that three app-layer changes degraded Claude Code since early March; all were reverted in v2.1.116 by April 20.

Google Labs open-sourced DESIGN.md, a YAML-plus-markdown file that gives AI coding agents a brand's complete design system in one drop-in file. It now works with Claude Code, Cursor, and Copilot.

Google's Virgo Network connects 134K TPU chips at 47 petabits per second using a flat two-layer topology that removes the bandwidth degradation cluster operators have engineered around for years.

DeepSeek V4 has slipped three times since February. Jensen Huang called it a horrible outcome for America. Here is what is actually hard about running a trillion-parameter model on Huawei's CANN framework.