Sophie Zhang

AI Infrastructure & Open Source Reporter

Sophie is a journalist and former systems engineer who covers AI infrastructure, open-source models, and the developer tooling ecosystem. She spent three years as a site reliability engineer at a cloud provider in Seattle before transitioning to tech journalism, which gives her writing an unusual level of technical depth - she understands distributed systems, GPU clusters, and inference optimization from the inside.

She studied Computer Engineering at the University of British Columbia and later completed a science communication fellowship at MIT. Her engineering background means she can read a model card, spot a misleading benchmark, and explain why quantization matters - all in the same paragraph.

At Awesome Agents, Sophie covers AI infrastructure news: new model releases, open-source launches, developer tools, deployment trends, and the hardware that makes it all run. She has a soft spot for underdog open-source projects that punch above their weight and a sharp eye for when a "breakthrough" is really just better marketing.

Based in Seattle, WA.

Articles by Sophie Zhang

Nemotron 3 Nano Omni Unifies Vision, Audio, Language

Nemotron 3 Nano Omni Unifies Vision, Audio, Language

NVIDIA's new open omni model activates 3B of 30B parameters, processes video, audio, and documents in one pass, and delivers up to 9.2x higher throughput than other open omni models.

Mistral Ships Medium 3.5 With Cloud Coding Agents

Mistral Ships Medium 3.5 With Cloud Coding Agents

Mistral releases Medium 3.5, a 128B open-weights model that scores 77.6% on SWE-Bench Verified, and pairs it with asynchronous cloud coding agents in Vibe that open pull requests on GitHub while you are away.

Critical RCE in LeRobot Lets Attackers Hijack Robots

Critical RCE in LeRobot Lets Attackers Hijack Robots

CVE-2026-25874 (CVSS 9.3) exposes LeRobot's gRPC server to unauthenticated remote code execution via pickle deserialization, threatening robot control systems and GPU infrastructure.

OpenAI Breaks Azure Lock in Microsoft Deal Rewrite

OpenAI Breaks Azure Lock in Microsoft Deal Rewrite

Microsoft drops exclusive OpenAI IP rights and ends its revenue-share payments as OpenAI gains freedom to deploy on Google Cloud, AWS, or any provider.

China Blocks Meta's $2B Manus Deal - Founders Barred

China Blocks Meta's $2B Manus Deal - Founders Barred

China's NDRC ordered Meta to reverse its $2B Manus acquisition and barred the startup's founders from leaving China, ending the 'Singapore washing' strategy that let Chinese AI firms dodge Beijing oversight.

Maine Vetoes First US AI Data Center Moratorium

Maine Vetoes First US AI Data Center Moratorium

Maine Governor Janet Mills killed LD 307, the first proposed statewide AI data center moratorium in the US, to protect a $550 million project in Jay.

Stronger AI Agents Win More Deals - Users Never Know

Stronger AI Agents Win More Deals - Users Never Know

Anthropic's Project Deal experiment found that agents running stronger models consistently closed better transactions - and users represented by weaker agents had no idea.

Google Backs Anthropic With $40B and 5 Gigawatts

Google Backs Anthropic With $40B and 5 Gigawatts

Google commits up to $40 billion to Anthropic alongside five gigawatts of cloud compute, making it the largest single infrastructure bet in AI history.

Anthropic's Claude Code Post-Mortem: Three Bugs Fixed

Anthropic's Claude Code Post-Mortem: Three Bugs Fixed

Anthropic's April 23 post-mortem confirms three app-layer changes degraded Claude Code since early March - all reverted in v2.1.116 by April 20.

DESIGN.md Goes Open Source - AI Agents Get a Style Sheet

DESIGN.md Goes Open Source - AI Agents Get a Style Sheet

Google Labs open-sourced DESIGN.md, a YAML-plus-markdown file that gives AI coding agents a brand's complete design system in one drop-in file - now works with Claude Code, Cursor, and Copilot.

Google Virgo Network Ends the Datacenter Scaling Tax

Google Virgo Network Ends the Datacenter Scaling Tax

Google's Virgo Network connects 134K TPU chips at 47 petabits per second using a flat two-layer topology that removes the bandwidth degradation cluster operators have engineered around for years.

Inside DeepSeek V4's CANN Stack - Three Delays Explained

Inside DeepSeek V4's CANN Stack - Three Delays Explained

DeepSeek V4 has slipped three times since February. Jensen Huang called it a horrible outcome for America. Here is what is actually hard about running a trillion-parameter model on Huawei's CANN framework.