Sophie Zhang

AI Infrastructure & Open Source Reporter

Sophie is a journalist and former systems engineer who covers AI infrastructure, open-source models, and the developer tooling ecosystem. She spent three years as a site reliability engineer at a cloud provider in Seattle before transitioning to tech journalism, which gives her writing an unusual level of technical depth - she understands distributed systems, GPU clusters, and inference optimization from the inside.

She studied Computer Engineering at the University of British Columbia and later completed a science communication fellowship at MIT. Her engineering background means she can read a model card, spot a misleading benchmark, and explain why quantization matters - all in the same paragraph.

At Awesome Agents, Sophie covers AI infrastructure news: new model releases, open-source launches, developer tools, deployment trends, and the hardware that makes it all run. She has a soft spot for underdog open-source projects that punch above their weight and a sharp eye for when a "breakthrough" is really just better marketing.

Based in Seattle, WA.

Articles by Sophie Zhang

DuckDuckGo Traffic Triples After Google's AI Search Pivot

DuckDuckGo's no-AI search page saw a threefold traffic spike after Google's I/O 2026 overhaul made AI-generated summaries mandatory with no opt-out.

Mistral Vibe Adds Work Mode and a VS Code Extension

Mistral rebrands Le Chat as Vibe, ships Work Mode with enterprise integrations, a VS Code extension, and remote coding agents powered by Mistral Medium 3.5 at 77.6% SWE-Bench.

Gemini CLI Dies June 18 - Google Goes Closed-Source

Google is shutting down free access to its open-source Gemini CLI on June 18, replacing it with the proprietary Antigravity CLI - after accepting 6,000+ community pull requests.

IBM and Red Hat Bet $5B on AI to Secure Open Source

IBM and Red Hat's Project Lightwell deploys 20,000 engineers and AI to patch open source vulnerabilities against exact deployed versions - no forced upgrades, commercial subscription model.

XCENA Raises $135M Betting Memory Is AI's Real Bottleneck

XCENA raises $135M at a $570M valuation to build the MX1 - a CXL 3.2 chip with thousands of RISC-V cores that processes AI workloads where data lives, eliminating costly data transfers between GPU and RAM.

Mistral Physics AI Shrinks Days of Simulation to Seconds

Mistral acquired Vienna-based Emmi AI and launched Physics AI - models that replace multi-day engineering simulations with seconds of inference on a single GPU.

NVIDIA SANA-WM - Minute-Scale Video on One GPU

NVIDIA NVLabs open-sourced SANA-WM, a 2.6B-parameter world model that generates 60-second 720p camera-controlled video on a single GPU, outperforming 14B+ competitors that need 8 GPUs.

ClickUp Cuts 290 Jobs and Deploys 3,000 AI Agents

ClickUp cut 22% of its workforce and replaced those roles with roughly 3,000 internal AI agents - a ratio of about three agents per remaining employee.

TeamPCP Breaches GitHub via Poisoned VS Code Extension

TeamPCP stole 3,800 internal GitHub repos via a malicious Nx Console update that was live for just 11 minutes, an attack that traces back to the TanStack supply chain compromise.

Gemini 3.5 Flash: Real Speed, Selective Benchmarks

Google's Gemini 3.5 Flash is genuinely fast at 289 tok/s and competitive on agentic tasks - but the benchmark portfolio has gaps worth knowing before you build on it.

Chinese Models Claim 60% of OpenRouter Token Traffic

Chinese AI providers now handle over 60% of all tokens routed through OpenRouter, up from less than 2% just a year ago.

NVIDIA Ships Vera CPU to Labs, Claims $200B Market

NVIDIA delivered first Vera CPUs to Anthropic, OpenAI, and SpaceX on May 17-19 as Q1 FY2027 earnings hit $81.6B, with $20B in standalone Vera CPU orders on the books for 2026.