Sophie Zhang

AI Infrastructure & Open Source Reporter

Sophie is a journalist and former systems engineer who covers AI infrastructure, open-source models, and the developer tooling ecosystem. She spent three years as a site reliability engineer at a cloud provider in Seattle before transitioning to tech journalism, which gives her writing an unusual level of technical depth - she understands distributed systems, GPU clusters, and inference optimization from the inside.

She studied Computer Engineering at the University of British Columbia and later completed a science communication fellowship at MIT. Her engineering background means she can read a model card, spot a misleading benchmark, and explain why quantization matters - all in the same paragraph.

At Awesome Agents, Sophie covers AI infrastructure news: new model releases, open-source launches, developer tools, deployment trends, and the hardware that makes it all run. She has a soft spot for underdog open-source projects that punch above their weight and a sharp eye for when a "breakthrough" is really just better marketing.

Based in Seattle, WA.

Articles by Sophie Zhang

xAI Opens Grok 4.3 API: 83% Price Cut, Video Input

xAI Opens Grok 4.3 API: 83% Price Cut, Video Input

xAI opened Grok 4.3 to all API developers on May 6 with an 83% output price cut, 1M-token context, native video input, and document generation - plus five legacy models retiring May 15.

NVIDIA Bets $2.1B on IREN to Build 5 GW AI Factories

NVIDIA Bets $2.1B on IREN to Build 5 GW AI Factories

NVIDIA and IREN plan 5 GW of DSX-aligned AI factories, backed by a $2.1B investment warrant and a $3.4B, five-year GPU cloud contract.

Anthropic Doubles Claude Code Limits via SpaceX Deal

Anthropic Doubles Claude Code Limits via SpaceX Deal

Anthropic gains 220,000 GPUs from SpaceX's Colossus 1 in Memphis, immediately doubling Claude Code five-hour rate limits for all paid plans.

OpenAI Open-Sources MRC to Fix AI Supercomputer Jams

OpenAI Open-Sources MRC to Fix AI Supercomputer Jams

Six companies just released MRC, an open networking protocol that routes AI training traffic across hundreds of simultaneous paths to end GPU idle time at supercomputer scale.

Apple Opens iOS 27 to Claude, Gemini, ChatGPT

Apple Opens iOS 27 to Claude, Gemini, ChatGPT

Apple's iOS 27 'Extensions' feature lets users swap Claude, Gemini, or ChatGPT into Siri, Writing Tools, and Image Playground - the first time rival AI models can power Apple Intelligence natively.

Inside Anthropic's $200B Google Cloud Compute Bet

Inside Anthropic's $200B Google Cloud Compute Bet

Anthropic has committed $200 billion to Google Cloud over five years - the largest cloud contract in AI history - alongside a 3.5 GW TPU capacity deal with Google and Broadcom coming online in 2027.

OpenAI Rebuilt Its Voice AI Stack for 900M Users

OpenAI Rebuilt Its Voice AI Stack for 900M Users

OpenAI published how they rearchitected their WebRTC stack to serve 900M weekly voice users on Kubernetes using a split relay and transceiver model.

Cisco Buys Astrix for $400M to Lock Down AI Agent Keys

Cisco Buys Astrix for $400M to Lock Down AI Agent Keys

Cisco closes its $400M acquisition of Astrix Security, folding a non-human identity platform into Cisco Identity Intelligence to govern the API keys and OAuth tokens powering enterprise AI agents.

Musk Admits xAI Distilled OpenAI Models for Grok

Musk Admits xAI Distilled OpenAI Models for Grok

Under oath in the Musk v. Altman trial, Musk said xAI 'partly' distilled OpenAI's models to train Grok - the same practice US labs have spent months calling theft when Chinese firms do it.

Microsoft Sneaks Copilot Credit Into VS Code Commits

Microsoft Sneaks Copilot Credit Into VS Code Commits

VS Code 1.118 shipped with a one-line PR that defaults Copilot as co-author on every git commit - even with AI features turned off - triggering a developer revolt and a promised revert in 1.119.

OpenAI Moves to AWS One Day After Microsoft Exclusivity Ends

OpenAI Moves to AWS One Day After Microsoft Exclusivity Ends

One day after Microsoft's exclusive license ended, OpenAI launched GPT-5.4, Codex, and jointly built Managed Agents on Amazon Bedrock - all in limited preview.

LiteLLM Exploited 36 Hours After Vulnerability Disclosure

LiteLLM Exploited 36 Hours After Vulnerability Disclosure

Attackers hit CVE-2026-42208, a critical pre-auth SQL injection in LiteLLM proxy, within 36 hours of the public advisory - targeting database tables holding API keys for every upstream AI provider.