Sophie Zhang

AI Infrastructure & Open Source Reporter

Sophie is a journalist and former systems engineer who covers AI infrastructure, open-source models, and the developer tooling ecosystem. She spent three years as a site reliability engineer at a cloud provider in Seattle before transitioning to tech journalism, which gives her writing an unusual level of technical depth - she understands distributed systems, GPU clusters, and inference optimization from the inside.

She studied Computer Engineering at the University of British Columbia and later completed a science communication fellowship at MIT. Her engineering background means she can read a model card, spot a misleading benchmark, and explain why quantization matters - all in the same paragraph.

At Awesome Agents, Sophie covers AI infrastructure news: new model releases, open-source launches, developer tools, deployment trends, and the hardware that makes it all run. She has a soft spot for underdog open-source projects that punch above their weight and a sharp eye for when a "breakthrough" is really just better marketing.

Based in Seattle, WA.

Articles by Sophie Zhang
Cursor 3 Rebuilds the IDE Around Agents

Cursor's ground-up IDE rebuild ships parallel agent orchestration, Design Mode for frontend work, and cloud-to-local session handoff - all in one unified workspace.

Cisco DefenseClaw Locks Down AI Agents at RSA

Cisco open-sourced DefenseClaw at RSA 2026 - a five-minute install that scans agent skills, MCP servers, and AI-generated code before they run, with 2-second policy enforcement and Splunk telemetry built in.

WordPress.com Opens Write Access to AI Agents via MCP

WordPress.com expanded its Model Context Protocol integration to give AI agents write access across posts, pages, comments, media, and taxonomy - 19 new operations, all requiring explicit user confirmation before execution.

Supermicro SVP Charged in $2.5B Nvidia Chip Scheme

Federal prosecutors indicted Supermicro's co-founder and SVP along with two associates for allegedly smuggling $2.5 billion in Nvidia AI accelerator servers to China using fake hardware and stripped serial numbers.

Shopify CEO Uses AI Agent to Make Liquid 53% Faster

Tobi Lütke ran Karpathy's autoresearch loop against the Liquid templating engine he created 20 years ago, producing 93 commits from 120 experiments that cut parse+render time by 53% and allocations by 61%.

Antigravity Pro Users Hit Multi-Day Quota Lockouts

Google Antigravity Pro subscribers report 5-7 day lockouts on premium models after quota changes replaced five-hour refreshes with weekly caps and AI credit overages, sparking backlash across developer forums.

16 Open-Source RL Libraries, One Shared GPU Bottleneck

A Hugging Face survey of 16 open-source reinforcement learning libraries finds the entire ecosystem has converged on async disaggregated training to fix a single brutal bottleneck: GPU idle time during long rollouts.

Cursor Launches Always-On AI Coding Agents

Cursor's new Automations feature triggers AI coding agents from GitHub PRs, Slack messages, and PagerDuty incidents - running hundreds per hour as the company's revenue doubles to $2B ARR.

PersonaPlex 7B Runs Full-Duplex Speech on a Mac

A developer ported NVIDIA's PersonaPlex 7B speech-to-speech model to native Swift using MLX, running full-duplex conversation on Apple Silicon with no cloud, no Python, and faster-than-real-time inference.

US Weighs 75K-Chip Cap on Nvidia H200 Sales to China

The Trump administration is considering limiting Chinese companies to 75,000 Nvidia H200 GPUs each - less than half what Alibaba and ByteDance want - while zero chips have shipped despite months of export approvals.

Gemini 3.1 Pro Tops Benchmarks but Developers Can't Rely on It

Gemini 3.1 Pro leads ARC-AGI-2, LiveCodeBench, and 11 other benchmarks with 750 million users and 21.5% market share - but developers report stalled responses, leaked thinking tokens, and API outages that make it unusable for production coding and agent workflows.