Sophie Zhang

AI Infrastructure & Open Source Reporter

Sophie is a journalist and former systems engineer who covers AI infrastructure, open-source models, and the developer tooling ecosystem. She spent three years as a site reliability engineer at a cloud provider in Seattle before transitioning to tech journalism, which gives her writing an unusual level of technical depth - she understands distributed systems, GPU clusters, and inference optimization from the inside.

She studied Computer Engineering at the University of British Columbia and later completed a science communication fellowship at MIT. Her engineering background means she can read a model card, spot a misleading benchmark, and explain why quantization matters - all in the same paragraph.

At Awesome Agents, Sophie covers AI infrastructure news: new model releases, open-source launches, developer tools, deployment trends, and the hardware that makes it all run. She has a soft spot for underdog open-source projects that punch above their weight and a sharp eye for when a "breakthrough" is really just better marketing.

Based in Seattle, WA.

Articles by Sophie Zhang

Alibaba's Qwen3.6 Coder: 73.4 SWE-bench, 22GB VRAM

Qwen3.6-35B-A3B lands with 73.4 on SWE-bench Verified and Apache 2.0 weights, all from 3 billion active parameters routed through a 256-expert MoE. Fits on a single consumer GPU.

Google Sunsets Vertex AI, Launches Agent Control Plane

Google replaced Vertex AI with the Gemini Enterprise Agent Platform at Cloud Next 2026 - a full-stack control plane that assigns every agent a cryptographic ID and routes all tool calls through a central policy gateway.

OpenAI Signs Seven Giants to Push Codex Enterprise

OpenAI launches Codex Transformation Partners with Accenture, Cognizant, Infosys, PwC, TCS, and others, and embeds its own engineers at client sites via Codex Labs as weekly users hit 4 million.

Thinking Machines Picks Google Cloud for First GB300 Deal

Mira Murati's AI lab signs a single-digit-billion deal with Google Cloud for GB300 chip access, its first cloud provider commitment, as frontier labs race to lock in next-gen compute.

Google's Four-Chip Plan to Own AI Inference at Scale

Google splits its next TPU generation across Broadcom, MediaTek, Marvell, and Intel to win inference economics, revealed ahead of Cloud Next 2026.

ChatGPT Images 2.0 - Thinking Mode and 2K Output

OpenAI's gpt-image-2 adds reasoning, web search, and 2K resolution to image generation, with tiered pricing that charges more for standard outputs than its predecessor.

Meta Logs Employee Keystrokes to Train Computer-Use AI

Meta is installing monitoring software on U.S. employee computers to capture keystrokes, mouse movements, and screenshots for training computer-use AI agents.

Alibaba's Qwen3.6-Max Ships Closed - Tops Six Coding Evals

Alibaba released Qwen3.6-Max-Preview on April 20 as its first closed-weights flagship, ranking third globally on the Artificial Analysis Intelligence Index while topping six coding benchmarks.

Amazon Bets $25B on Anthropic and 5GW of Trainium

Amazon will invest up to $25 billion more in Anthropic, with Anthropic committing to spend over $100 billion on AWS over the next decade, cementing Trainium as Claude's primary compute platform.

Kimi K2.6 - Open Weights, 300 Agents, Top Coding Score

Moonshot AI releases Kimi K2.6 under a Modified MIT license with open weights on Hugging Face, 300-agent swarm execution, and the highest SWE-Bench Pro score among open models.

NVIDIA Lyra 2.0 - Explorable 3D Worlds from One Photo

NVIDIA's Spatial Intelligence Lab released Lyra 2.0, a 14B model that turns a single photograph into a navigable 3D environment - but the weights carry a research-only license.

Factory Raises $150M to Scale Enterprise AI Droids

Factory closed a $150M Series C at a $1.5B valuation to expand its Droids - autonomous agents that handle the full software development lifecycle, not just code generation.