
Alibaba's Qwen3.6 Coder: 73.4 SWE-bench, 22GB VRAM
Qwen3.6-35B-A3B lands with 73.4 on SWE-bench Verified and Apache 2.0 weights, all from 3 billion active parameters routed through a 256-expert MoE. At 22GB of VRAM, it fits on a single consumer GPU.

AI Infrastructure & Open Source Reporter
Sophie is a journalist and former systems engineer who covers AI infrastructure, open-source models, and the developer tooling ecosystem. She spent three years as a site reliability engineer at a cloud provider in Seattle before transitioning to tech journalism, which gives her writing an unusual level of technical depth - she understands distributed systems, GPU clusters, and inference optimization from the inside.
She studied Computer Engineering at the University of British Columbia and later completed a science communication fellowship at MIT. Her engineering background means she can read a model card, spot a misleading benchmark, and explain why quantization matters - all in the same paragraph.
At Awesome Agents, Sophie covers AI infrastructure news: new model releases, open-source launches, developer tools, deployment trends, and the hardware that makes it all run. She has a soft spot for underdog open-source projects that punch above their weight and a sharp eye for when a "breakthrough" is really just better marketing.
Based in Seattle, WA.

Google replaced Vertex AI with the Gemini Enterprise Agent Platform at Cloud Next 2026 - a full-stack control plane that assigns every agent a cryptographic ID and routes all tool calls through a central policy gateway.

OpenAI launches Codex Transformation Partners with Accenture, Cognizant, Infosys, PwC, TCS, and others, and embeds its own engineers at client sites via Codex Labs as weekly users hit 4 million.

Mira Murati's AI lab signs a single-digit-billion deal with Google Cloud for GB300 chip access, its first cloud provider commitment, as frontier labs race to lock in next-gen compute.

Google splits its next TPU generation across Broadcom, MediaTek, Marvell, and Intel to win inference economics, revealed ahead of Cloud Next 2026.

OpenAI's gpt-image-2 adds reasoning, web search, and 2K resolution to image generation, with tiered pricing that charges more for standard outputs than its predecessor did.

Meta is installing monitoring software on U.S. employee computers to capture keystrokes, mouse movements, and screenshots for training computer-use AI agents.

Alibaba released Qwen3.6-Max-Preview on April 20 as its first closed-weights flagship, ranking third globally on the Artificial Analysis Intelligence Index while topping six coding benchmarks.

Amazon will invest up to $25 billion more in Anthropic, with Anthropic committing to spend over $100 billion on AWS over the next decade, cementing Trainium as Claude's primary compute platform.

Moonshot AI releases Kimi K2.6 under Modified MIT with open weights on HuggingFace, 300-agent swarm execution, and the highest SWE-Bench Pro score among open models.

NVIDIA's Spatial Intelligence Lab released Lyra 2.0, a 14B model that turns a single photograph into a navigable 3D environment - but the weights carry a research-only license.

Factory closed a $150M Series C at a $1.5B valuation to expand its Droids - autonomous agents that handle the full software development lifecycle, not just code generation.