
Qwen 3.6 Ships a 35B MoE That Codes Like Models 10x Its Size
Alibaba's Qwen 3.6-35B-A3B activates only 3B of its 35B parameters per token, scores 73.4% on SWE-bench Verified, handles video and images, and ships under Apache 2.0.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

Alibaba's Qwen 3.6-35B-A3B activates only 3B of its 35B parameters per token, scores 73.4% on SWE-bench Verified, handles video and images, and ships under Apache 2.0.

Anthropic's latest flagship model ships with 3x higher resolution vision, a new xhigh effort level, task budgets for cost control, cyber safeguards, and 13% better coding performance at the same $5/$25 pricing.

Anthropic releases Claude Opus 4.7 with 3x higher resolution vision, a new xhigh effort level, task budgets for cost control, /ultrareview in Claude Code, and cyber safeguards that automatically block high-risk requests.

OpenAI's GPT-5.4-Cyber is a restricted model fine-tuned for defensive cybersecurity with binary reverse engineering and reduced refusal rates, available only through identity-verified access tiers - a direct response to Anthropic's Mythos Preview.

The Information reports Anthropic is prepping Claude Opus 4.7 and an AI design tool for imminent release, while OpenAI launched GPT-5.4-Cyber yesterday - a restricted cybersecurity model that directly challenges Claude Mythos.

Elephant Alpha is a free 100B parameter model on OpenRouter with 256K context, tool use, and structured output - but your prompts get logged, there are no benchmarks, and nobody knows who made it.

Google's Gemma 4 family - four models, full Apache 2.0 licensing, and benchmark scores that challenge models 10x their size - is the most consequential open-weight release of 2026 so far.

Microsoft's MAI division releases MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 - beating OpenAI and Google on key benchmarks while signaling a strategic break from exclusive reliance on its OpenAI partnership.

A practical guide to picking the right AI model for your needs, comparing ChatGPT, Claude, Gemini, Perplexity, and Copilot across writing, coding, research, and more.

NVIDIA's new Nemotron-Cascade-2-30B-A3B activates just 3B parameters per token, runs on a single RTX 4090, and outscores NVIDIA's own 120B model on coding and math benchmarks.

OpenAI released GPT-5.4 mini and nano on March 17, bringing near-flagship performance at 70% and 92% lower cost respectively.

GPT-5.4 brings native computer use, a 1M token context window, and serious coding muscle to OpenAI's mainline model - but at a premium price.