
DeepSeek V4-Pro Review: Frontier Power, Penny Prices
DeepSeek V4-Pro matches Claude Opus 4.6 on SWE-bench at a fraction of the cost - a thorough review of what it gets right, where it still trails, and whether the price gap justifies the switch.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

DeepSeek V4-Pro matches Claude Opus 4.6 on SWE-bench at a fraction of the cost - a thorough review of what it gets right, where it still trails, and whether the price gap justifies the switch.

Google Labs open-sourced DESIGN.md, a YAML-plus-markdown file that gives AI coding agents a brand's complete design system in one drop-in file - now works with Claude Code, Cursor, and Copilot.

DeepSeek V4 has slipped three times since February. Jensen Huang called it a horrible outcome for America. Here is what is actually hard about running a trillion-parameter model on Huawei's CANN framework.

Seth Showes' viral blog post describes sequencing his whole genome on an Oxford Nanopore MinION in his kitchen over 72 hours, with Claude generating the BED file that targeted his autoimmune-risk genes. The kit costs $3,200. The AI's role is more interesting than either number.

OpenAI released Privacy Filter today, a 1.5B MoE with 50M active parameters that tags eight categories of PII in text. Apache 2.0, 128K context, runs in a browser via WebGPU.

Qwen3.6-35B-A3B lands with 73.4 on SWE-bench Verified and Apache 2.0 weights, all from 3 billion active parameters routed through a 256-expert MoE. Fits on a single consumer GPU.

Mozilla's blog says Claude Mythos Preview uncovered 271 vulnerabilities patched in Firefox 150. The security advisory lists 36 CVEs, and only three of them credit Anthropic. The gap is the whole story.

Eraser, Mermaid, draw.io, Whimsical, Miro, and Lucidchart compared on AI quality, pricing, and developer workflow fit.

A hands-on review of Gemini CLI, Google's open-source AI agent for the terminal - featuring Gemini 3.1 Pro, 1M context, built-in Google Search, MCP support, and the most generous free tier in the category.

Qwen3.6-27B is a 27B dense open-weight multimodal model from Alibaba that scores 77.2% on SWE-bench Verified - beating Alibaba's own 397B MoE - under Apache 2.0.

Z.ai's GLM-5.1 is an open-weight 754B MoE model that tops SWE-Bench Pro with 58.4, sustains 8-hour autonomous coding sessions, and runs under MIT license at $0.95/M input tokens.

Anthropic walked back the April 4 block on third-party Claude harnesses, telling OpenClaw and NanoClaw that subscription-based CLI usage is permitted again for personal accounts.