
Claude Code Gets Auto Mode - No More Permission Prompts
Anthropic's Claude Code launches an Auto Mode research preview on March 12, letting the agent handle permission decisions autonomously instead of interrupting developers at every step.

Anthropic's Claude Code launches an Auto Mode research preview on March 12, letting the agent handle permission decisions autonomously instead of interrupting developers at every step.

Alibaba's SWE-CI benchmark tested 18 AI models on 100 real codebases across 233 days of maintenance. Most agents accumulate technical debt and break previously working code. Only Claude Opus stays above 50% zero-regression.

Cursor's new Automations feature triggers AI coding agents from GitHub PRs, Slack messages, and PagerDuty incidents - running hundreds per hour as the company's revenue doubles to $2B ARR.

VS Code 1.110 ships native browser control for AI agents, installable agent plugins with MCP support, persistent session memory, and a new Agent Debug panel.

OpenAI's Codex desktop app hits the Microsoft Store with native Windows sandboxing, PowerShell agents, WinUI skills, and multi-agent parallel execution - no WSL required.

Kimi K2.5 leads every coding benchmark, but Qwen3.5-35B-A3B delivers 87-93% of that performance at 3-4x lower cost and runs on a single consumer GPU. Here is the full breakdown.

Cursor doubled its annualized revenue to $2 billion in just three months, making it the fastest-growing SaaS company in history. But its dependence on model providers raises hard questions about margins and survival.

Mistral Vibe 2.0 pairs the open-weight Devstral 2 model with a terminal-native coding agent. We tested it head-to-head against Claude Code and Codex.

Claude Code alone now authors 4% of all GitHub commits. At its current growth rate, AI code output will match 200,000 developers by year-end, a million by 2027, and over a billion by 2028.

A pre-release comparison of DeepSeek V4 and Claude Opus 4.6 - the open-weight challenger that could match Opus on coding at potentially 89x lower output cost.

Cursor's cloud agents now run in isolated VMs where they write software, test it themselves, record video proof, and submit merge-ready pull requests.

Apple releases Xcode 26.3 with native support for Claude Agent and OpenAI Codex, exposing 20 MCP tools that let AI agents build, test, and visually verify iOS apps autonomously.