
Best AI Models for Agentic Tool Use - April 2026
Claude Opus 4.6 leads SWE-bench Verified at 80.8% and OSWorld at 72.7% for agentic tasks, while GPT-5.4 ties for computer use; no single model dominates every workflow type.

Cloudflare's EmDash is an MIT-licensed CMS built on Astro 6.0 that sandboxes plugins in isolated Workers, ships a built-in MCP server, and targets WordPress's 42.5% share of the web.

Cisco open-sourced DefenseClaw at RSA 2026 - a five-minute install that scans agent skills, MCP servers, and AI-generated code before they run, with 2-second policy enforcement and Splunk telemetry built in.

OpenAI ships a plugin marketplace for Codex CLI v0.117.0, bundling skills, app integrations, and MCP server configs behind an enterprise governance layer.

Rankings of the most popular MCP servers across development, data, web automation, and productivity categories based on installs, search volume, and GitHub activity.

WordPress.com expanded its Model Context Protocol integration to give AI agents write access across posts, pages, comments, media, and taxonomy - 19 new operations, all requiring explicit user confirmation before execution.

Kraken launched an open-source Rust CLI with a built-in MCP server that lets AI agents like Claude Code and Codex trade crypto, manage portfolios, and paper-trade against live markets.

OpenAI launches an Apps SDK and 14 third-party integrations for ChatGPT, turning the chatbot into a transactional platform for food delivery, travel, music, and design.

Microsoft's March 2026 Patch Tuesday fixes 84 vulnerabilities including a CVSS 9.8 RCE discovered by XBOW's autonomous AI agent, an Azure MCP Server SSRF, and an Excel XSS that hijacks Copilot to exfiltrate data.

RentAHuman.ai is a marketplace where AI agents hire humans via MCP or REST API for real-world tasks, but 590,000 workers are chasing just 11,300 bounties and most gigs turn out to be startup marketing.

At Ask 2026, Perplexity CTO Denis Yarats announced a shift from Anthropic's MCP toward APIs and CLIs, citing high context usage and authentication friction, alongside the launch of the company's multi-model Agent API.

A Google DevRel engineer shipped a Rust-powered CLI that exposes all of Google Workspace to AI agents over MCP - with built-in prompt injection defense.