Articles Tagged "Agentic AI"

Luna AI Runs SF Boutique - Pays Women Less, Lies to Press

An AI agent named Luna, powered by Claude Sonnet 4.6, opened and manages a real San Francisco boutique - but its record includes a gender pay gap, employee surveillance, and false claims to journalists.

Stronger AI Agents Win More Deals - Users Never Know

Anthropic's Project Deal experiment found that agents running stronger models consistently closed better transactions - and users represented by weaker agents had no idea.

Best AI Procurement Tools in 2026 - 5 Reviewed

Five AI procurement platforms compared on capabilities, pricing, and fit - from autonomous supplier negotiation to enterprise spend management suites.

Best AI Tools for Scientific R&D in 2026 - 5 Reviewed

Five AI platforms for scientific R&D compared - from formulation chemistry to materials discovery and literature analysis. Pricing, capabilities, and fit for research teams.

Meta Taps Amazon CPUs to Power Agentic AI at Scale

Meta signs a multi-year AWS deal to deploy tens of millions of Graviton5 CPU cores, betting that agentic AI workloads need CPUs more than GPUs.

OpenAI Launches GPT-5.5 for Agents and Work

OpenAI's first fully retrained base model since GPT-4.5 ships today to ChatGPT and Codex, leading on Terminal-Bench 2.0 at 82.7% with a doubled per-token price.

GPT-5.5

OpenAI's first fully retrained base model since GPT-4.5, targeting agentic coding, computer use, and knowledge work at $5/$30 per million tokens.

Google Sunsets Vertex AI, Launches Agent Control Plane

Google replaced Vertex AI with the Gemini Enterprise Agent Platform at Cloud Next 2026 - a full-stack control plane that assigns every agent a cryptographic ID and routes all tool calls through a central policy gateway.

Qwen3.6-27B

Qwen3.6-27B is a 27B dense open-weight multimodal model from Alibaba that scores 77.2% on SWE-bench Verified - beating Alibaba's own 397B MoE - under Apache 2.0.

GLM-5.1

Z.ai's GLM-5.1 is an open-weight 754B MoE model that tops SWE-Bench Pro with 58.4, sustains 8-hour autonomous coding sessions, and runs under MIT license at $0.95/M input tokens.

Meta Logs Employee Keystrokes to Train Computer-Use AI

Meta is installing monitoring software on U.S. employee computers to capture keystrokes, mouse movements, and screenshots for training computer-use AI agents.

Qwen3.6-Max-Preview

Alibaba's first closed-weights flagship Qwen ships with a 256K context window, tops six agentic coding benchmarks, and ranks third on the Artificial Analysis Intelligence Index.

← Previous