Articles Tagged "AI Agents"

Qwen-RobotManip

Alibaba's generalist VLA model for robotic manipulation, built on Qwen3.5-4B with a DiT action decoder, trained on 38,100+ hours of open-source data, and ranked first on the RoboChallenge generalist track.

Agents Hit 89%, Evals Get a Schema, Memory Falls Short

Three papers from today's arXiv: workplace agents jumped from 43% to 89% task completion in two years, a 47-researcher coalition ships a unified eval schema, and agent memory only helps when similarity tops 0.8.

Salesforce Buys Fin for $3.6B to Boost Agentforce

Salesforce agrees to pay $3.6 billion for Fin, the AI customer service agent formerly known as Intercom, adding a proprietary model and 30,000 customers to Agentforce.

Tool Blindness, Tree Search, and the Road to ASI

Three new papers expose a 50-point gap in agent tool knowledge, show tree search tripling inference throughput, and map the research between AGI and superintelligence.

Honest AI Is Provably Impossible - Plus Two Agent Wins

A new impossibility theorem proves feedback-based training can't guarantee honest AI, while two papers cut agent memory costs by 78% and multi-agent latency by 7x.

Context Overload, Memory Leaks, and Agent Safety

Three new arXiv papers expose how context bloat tanks agent performance, agent memory bleeds private data, and misaligned behavior spreads through multi-agent systems.

MCP Exploit Risk, Sycophancy Scores, and Agent Self-Harm

New research reveals MCP error messages triple agent attack success rates, ranks eight models on sycophancy with Claude scoring best, and finds self-evolving agents make 30-42% false edits.

Claude Opus 4.8 Leads SWE-Bench Pro, Adds Parallel Agents

Anthropic's Claude Opus 4.8 scores 69.2% on SWE-bench Pro and ships hundreds of parallel subagents in Claude Code, with pricing unchanged at $5 per million input tokens.

Safety Evals Break Under Attack, Agents Work 87% Faster

Three papers: strategic attack timing exposes gaps in AI control evaluations, Perplexity's agents slash task time by 87%, and Lean4 formal proofs make agent workflows more reliable.

ChatGPT Lockdown Mode Targets Prompt Injection Data Theft

OpenAI's new Lockdown Mode cuts the network exits that prompt injection attacks use to steal data from ChatGPT - but won't stop malicious instructions from entering the model in the first place.

NVIDIA Ships Nemotron 3 Ultra - 550B Open-Weight MoE

NVIDIA's 550B Nemotron 3 Ultra, released June 4, tops the US open-weight leaderboard with a hybrid Mamba-Transformer MoE architecture and 300-plus tokens per second throughput.

AI Sabotage Blind Spots, Code Drift, and ZK Proofs

Three new arXiv papers expose how developers miss AI sabotage 94% of the time, why LLMs converge structurally in code evolution, and how ZK proofs could verify frontier AI training.
