Articles Tagged "AI Agents"

OpenClaw-RL Lets You Train a Personal AI Agent Just by Talking to It

Gen-Verse's new open-source framework uses asynchronous reinforcement learning to personalize LLMs through natural conversation - no labeling, no datasets, just feedback.

NIST Launches AI Agent Standards Initiative - Because Nobody Knows Who an AI Agent Is, What It Can Do, or Who's Liable When It Breaks

NIST's Center for AI Standards and Innovation launched a federal initiative to build identity, security, and interoperability standards for autonomous AI agents - addressing the reality that 80% of Fortune 500 companies deploy agents with virtually no governance infrastructure.

Today in AI Research: Stable Agent Training, Compound AI Limits, and the Algorithm Trust Paradox

New papers tackle training collapse in agentic RL with a unified stabilization recipe, reveal when querying multiple models actually helps, and expose a paradox where LLMs claim to trust humans but bet on algorithms.

France Builds an Official MCP Server for Its National Open Data Platform - 74,000 Datasets Now Queryable by AI Agents

The French government's digital agency Etalab shipped an open-source MCP server for data.gouv.fr, giving AI agents structured access to 74,000 public datasets without an API key.

OpenAI Signs Multi-Year Deals With McKinsey, BCG, Accenture, and Capgemini to Sell Frontier

OpenAI partners with the world's largest consulting firms to deploy its Frontier AI agent platform across enterprise clients, signaling a decisive shift from consumer chatbot to corporate operating system.

Nous Research Launches Hermes Agent - An Open Source Agent That Remembers Everything

Nous Research's Hermes Agent is an open-source CLI agent with persistent multi-level memory, cross-platform messaging support, subagent delegation, and a growing skills ecosystem.

Perplexity Launches Computer - a $200/Month Agent Platform That Orchestrates 19 AI Models to Run Projects for Weeks

Perplexity's new Computer product breaks tasks into sub-agents routed across Claude, Gemini, GPT-5.2, and Grok, running autonomously for days or months in isolated cloud sandboxes. Available now for Max subscribers at $200/month.

Anthropic Acquires Vercept to Supercharge Claude's Computer Use - UiPath Stock Drops 3.6%

Anthropic acquires Seattle startup Vercept and its nine-person team of Allen Institute for AI alumni, folding their vision-based desktop automation into Claude as computer use scores hit 72.5% on OSWorld.

Google's Gemini Can Now Book Rides and Order Food on Your Phone - No Tapping Required

Google is launching Gemini automation as a beta on Pixel 10 and Samsung Galaxy S26 - long-press the power button, describe a task, and Gemini navigates apps like Uber and DoorDash in the background to complete it for you.

Programmatic GUI Agents, Faithful Chain-of-Thought, and the 1% KV Cache

Today's arXiv picks: a state-machine framework that makes GUI agents 12x cheaper, a training method that forces chain-of-thought to be honest, and a KV cache system that matches full quality at 1% the memory.

Kimi K2.5

Moonshot AI's Kimi K2.5 is a 1T-parameter MoE model activating 32B per token with native multimodal vision via MoonViT-3D, Agent Swarm coordination of up to 100 sub-agents via PARL, and top-tier math and coding benchmarks under a modified MIT license.

Google Opal Gets an Agent Brain - No-Code AI Workflows Just Became Autonomous

Google adds an agent step to Opal, its no-code AI mini-app builder, powered by Gemini 3 Flash. The agent picks its own tools, remembers user preferences across sessions, and routes itself through workflows.

← Previous