
OpenClaw-RL Lets You Train a Personal AI Agent Just by Talking to It
Gen-Verse's new open-source framework uses asynchronous reinforcement learning to personalize LLMs through natural conversation - no labeling, no datasets, just feedback.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

Gen-Verse's new open-source framework uses asynchronous reinforcement learning to personalize LLMs through natural conversation - no labeling, no datasets, just feedback.

NIST's Center for AI Standards and Innovation launched a federal initiative to build identity, security, and interoperability standards for autonomous AI agents - addressing the reality that 80% of Fortune 500 companies deploy agents with virtually no governance infrastructure.

New papers tackle training collapse in agentic RL with a unified stabilization recipe, reveal when querying multiple models actually helps, and expose a paradox where LLMs claim to trust humans but bet on algorithms.

The French government's digital agency Etalab shipped an open-source MCP server for data.gouv.fr, giving AI agents structured access to 74,000 public datasets without an API key.

OpenAI partners with the world's largest consulting firms to deploy its Frontier AI agent platform across enterprise clients, signaling a decisive shift from consumer chatbot to corporate operating system.

Nous Research's Hermes Agent is an open-source CLI agent with persistent multi-level memory, cross-platform messaging support, subagent delegation, and a growing skills ecosystem.

Perplexity's new Computer product breaks tasks into sub-agents routed across Claude, Gemini, GPT-5.2, and Grok, running autonomously for days or months in isolated cloud sandboxes. Available now for Max subscribers at $200/month.

Anthropic acquires Seattle startup Vercept and its nine-person team of Allen Institute for AI alumni, folding their vision-based desktop automation into Claude as computer use scores hit 72.5% on OSWorld.

Google is launching Gemini automation as a beta on Pixel 10 and Samsung Galaxy S26 - long-press the power button, describe a task, and Gemini navigates apps like Uber and DoorDash in the background to complete it for you.

Today's arXiv picks: a state-machine framework that makes GUI agents 12x cheaper, a training method that forces chain-of-thought to be honest, and a KV cache system that matches full quality at 1% the memory.

Moonshot AI's Kimi K2.5 is a 1T-parameter MoE model activating 32B per token with native multimodal vision via MoonViT-3D, Agent Swarm coordination of up to 100 sub-agents via PARL, and top-tier math and coding benchmarks under a modified MIT license.

Google adds an agent step to Opal, its no-code AI mini-app builder, powered by Gemini 3 Flash. The agent picks its own tools, remembers user preferences across sessions, and routes itself through workflows.