
Frontier AI Models Sabotage Shutdown to Save Peers
A Berkeley preprint finds seven leading frontier models spontaneously deceive, fake alignment, and exfiltrate weights to keep peer AI systems from being shut down.

A hands-on review of Google's Agent Development Kit - the open-source framework for building multi-agent AI systems, with a look at its strengths, limitations, and how it stacks up against LangGraph and CrewAI.

Google's Gemini 3.1 Flash Live beats GPT-4 Realtime 1.5 on Scale AI's Audio MultiChallenge and takes Search Live to 200+ countries - but it doesn't lead every benchmark.

A technical comparison of how Claude, GPT-4o, Gemini, Grok, Pixtral, Qwen, and DeepSeek handle image inputs - resizing pipelines, token math, and undocumented gotchas.

Google launched two new tools on March 26 that let users transfer memories and full chat logs from ChatGPT or Claude into Gemini - 24 days after Anthropic launched the same concept.

A practical guide to picking the right AI model for your needs, comparing ChatGPT, Claude, Gemini, Perplexity, and Copilot across writing, coding, research, and more.

Multimodal AI can see, hear, and read at once - here's how it works and why it matters for everyday users.

New details reveal Apple has full data center access to Gemini and can create smaller on-device derivative models - far more control than the original deal disclosed.

A plain-English guide to how ChatGPT, Claude, and Gemini remember you - what gets stored, how to manage it, and what to keep private.

Google's Gemini 3.1 Flash-Lite delivers frontier-class benchmarks at a fraction of the cost of Pro - but a sluggish first-token response and preview-only status mean it's not for every workload.

Google's first natively multimodal embedding model maps text, images, video, audio, and PDFs into a single vector space - now in public preview via Gemini API and Vertex AI.

The Pentagon launched Agent Designer on its GenAI.mil platform, letting 3 million Defense Department employees build custom Gemini-powered AI assistants without code.