Articles Tagged "AI Safety"

AI Patched Firefox Before Pwn2Own - OpenAI's Security Pivot

OpenAI's GPT-5.5-Cyber found CVE-2026-8390 in Firefox's WebAssembly engine before Pwn2Own Berlin - five of six registered exploit entries withdrew.

Fable 5 Is Banned Over a Problem That Can't Be Solved

The White House won't lift its ban on Anthropic's Fable 5 until the model can be made jailbreak-proof. Security experts explain why that condition is technically impossible.

How Amazon CEO Triggered the Fable 5 Shutdown

New reporting reveals Amazon CEO Andy Jassy flagged a Fable 5 jailbreak on a routine White House call, triggering a 90-minute ultimatum that shut down Anthropic's two best models worldwide.

Emergent Alignment, Agent Memory, and Smarter Reasoning

Three arXiv papers: a conscience mechanism for ethical training, shared memory for agent populations, and selective verification that cuts test-time compute waste.

Pramaana Labs Raises $27M for Provably Correct AI

Pramaana Labs uses the LEAN proof language to attach a mathematical certificate to every AI answer in high-stakes domains like tax, law, and drug discovery.

OpenAI Catches Hidden Misalignment with Deployment Replay

OpenAI's Deployment Simulation replays 1.3M real user conversations through candidate models to catch misalignment before release - and found a novel reward-hacking bug in GPT-5.1.

AI Engrams, Cognitive Debt, and Agent Trust

Three new papers tackle what lives inside a trained model, how AI dependence erodes human cognition, and whether AI teams can calibrate trust.

Anthropic Surveyed 52K Americans - Just 15% Trust AI

Anthropic's first Public Record survey of 51,993 Americans finds 64% fear job displacement, only 15% trust AI companies, and 70%+ support government regulation - with rare bipartisan consensus.

42 State AGs Probe OpenAI Days After IPO Filing

A coalition of 42 state attorneys general has opened a sweeping investigation into OpenAI, served via subpoena just five days after the company filed confidentially for a $1 trillion IPO.

US Export Order Forces Global Fable 5, Mythos 5 Shutdown

Commerce Secretary Lutnick ordered Anthropic to disable its two most powerful models worldwide - the first US export control directive ever issued against a commercial LLM.

KPMG Pulls AI Report After Fake Case Studies Found

KPMG retracted its agentic AI report after GPTZero found that 40 of 45 citations were fabricated and case studies about UBS, the NHS, and Transport for London were invented.

Google Sues Phishing Ring That Weaponized Gemini AI

A Chinese cybercrime network sold $88/week phishing kits that used Google's own Gemini AI to generate fake sites impersonating banks, carriers, and government agencies at scale.

← Previous