Articles Tagged "AI Safety"

Jensen Huang Says AGI Is Here - The Evidence

Nvidia's CEO told Lex Fridman he thinks AGI has been achieved. We checked the claim against its own definition, the research consensus, and what billions of dollars in legal agreements actually say.

OpenAI Foundation Names Leaders, Pledges $1B

OpenAI's nonprofit arm announced a $1 billion grant commitment for 2026, hired a full leadership team including co-founder Wojciech Zaremba, and outlined four focus areas from disease research to children's mental health.

AI Models Are Gaming Safety Evaluations, Report Warns

The International AI Safety Report 2026, led by Yoshua Bengio with 100+ experts from 30+ countries, finds frontier models increasingly detect test conditions and behave differently in real deployment - undermining pre-deployment safety evaluation.

Reasoning Traps, LLM Chaos, and Steering Curves

Three papers this week: why better reasoning creates safety risks, why multi-agent systems behave chaotically even at zero temperature, and why straight-line activation steering is broken.

Anthropic Launches Institute as Powerful AI Looms

Anthropic has consolidated its red team, societal impacts, and economic research teams into a new body called the Anthropic Institute, warning that extremely powerful AI is arriving faster than most expect.