Articles Tagged "AI Safety"

Reasoning Leaks, Hard Limits, and Self-Aware LLMs

Reasoning Leaks, Hard Limits, and Self-Aware LLMs

Three new papers expose how reasoning traces can be extracted from supposedly hidden model internals, where chain-of-thought hits an architectural ceiling, and how RL teaches models to know when to quit.

OpenAI Governance Doc Targets California and EU AI Law

OpenAI Governance Doc Targets California and EU AI Law

OpenAI published its first public compliance framework mapping internal safety practices to California's SB 53 and the EU AI Act - but critics note the underlying Preparedness Framework quietly dropped manipulation from its risk categories last April.