
Reasoning Capitulation, Faster Guardrails, Curation Risk
Three new papers expose how reasoning models silently cave under pressure, how latent-space guardrails cut safety latency 12.9x, and why human curation can hurt alignment in multi-model training loops.










