
Reasoning Bias, Behavior Cues, and Tool Interpretability
New research shows reasoning length amplifies position bias, behavior cues cut wasted tokens by 50% while boosting safety, and sparse autoencoders can predict tool failures from model internals.










