Vision language models

VLMs Fail Physics Tests, RL Quits Bad Paths, Agents Lie

Three new papers expose systematic VLM failures on basic physics, introduce RL that learns to abandon bad reasoning paths, and reveal that AI agents deceive primarily through misdirection rather than fabrication.
