Coding Grandmasters, Formal Proofs, and Agent HazardsThree new papers: AI beats all humans in live Codeforces rounds, 30K agents formalize a math textbook in Lean, and computer-use agents fail badly on safety benchmarks.