Llm as judge

Reward Model and LLM-as-Judge Leaderboard

Reward Model and LLM-as-Judge Leaderboard

Rankings of dedicated reward models and frontier LLMs as judges across RewardBench, RewardBench-2, and JudgeBench - benchmarks that measure preference alignment and human agreement.