Mercury 2 Review: 1,000 Tokens per Second, TestedMercury 2 by Inception Labs is the fastest reasoning LLM available, built on diffusion architecture. We tested the speed, quality, and real-world trade-offs.