Leaderboards

Embedding Model Leaderboard: MTEB Rankings March 2026

Rankings of the best embedding models by MTEB scores, comparing retrieval quality, dimensions, speed, and pricing for RAG and search.

Multilingual LLM Leaderboard: March 2026 Rankings

Rankings of the best AI models for multilingual tasks, covering 16 languages across the Artificial Analysis Multilingual Index and MGSM benchmarks.

Math Olympiad AI Leaderboard - March 2026 Rankings

Rankings of AI models on competition mathematics benchmarks including AIME 2025, IMO, MathArena, and FrontierMath, measuring the cutting edge of mathematical reasoning.

Agentic AI Benchmarks Leaderboard - GAIA, WebArena, BFCL, and Tau2-Bench

Rankings of the best AI models and agent frameworks on agentic benchmarks measuring real-world task completion, web navigation, function calling, and multi-turn tool use.

Home GPU LLM Leaderboard: Best Open Source Models by VRAM Tier with Token/s Benchmarks

Rankings of the best open-source LLMs you can run on home hardware (RTX 4090, RTX 3090, Apple M3/M4 Max), organized by VRAM tier with real-world token/s benchmarks and quality scores.

Do AI Benchmarks Still Matter? The Evidence for and Against Public Leaderboards

A data-driven look at benchmark contamination, leaderboard gaming, and whether public AI benchmarks can still tell us anything useful about model capabilities.
