Recent Articles - Page 21

Latest News

Reasoning Leaks, Hard Limits, and Self-Aware LLMs

Reasoning Leaks, Hard Limits, and Self-Aware LLMs

Three new papers expose how reasoning traces can be extracted from supposedly hidden model internals, where chain-of-thought hits an architectural ceiling, and how RL teaches models to know when to quit.

View All News →

Guides

View All →

Reviews

View All →

Leaderboards

View All →
AI Image Generation Leaderboard: Best Models 2026

AI Image Generation Leaderboard: Best Models 2026

Current rankings of the best AI image generation models, including GPT Image 2, Nano Banana 2, Recraft V4.1, HiDream-O1-Image, FLUX 2, Midjourney v8.1, and Ideogram 3.0, scored on human preference, text rendering, and photorealism.

Models

View All →
Llama 3.3 70B Instruct

Llama 3.3 70B Instruct

Meta's Llama 3.3 70B Instruct matches Llama 3.1 405B on instruction following and math while running at 4-5x lower cost, with the lowest hallucination rate of any open-weight model on the Vectara summarization leaderboard.

Cohere Command A+

Cohere Command A+

Cohere Command A+ is a 218B sparse MoE model with Apache 2.0 license, native citations, and a 128K context window that runs on just two H100 GPUs.

NVIDIA Cosmos 3

NVIDIA Cosmos 3

NVIDIA Cosmos 3 is an open physical AI omnimodel with Mixture-of-Transformers architecture that natively handles text, images, video, sound, and robot actions in a single 16B or 64B model.

Recent

Apple Opens iOS 27 to Claude, Gemini, ChatGPT

Apple Opens iOS 27 to Claude, Gemini, ChatGPT

Apple's iOS 27 'Extensions' feature lets users swap Claude, Gemini, or ChatGPT into Siri, Writing Tools, and Image Playground - the first time rival AI models can power Apple Intelligence natively.

Inside Anthropic's $200B Google Cloud Compute Bet

Inside Anthropic's $200B Google Cloud Compute Bet

Anthropic has committed $200 billion to Google Cloud over five years - the largest cloud contract in AI history - alongside a 3.5 GW TPU capacity deal with Google and Broadcom coming online in 2027.