
Gemma 4 Review: Google's Biggest Open-Source Bet
Google's Gemma 4 family - four models, full Apache 2.0 licensing, and benchmark scores that challenge models 10x their size - is the most consequential open-weight release of 2026 so far.

Google's Gemma 4 family - four models, full Apache 2.0 licensing, and benchmark scores that challenge models 10x their size - is the most consequential open-weight release of 2026 so far.

Microsoft's MAI division releases MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 - beating OpenAI and Google on key benchmarks while signaling a strategic break from exclusive reliance on its OpenAI partnership.

A practical guide to picking the right AI model for your needs, comparing ChatGPT, Claude, Gemini, Perplexity, and Copilot across writing, coding, research, and more.

NVIDIA's new Nemotron-Cascade-2-30B-A3B activates just 3B parameters per token, runs on a single RTX 4090, and outscores NVIDIA's own 120B model on coding and math benchmarks.

OpenAI released GPT-5.4 mini and nano on March 17, bringing near-flagship performance at 70% and 92% lower cost respectively.

GPT-5.4 brings native computer use, a 1M token context window, and serious coding muscle to OpenAI's mainline model - but at a premium price.

OpenAI posted '5.4 sooner than you Think.' one hour after launching GPT-5.3 Instant. Here is what the Codex repo leaks already told us - and what the tweet does not.

Google quietly launched Gemini 3.1 Flash-Lite Preview - the first Flash-Lite in the Gemini 3 series - while announcing Gemini 3 Pro will shut down March 9. Developers have six days to migrate.

OpenAI ships GPT-5.3 Instant with 27% fewer hallucinations, a less preachy tone, and better web search - available now across all ChatGPT tiers and the API.

Two pull requests in OpenAI's public Codex GitHub repo referenced GPT-5.4 before being scrubbed - one adding full-resolution vision support, the other a fast mode toggle. Seven force pushes and a deleted employee screenshot confirm this was not intentional.

Google's Gemini 3.1 Pro more than doubles its predecessor's reasoning scores and introduces adjustable thinking modes, but latency issues and preview-status quirks keep it from a clean sweep.

Alibaba releases four Qwen 3.5 medium models - Flash, 35B-A3B, 122B-A10B, and 27B - that match or beat the previous 235B flagship at a fraction of the compute. The 35B model activates just 3 billion parameters and still outperforms Qwen3-235B-A22B.