Recent Articles - Page 93

Latest News

Apple's iOS 27 Beta Ships the Multi-Model Extensions API

Apple's iOS 27 Beta Ships the Multi-Model Extensions API

iOS 27 Beta 1 is live for developers today, shipping Apple's new Extensions framework that lets Gemini, Claude, and ChatGPT plug into Siri - plus the Nvidia B200 Confidential Computing architecture that keeps those cloud queries private.

View All News →

Guides

View All →

Reviews

View All →
MiniMax M3 Review: The Price Disruptor with Caveats

MiniMax M3 Review: The Price Disruptor with Caveats

MiniMax M3 arrives as the first open-weight model to combine frontier coding, 1M-token context, and native multimodality - at a fraction of proprietary pricing - but every benchmark figure is self-reported and the weights weren't even shipped at launch.

Leaderboards

View All →
AI Image Generation Leaderboard: Best Models 2026

AI Image Generation Leaderboard: Best Models 2026

Current rankings of the best AI image generation models, including GPT Image 2, Nano Banana 2, Recraft V4.1, HiDream-O1-Image, FLUX 2, Midjourney v8.1, and Ideogram 3.0, scored on human preference, text rendering, and photorealism.

Models

View All →
Ministral 3 8B

Ministral 3 8B

Mistral AI's mid-tier open-weight edge model - 8B parameters, 256K context, Apache 2.0 license, built for agentic pipelines and cost-sensitive production workloads.

Devstral 2

Devstral 2

Mistral's open-weight coding agent model - 123B parameters, 256K context window, 72.2% on SWE-bench Verified, priced at $0.40/M input tokens.

Grok Build 0.1

Grok Build 0.1

Grok Build 0.1 is xAI's first model built specifically for agentic coding workflows, with a 256K context window, native MCP support, and always-on reasoning at $1/M input tokens.

Recent

Qwen3.5-4B

Qwen3.5-4B

Qwen3.5-4B is a 4B dense multimodal model that matches Qwen3-30B on MMLU-Pro and beats GPT-5-Nano on vision benchmarks. Runs on 8GB VRAM, Apache 2.0 licensed, 262K-1M context.

Qwen3.5-9B

Qwen3.5-9B

Qwen3.5-9B is a 9B dense model that outperforms Qwen3-30B on most benchmarks and beats GPT-5-Nano on vision tasks. Natively multimodal with 262K-1M context, Apache 2.0 licensed.

GPT-5.4 Leaked Twice in Codex Repo PRs - Here Is What We Know

GPT-5.4 Leaked Twice in Codex Repo PRs - Here Is What We Know

Two pull requests in OpenAI's public Codex GitHub repo referenced GPT-5.4 before being scrubbed - one adding full-resolution vision support, the other a fast mode toggle. Seven force pushes and a deleted employee screenshot confirm this was not intentional.

Huawei Takes Atlas 950 Global to Challenge Nvidia

Huawei Takes Atlas 950 Global to Challenge Nvidia

Huawei debuts its Atlas 950 SuperPoD at MWC Barcelona - 8,192 NPUs delivering 8 ExaFLOPS - marking its first overseas showcase of the AI supercomputer that directly targets Nvidia's cluster dominance.

AWS Trainium3 - Amazon's 3nm AI Accelerator

AWS Trainium3 - Amazon's 3nm AI Accelerator

Complete specs, benchmarks, and analysis of AWS Trainium3 - Amazon's TSMC 3nm AI chip with 2.52 PFLOPS FP8, 144GB HBM3e, and NeuronLink-v4, powering Anthropic's Claude through Project Rainier.

Etched Sohu - Transformer-Only Inference ASIC

Etched Sohu - Transformer-Only Inference ASIC

Full specs and critical analysis of the Etched Sohu - a transformer-specific ASIC claiming 500K+ tokens/sec on Llama 70B, built on TSMC 4nm with 144GB HBM3E. Bold claims, but no independent benchmarks yet.

Hailo-10H - Edge AI With On-Device LLMs

Hailo-10H - Edge AI With On-Device LLMs

Complete specs, benchmarks, and analysis of the Hailo-10H - a 2.5W edge AI accelerator with 40 TOPS INT4, on-module LPDDR4, and the ability to run LLMs and VLMs on a Raspberry Pi at 10 tokens per second.

NVIDIA Rubin CPX - Inference GPU With GDDR7

NVIDIA Rubin CPX - Inference GPU With GDDR7

Full specs, benchmarks, and analysis of the NVIDIA Rubin CPX - a purpose-built inference GPU with 128GB GDDR7, 30 PFLOPS NVFP4, and 3x faster attention versus Blackwell, targeting million-token context workloads.