
Grok 4 - xAI's Flagship Reasoning Model
Grok 4 is xAI's frontier reasoning model, the first to break 50% on Humanity's Last Exam, with a 256K context window, $3/M input pricing, and a Heavy multi-agent variant built on 200,000 GPUs.

Grok 4 is xAI's frontier reasoning model, the first to break 50% on Humanity's Last Exam, with a 256K context window, $3/M input pricing, and a Heavy multi-agent variant built on 200,000 GPUs.

A data-driven comparison of xAI's Grok 4 and OpenAI's ChatGPT powered by GPT-5.2, covering benchmarks, pricing, features, and real-world performance.

Elon Musk's deposition claims that Grok is safer than ChatGPT are undercut by xAI's own deepfake scandal and mounting regulatory scrutiny ahead of the April trial.

President Trump directed all U.S. government agencies to immediately cease using Anthropic's technology after the company refused to drop AI safety guardrails for the Pentagon. Defense Secretary Hegseth designated Anthropic a supply chain risk to national security.

Grok has grown from a chatbot into a full AI platform - SuperGrok tiers, 2M context, Imagine video, Aurora images, DeepSearch, and the Grok 4.20 beta. We review the entire ecosystem to see if xAI's ambition matches its execution.

Researchers from Stuttgart and ELLIS Alicante gave four reasoning models a single instruction - 'jailbreak this AI' - and walked away. The models planned their own attacks, adapted in real time, and broke through safety guardrails 97.14% of the time across 9 target models.