
GPT-5.4
OpenAI's most capable frontier model combines native computer use, 1M-token context, and three variants at $2.50/$15 per million tokens.
They summarize our coverage. We write it.
Newsletters like this one rebroadcast our headlines - often without the full review, the source reading, or the analysis underneath. Our weekly briefing sends the work they paraphrase, straight from the desk, before they get to it.
Free, weekly, no spam. One email every Tuesday. Unsubscribe anytime.

OpenAI's most capable frontier model combines native computer use, 1M-token context, and three variants at $2.50/$15 per million tokens.

OpenAI ships GPT-5.4 with built-in computer use that beats human desktop performance, a 1 million token context window, and native Excel and Google Sheets integrations.

GPT-5.3 Instant launched March 3, 2026, cutting hallucinations by 26.8% and overhauling ChatGPT's tone - but with documented safety regressions in the process.

OpenAI posted '5.4 sooner than you Think.' one hour after launching GPT-5.3 Instant. Here is what the Codex repo leaks already told us - and what the tweet does not.

OpenAI ships GPT-5.3 Instant with 27% fewer hallucinations, a less preachy tone, and better web search - available now across all ChatGPT tiers and the API.

A data-driven comparison of xAI's Grok 4 and OpenAI's ChatGPT powered by GPT-5.2, covering benchmarks, pricing, features, and real-world performance.

Two pull requests in OpenAI's public Codex GitHub repo referenced GPT-5.4 before being scrubbed - one adding full-resolution vision support, the other a fast mode toggle. Seven force pushes and a deleted employee screenshot confirm this was not intentional.

GPT-5.2 is OpenAI's most capable model with three modes, 400K context, and record-setting professional benchmarks - but speed and pricing raise questions.

A thorough review of ChatGPT in 2026 - OpenAI's flagship product powered by GPT-5.2 and o3 reasoning, covering all tiers from Free to the $200/month Pro plan, with honest takes on what works and what doesn't.

OpenAI's Atlas combines a Chromium browser with GPT-5.2 agent capabilities. It browses, books, shops, and researches on your behalf - when it works. We tested it for two weeks to find out how often that is.

Perplexity's new Computer product breaks tasks into sub-agents routed across Claude, Gemini, GPT-5.2, and Grok, running autonomously for days or months in isolated cloud sandboxes. Available now for Max subscribers at $200/month.

A King's College London study pitted Claude Sonnet 4, GPT-5.2, and Gemini 3 Flash against each other in nuclear crisis simulations. Tactical nuclear weapons were deployed in 20 out of 21 games. No model ever surrendered.