Name: GPT-5.5 Instant
Author: OpenAI

GPT-5.5 Instant is the new default ChatGPT model as of May 5, 2026, replacing GPT-5.3 Instant across all tiers - Free, Plus, Pro, Team, and Enterprise. The "Instant" name follows the same pattern OpenAI has used since GPT-5.2: it's the fast conversational version of the current generation, tuned for everyday chat rather than extended reasoning chains. It runs on the same underlying architecture as GPT-5.5 (released April 23), but with reasoning effort set to minimal by default.

TL;DR

New ChatGPT default (May 5, 2026): replaces GPT-5.3 Instant, 52.5% fewer hallucinations on high-stakes queries
AIME 2025 score jumps from 65.4 to 81.2; MMMU-Pro multimodal reasoning up from 69.2 to 76
API pricing is $5/$30 per million tokens - same as full GPT-5.5, not a budget tier like GPT-5.3 Instant was

The release is two months after GPT-5.3 Instant, which drew criticism for safety regressions - its graphic violence and sexual content compliance scores fell notably versus its predecessor. GPT-5.5 Instant doesn't come with the same caveats. OpenAI's system card for this update doesn't flag similar regressions, and the model's improvements focus on factual accuracy rather than behavioral loosening.

One distinction worth noting upfront: unlike GPT-5.3 Instant, which had its own API pricing tier at $1.75/$14 per million tokens, GPT-5.5 Instant has no separate endpoint. In the API, you access this behavior through gpt-5.5 with reasoning_effort: "minimal" - and you pay the same $5/$30 rate as the full model.

Key Specifications

Specification	Details
Provider	OpenAI
Model Family	GPT-5
Parameters	Not disclosed
Context Window	1M tokens (API); varies in ChatGPT web UI
Max Output Tokens	128,000 tokens
Input Price	$5.00/M tokens
Output Price	$30.00/M tokens
Cached Input	$0.50/M tokens
API Model Name	`gpt-5.5` (use `reasoning_effort: "minimal"` for Instant behavior)
ChatGPT Alias	`chat-latest`
Knowledge Cutoff	December 2025
Release Date	May 5, 2026
Modalities	Text in, Image in, Text out
License	Proprietary

Benchmark Performance

OpenAI published Instant-specific metrics comparing GPT-5.5 Instant against GPT-5.3 Instant - the predecessor rather than a cross-model leaderboard. For standard reasoning benchmarks, the model shares its architecture with GPT-5.5, so the full model's scores apply where reasoning effort is the controlling factor.

Evaluation	GPT-5.5 Instant	GPT-5.3 Instant	Change
AIME 2025 (math)	81.2	65.4	+15.8
MMMU-Pro (multimodal reasoning)	76.0	69.2	+6.8
HealthBench	51.4	49.6	+1.8
HealthBench Professional	38.4	32.9	+5.5
Hallucinations (high-stakes, with web)	-52.5% vs baseline	Baseline	-
User-flagged factual errors	-37.3% vs baseline	Baseline	-

The AIME improvement is the most striking number. A 15.8-point jump on a math competition benchmark suggests the underlying GPT-5.5 architecture delivers truly stronger reasoning - not just a tuning adjustment. The MMMU-Pro gains (+6.8 points) point to improved multimodal understanding across text-image tasks.

The hallucination figures need the standard caveat: these come from OpenAI's internal evaluations on high-stakes domains (medicine, law, finance) and haven't been independently replicated yet. The 52.5% figure is based on the same evaluation corpus used for previous Instant releases, which makes comparisons internally consistent, but doesn't tell you how the model performs on other hallucination benchmarks. See the hallucination benchmarks leaderboard for third-party tracking.

For broader positioning: GPT-5.5 (full model, same architecture) scores 93.6% on GPQA Diamond, 58.6% on SWE-Bench Pro, and 82.7% on Terminal-Bench 2.0. These numbers reflect high-reasoning-effort runs and aren't specific to the Instant configuration, but they establish the ceiling of the architecture.

Factuality benchmark comparison showing GPT-5.5 Instant's hallucination reduction across high-stakes domains OpenAI's internal factuality benchmarks for GPT-5.5 Instant versus GPT-5.3 Instant across medicine, law, and finance prompts. Source: decrypt.co

Key Capabilities

Hallucination reduction and factual accuracy. The headline claim is 52.5% fewer hallucinated claims on high-stakes prompts when the model uses web search, and 37.3% fewer on conversations users had previously flagged for factual errors. These improvements concentrate in medicine, law, and finance - domains where the previous Instant generation was already a weak point. The without-web improvement is narrower but real: the model's baseline factual recall is better than GPT-5.3 Instant independently of retrieval.

Personalization from memory and Gmail. GPT-5.5 Instant introduces "memory sources" - a transparency layer showing users which context ChatGPT used to personalize a response, including saved memories, past chats, and connected Gmail data. Plus and Pro subscribers get Gmail integration first; Free, Go, Business, and Enterprise users are in a later rollout wave. Users can inspect, delete, or correct memory sources, and temporary chats remain fully available for sessions that shouldn't touch memory at all.

Response conciseness. Responses are 30.2% shorter by word count and 29.2% shorter by line count compared to GPT-5.3 Instant. The formatting shift is notable: fewer unsolicited bullet points, fewer trailing emoji, less throat-clearing before the actual answer. This isn't just stylistic - shorter responses at the same accuracy level directly reduces output token cost in API use cases.

Multimodal input. Image understanding is available across all tiers. The MMMU-Pro improvement (+6.8 points) suggests real gains in complex visual reasoning tasks, not just basic captioning. Audio and video input aren't supported in the API; those remain on separate model tracks.

ChatGPT interface showing the new default GPT-5.5 Instant model with improved response quality ChatGPT's default model updated to GPT-5.5 Instant on May 5, 2026, visible across Free, Plus, and Pro tiers. Source: decrypt.co

Pricing and Availability

GPT-5.5 Instant is available to all ChatGPT users as of May 5, 2026 - no tier gate. The API situation is different from previous Instant generations: there's no separate gpt-5.5-instant model ID. Developers who want the low-latency conversational configuration call gpt-5.5 with reasoning_effort: "minimal" and pay the standard $5/$30 per million tokens.

This pricing represents a significant change from GPT-5.3 Instant, which had its own API tier at $1.75 input / $14.00 output per million tokens. That gap matters for high-volume applications. If you were using GPT-5.3 Instant for cost efficiency in chat applications, switching to GPT-5.5 Instant means a roughly 2.9x increase in input costs and a 2.1x increase in output costs. For latency-critical use cases where GPT-5.5's deeper reasoning modes aren't needed, this is worth tracking.

GPT-5.3 Instant is deprecated but will remain available to paid ChatGPT users for approximately three months before retirement. API users can pin to explicit model snapshots to preserve GPT-5.3 Instant behavior while the window is open.

For cost comparison: Claude Opus 4.7 runs at higher per-token rates for reasoning workloads. At the conversational, low-reasoning-effort tier, GPT-5.5 Instant's $5/$30 pricing sits in the mid-range - cheaper than top-tier reasoning models but no longer the budget option the Instant line used to represent.

Memory sources interface showing ChatGPT's new transparency controls for personalization data The new "memory sources" panel shows users which context GPT-5.5 Instant used to personalize a response - and allows deletions. Source: decrypt.co

Strengths and Weaknesses

Strengths

Sizable hallucination reduction (52.5% on high-stakes prompts) with no documented safety regressions
AIME 2025 score of 81.2 - a 15.8-point jump from GPT-5.3 Instant, showing real reasoning gains
Concise responses by default: 30.2% fewer words, fewer unnecessary emoji and bullet points
Gmail and memory personalization with full user controls (inspect, delete, or opt out via temporary chats)
Available across all ChatGPT tiers right away, including Free tier

Weaknesses

API pricing ($5/$30/M tokens) is 2.9x more expensive than GPT-5.3 Instant was for input tokens - no budget tier for Instant-class conversational work
Hallucination benchmark numbers are OpenAI internal evaluations, not independently confirmed
Gmail personalization limited to Plus and Pro initially; Free/Go/Business/Enterprise timeline not specified
Context window in ChatGPT web UI is unspecified - the 1M token limit is API-only
Knowledge cutoff remains December 2025 despite the May 2026 release

FAQ

What replaced GPT-5.3 Instant?

GPT-5.5 Instant, released May 5, 2026, is the new default ChatGPT model. GPT-5.3 Instant remains accessible to paid users for roughly 3 months before retirement.

Is GPT-5.5 Instant cheaper than GPT-5.3 Instant?

No. GPT-5.5 Instant is priced at $5/$30 per million tokens, the same as GPT-5.5. GPT-5.3 Instant was $1.75/$14 per million tokens, making GPT-5.5 Instant about 2.9x more expensive for input tokens.

What API model name do I use for GPT-5.5 Instant?

Use gpt-5.5 with reasoning_effort: "minimal" in the API. The chat-latest alias also resolves to this model. There's no separate gpt-5.5-instant model ID.

Does GPT-5.5 Instant have safety regressions like GPT-5.3 Instant did?

No documented regressions have been published for GPT-5.5 Instant. GPT-5.3 Instant had measurable drops in sexual content and graphic violence compliance; this update does not show similar issues in OpenAI's system card.

When will Free-tier users get Gmail personalization?

OpenAI hasn't given a specific timeline. The initial rollout covers Plus and Pro users on the web, with Free, Go, Business, and Enterprise described as "coming later."

GPT-5.5 - The full API model that shares GPT-5.5 Instant's architecture
GPT-5.3 Instant - The predecessor model this replaces
OpenAI Launches GPT-5.5 for Agents and Work - News coverage of the full GPT-5.5 launch
GPT-5.5 Brings Mythos-Like Hacking to the Masses - Coverage of GPT-5.5's cybersecurity capabilities
Chatbot Arena ELO Rankings - Human preference rankings across models
Cost Efficiency Leaderboard - Per-token pricing comparison across all major models
Hallucination Benchmarks Leaderboard - Third-party hallucination tracking