Name: GPT-5.5-Cyber
Author: OpenAI

Overview

OpenAI launched GPT-5.5-Cyber on June 22, 2026, as part of the Patch the Planet effort - a coordinated push to direct frontier AI capabilities toward defensive security at scale. The model is a fine-tune of GPT-5.5 with lowered refusal thresholds on security-relevant tasks, distributed through OpenAI's Daybreak Cyber Partner Program to enterprise security vendors. Like its predecessor GPT-5.4-Cyber, it isn't available through a public API.

TL;DR

Fine-tune of GPT-5.5 for defensive security work, with 85.6% on CyberGym (up from 81.8% for base GPT-5.5)
Access restricted to enterprise security vendors through the Daybreak Cyber Partner Program - Trail of Bits, HackerOne, Cisco, CrowdStrike, IBM, Palo Alto Networks, Accenture, Okta, and Wiz among named partners
Predecessor GPT-5.4-Cyber scored 88.23% on professional CTFs; GPT-5.5-Cyber targets practical enterprise deployment over raw CTF performance

The Patch the Planet project, announced June 22, 2026, sits under OpenAI's broader Daybreak umbrella and marks a shift in scope. Where Daybreak originally focused on research preview users and security tool integrations, Patch the Planet routes GPT-5.5-Cyber directly to commercial security vendors - companies that sell vulnerability management, threat intelligence, and penetration testing products. The partners aren't running experiments; they're embedding the model into shipping software.

The launch followed a series of high-profile vulnerability discoveries attributable to the model's predecessor. GPT-5.4-Cyber found a WebAssembly flaw in Firefox significant enough that five of six Pwn2Own Berlin entries withdrew rather than compete against it - a result documented in Pwn2Own Berlin 2026's capacity overflow. The Firefox flaw, five exploitable bugs in Chrome V8, more than ten WebKit vulnerabilities, a 23-year-old kernel flaw in OpenBSD, eight pointer leak proof-of-concepts plus 24 local privilege escalation exploits in the Linux kernel, and vulnerabilities in dnsmasq all came through the cyber model line. GPT-5.5-Cyber is the generation that follows that track record.

Key Specifications

Specification	Details
Provider	OpenAI
Model Family	GPT-5
Base Model	GPT-5.5
Parameters	Not disclosed
Context Window	1M tokens (inherited)
Pricing	Not disclosed, Daybreak enrollment required
Release Date	June 22, 2026
License	Proprietary, restricted access
Availability	Daybreak Cyber Partner Program (enterprise vendors)
Initiative	Patch the Planet / Trusted Access for Cyber

For architecture, context window behavior, and general pricing, see the GPT-5.5 model page.

Benchmark Performance

OpenAI published the CyberGym result as the headline evaluation for GPT-5.5-Cyber. CyberGym measures performance across a structured set of defensive security scenarios including vulnerability discovery, triage, and exploit analysis.

Benchmark	GPT-5.5-Cyber	GPT-5.5 (base)	GPT-5.4-Cyber	Claude Mythos Preview
CyberGym	85.6%	81.8%	Not published	83.1%
Professional CTFs (pass@12)	Not published	-	88.23%	-
CVE-Bench web vulns (pass@1)	Not published	-	86.27%	-
UK AISI expert CTF	Not published	-	Not evaluated	73%
SWE-bench Verified	Not published	-	-	93.9%

The 3.8-point CyberGym improvement over base GPT-5.5 (85.6% vs 81.8%) comes from fine-tuning rather than architecture changes. Against Claude Mythos Preview's 83.1% on the same benchmark, GPT-5.5-Cyber leads by 2.5 points - a reversal from the GPT-5.4-Cyber era where Mythos led on CyberGym. The comparison is limited because Mythos was assessed on an older CyberGym configuration and the two models are measured differently on most other cyber benchmarks.

The predecessor's CTF numbers (88.23% pass@12 on professional CTFs) remain the best published scores for the line. OpenAI hasn't released equivalent CTF data for GPT-5.5-Cyber, which either means the benchmark hasn't been run on the same track or the results are being held back. The absence of a CTF score matters less for the Patch the Planet use case, which is oriented toward enterprise vulnerability workflows, not point-in-time contest performance.

A laptop screen with code and a lock icon visible, representing software vulnerability research GPT-5.5-Cyber is built for practical vulnerability discovery workflows - feeding compiled binaries and source code through an AI that can flag exploitable patterns without human-guided triage at each step. Source: pexels.com

Key Capabilities

Practical Vulnerability Discovery

The headline capability is the same as GPT-5.4-Cyber's: lowered refusal thresholds on legitimate security tasks that the base GPT-5.5 model declines. Binary reverse engineering, malware dissection, exploit chain analysis, and CVE triage all run without the friction the public API imposes. The improvement over the 5.4 generation targets depth over CTF coverage - better handling of real enterprise codebases, longer dependency chains, and multi-file vulnerability patterns that don't surface cleanly in contest-format evaluations.

The vulnerability track record attributable to the model line is the most concrete evidence of capability. The Firefox WebAssembly flaw was severe enough to reshape Pwn2Own competition dynamics. The Linux kernel work - eight pointer leak PoCs plus 24 LPE exploit chains - covered multiple subsystems in parallel. The 23-year-old OpenBSD kernel flaw and the dnsmasq findings both required understanding of deep system internals with no public writeup to guide the model.

Partner Ecosystem Integration

GPT-5.5-Cyber is positioned differently from GPT-5.4-Cyber. The 5.4 model targeted individual researchers and security teams through the Trusted Access for Cyber verification process. The 5.5 variant goes to vendors - Trail of Bits, HackerOne, Cisco, CrowdStrike, IBM, Palo Alto Networks, Accenture, Okta, and Wiz are the nine named Patch the Planet partners. These companies embed the model into their existing security products rather than running it as a standalone tool.

That's a structural change in how OpenAI reaches the defender community. A CrowdStrike integration means GPT-5.5-Cyber's analysis runs inside Falcon workflows; a HackerOne integration means bug bounty triage gets AI-assisted severity scoring. The model reaches security operations teams who'd never apply for TAC access directly.

Codex Security Pipeline

Like its predecessor, GPT-5.5-Cyber connects to OpenAI's Codex Security infrastructure. The Codex Security beta scanned 1.2 million commits, surfaced 792 critical and 10,561 high-severity issues, and shipped 14 CVEs across OpenSSH, GnuTLS, Chromium, PHP, libssh, and gpg-agent - all before Daybreak formalized the commercial program. GPT-5.5-Cyber brings a stronger base model into that pipeline.

Digital padlock and shield symbol representing enterprise security software The Patch the Planet partner list - Trail of Bits, HackerOne, Cisco, CrowdStrike, IBM, Palo Alto Networks, Accenture, Okta, Wiz - reaches security operations teams through existing enterprise product integrations. Source: pexels.com

Pricing and Availability

OpenAI hasn't published pricing for GPT-5.5-Cyber. Access runs through the Daybreak Cyber Partner Program, which requires enrollment through OpenAI sales rather than a self-serve API key. The base GPT-5.5 lists at $5/$30 per million input/output tokens, but that pricing doesn't automatically carry over to the cyber variant - previous cyber models have all been priced under separate negotiated arrangements.

Access tiers

The Daybreak program structures access across three levels:

Standard GPT-5.5 via the public API - available to any developer
Trusted Access for Cyber - verified defenders (thousands enrolled) with reduced refusal thresholds on security tasks
Daybreak Cyber Partner Program - enterprise security vendors embedding the model into commercial products; GPT-5.5-Cyber lives here

Individual researchers aren't the primary audience for GPT-5.5-Cyber. The previous generation's TAC program scaled to verified defenders and independent consultants. Patch the Planet routes the 5.5 model to the vendors those defenders use. If you're an individual security researcher looking for elevated access, the GPT-5.4-Cyber TAC enrollment path remains the relevant option while the 5.5 program matures.

Strengths and Weaknesses

Strengths

Leads published CyberGym comparisons at 85.6% - 2.5 points above Claude Mythos Preview on that benchmark
3.8-point improvement over base GPT-5.5 confirms fine-tuning adds measurable security capability
Partner model (Trail of Bits, CrowdStrike, HackerOne, etc.) reaches defenders through tools they already use
Predecessor's real-world vulnerability record - Firefox WebAssembly, Linux kernel LPE chains, OpenBSD kernel, dnsmasq - gives the model line credibility beyond benchmark numbers
Inherits GPT-5.5's 1M context window for long codebases and multi-file analysis
Integrated into Codex Security pipeline with an established track record of shipped CVEs

Weaknesses

No CTF benchmark data published - the 88.23% professional CTF score belongs to GPT-5.4-Cyber, not this model
Partner-only access excludes individual researchers and small teams who were the target audience for TAC
No public pricing - procurement planning is opaque until vendor relationships are established
Mythos Preview leads significantly on coding benchmarks (93.9% SWE-bench Verified) and was purpose-built for cyber; GPT-5.5-Cyber is still a fine-tune of a general model
CyberGym comparison with Mythos uses different evaluation configurations - the 2.5-point lead should be read with caution
Access gated through partner vendors means security teams depend on third-party integrations for model updates

OpenAI Launches Daybreak Cybersecurity Platform - The program this model ships under
GPT-5.4-Cyber - The previous generation, with published CTF benchmarks
Claude Mythos Preview - Anthropic's direct competitor in restricted cyber AI
Pwn2Own Berlin 2026 Capacity Overflow - Context for the Firefox WebAssembly flaw
GPT-5.5 - The base model
Jailbreak and Red Team Leaderboard - Safety evaluation context

FAQ

Who can access GPT-5.5-Cyber?

Enterprise security vendors enrolled in OpenAI's Daybreak Cyber Partner Program. Named partners include Trail of Bits, HackerOne, Cisco, CrowdStrike, IBM, Palo Alto Networks, Accenture, Okta, and Wiz. Individual researchers should look at the Trusted Access for Cyber program for GPT-5.4-Cyber instead.

How does GPT-5.5-Cyber differ from GPT-5.4-Cyber?

The base model upgrades from GPT-5.4 to GPT-5.5, and CyberGym improves from unpublished (5.4) to 85.6% (5.5). The distribution model shifts from individual TAC verification to enterprise vendor partnerships under Patch the Planet.

How does it compare to Claude Mythos Preview?

GPT-5.5-Cyber leads on CyberGym (85.6% vs 83.1%). Mythos leads substantially on coding benchmarks (93.9% SWE-bench Verified) and was purpose-built for cyber. The comparison is also complicated by different evaluation setups and access models.

Is pricing available?

Not publicly. Pricing runs through Daybreak partner agreements negotiated with OpenAI sales. The base GPT-5.5 ($5/$30 per million tokens) doesn't automatically apply.

What real-world vulnerabilities has the model line found?

The predecessor GPT-5.4-Cyber found a Firefox WebAssembly flaw (caused five Pwn2Own Berlin entries to withdraw), five exploitable Chrome V8 bugs, ten or more WebKit bugs, a 23-year-old OpenBSD kernel vulnerability, eight Linux kernel pointer leak PoCs plus 24 LPE exploits, and dnsmasq vulnerabilities.