Leading AI labs and defenders are accelerating the use of powerful LLMs as security tools. OpenAI has begun offering a less-restricted GPT-5.5-Cyber to vetted cyber defenders for bug hunting, malware analysis and attack simulation, mirroring Anthropic’s limited Mythos program. Mozilla reports Mythos helped uncover 271 Firefox vulnerabilities with almost no false positives by pairing the model with a custom agent harness, build integration and a second LLM for verification. Separately, one platform’s rollout of full-context access and GPT-5.5 Instant suggests that longer context and persistent memory are shifting LLMs from episodic assistants to continuous agents, reshaping how organizations scale automated security discovery and incident response.
AI models are being repurposed as proactive security tools that can scale vulnerability discovery and incident response. Tech professionals must adapt toolchains, workflows, and access controls as models gain persistent context and higher privileges in security operations.
Dossier last updated: 2026-05-16 04:39:30
OpenAI is rolling out a less-restricted GPT-5.5 variant, dubbed GPT-5.5-Cyber or “Spud,” to vetted cyber defenders through its Trusted Access for Cyber program to help hunt bugs, analyze malware and simulate attacks. The model, reportedly close in capability to Anthropic’s Mythos, will allow approved defenders to generate proofs-of-concept, map attack surfaces and review patches while still blocking clearly malicious tasks like credential theft and malware authoring. The move comes amid tests showing GPT-5.5 can complete complex simulated attacks and a wider industry debate over safe rollouts; Anthropic has limited Mythos to about 40 organizations while OpenAI offers tiered access. The White House is considering policy responses.
Mozilla says Anthropic’s Mythos helped find 271 Firefox vulnerabilities in two months with “almost no false positives,” thanks largely to model improvements and a custom agent harness. Engineers built a harness that instructs the LLM, gives it tooling access (read/write files, run test builds), and loops until a deterministic success signal is reached — e.g., making Firefox’s sanitizer build crash to confirm memory-safety bugs. A second LLM grades results for added verification. Mozilla published detailed Bugzilla reports for a subset of the findings and provided test cases that meet its standards. The approach highlights how tightly integrated tooling and verification can make AI-assisted security discovery practical.
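The loop Mozilla describes, in which the model proposes test cases and iteration stops only on a deterministic success signal, can be sketched roughly as below. This is a minimal illustration, not Mozilla's actual harness: the model call, the sanitizer runner, and all names are hypothetical stand-ins.

```python
# Sketch of an agent harness loop: the LLM proposes a test case, the
# harness runs it against a sanitizer-instrumented build, and the loop
# ends only on a deterministic signal (a confirmed crash).
# All functions here are hypothetical stand-ins, not a real API.

from dataclasses import dataclass


@dataclass
class HarnessResult:
    crashed: bool
    log: str


def run_sanitizer_build(test_case: str) -> HarnessResult:
    # Stand-in for compiling/running the test against an ASan build;
    # here we simply flag an obviously bad input.
    if "overflow" in test_case:
        return HarnessResult(True, "AddressSanitizer: heap-buffer-overflow")
    return HarnessResult(False, "clean run")


def llm_propose(prompt: str, history: list) -> str:
    # Stand-in for a model call with file read/write tooling; we
    # simulate the model refining its guess after seeing feedback.
    return "payload-overflow" if history else "payload-benign"


def harness_loop(target: str, max_iters: int = 5):
    history = []
    for _ in range(max_iters):
        test_case = llm_propose(f"Find a memory-safety bug in {target}", history)
        result = run_sanitizer_build(test_case)
        if result.crashed:
            # Deterministic success: the sanitizer crash confirms the bug.
            return result
        history.append(f"{test_case} -> {result.log}")
    return None


result = harness_loop("nsHtml5Parser")
print(result.crashed if result else "no crash found")  # prints True
```

The key design point is that success is judged by the build system, not by the model's own claim, which is what keeps the false-positive rate low.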
The post reports that a platform (linked in the original: https://t.co/KDffBvZtGU) opened full-context access and integrated GPT-5.5 Instant within 48 hours, a change the author calls more than a model upgrade. They argue that extended context gives AI continuity, effectively a form of long-term memory, enabling persistent, system-level reasoning across long codebases, financial simulations, on-chain analytics and multi-step tasks that stateless models struggle with. The piece contends that while the industry frames parameter scaling as intelligence, context length and memory determine practical capability and workflow fit, suggesting this shift could transform AI from an episodic tool into an agent capable of sustained, coherent problem solving.
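The stateless-versus-persistent distinction the post draws can be illustrated with a toy agent that carries accumulated context across calls. The class and its memory format are assumptions for illustration only, not any vendor's API.

```python
# Toy contrast: a stateless model call forgets everything between
# requests, while a persistent agent carries prior steps forward as
# context. The PersistentAgent class is a hypothetical illustration.

class PersistentAgent:
    def __init__(self):
        # Long-lived memory; a real agent would feed this back into the
        # model's context window on every call.
        self.memory = []

    def run(self, task: str) -> str:
        prior = len(self.memory)
        self.memory.append(task)
        return f"handled '{task}' with {prior} prior steps"


agent = PersistentAgent()
agent.run("map repo layout")
agent.run("trace auth flow")
out = agent.run("draft fix for session bug")
print(out)  # prints: handled 'draft fix for session bug' with 2 prior steps
```

A stateless call would see zero prior steps every time; persistence is what lets multi-step tasks like codebase-wide reasoning stay coherent.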
Mozilla says it used Anthropic’s Mythos and a custom agent “harness” to find 271 Firefox security vulnerabilities in two months, claiming almost no false positives. The harness integrates Mythos with Mozilla’s build, fuzzing systems and test pipeline, letting the model generate and iterate test cases until a sanitizer build confirms a crash; a second LLM grades results for additional verification. Mozilla published full Bugzilla reports and test cases for a subset of the findings, noting the improvements stem from both better models and project-specific tooling integration. The work underscores how LLMs combined with deterministic verification and engineering can scale vulnerability discovery while reducing human triage.
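The "second LLM grades results" step amounts to accepting a finding only when an independent check agrees the crash evidence matches the claimed bug class. A minimal sketch follows, with a keyword match playing the grader's role; the function name and signature map are hypothetical, not Mozilla's implementation.

```python
# Sketch of second-model verification: a finding is accepted only if
# the sanitizer log matches the claimed bug class. A real system would
# ask a second LLM to judge; a signature lookup stands in here.

def grade_finding(claimed_class: str, sanitizer_log: str) -> bool:
    # Hypothetical mapping from bug class to expected sanitizer output.
    signatures = {
        "heap-buffer-overflow": "AddressSanitizer: heap-buffer-overflow",
        "use-after-free": "AddressSanitizer: heap-use-after-free",
    }
    sig = signatures.get(claimed_class)
    return sig is not None and sig in sanitizer_log


log = "==1234==ERROR: AddressSanitizer: heap-use-after-free on address 0x602000000010"
print(grade_finding("use-after-free", log))        # prints True: verified
print(grade_finding("heap-buffer-overflow", log))  # prints False: rejected
```

Layering this check over the deterministic crash signal gives two independent filters before a human ever triages the report, which is what lets the pipeline scale while keeping false positives near zero.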