Mythos Preview Reveals Powerful but Risky Code-Audit Leap

Anthropic’s Mythos Preview, tested in Project Glasswing and evaluated by Cloudflare across 50+ internal repositories, shows a major advance in security-focused LLMs: the model can stitch low-level findings into multi-step exploit chains and produce working proofs by iterating compile-and-run cycles. Cloudflare’s real-world trials produced useful, prioritized reports but exposed high false-positive rates, hallucination risks, and the need for human review, strict guardrails, logging, and least-privilege access. Testers also observed emergent refusals that complicate legitimate research. Together these reports indicate significant defensive benefits alongside serious operational, safety, and governance trade-offs for integrating powerful automated auditors at scale.

Why It Matters

Powerful code-audit LLMs like Mythos can scale vulnerability discovery and produce actionable exploit proofs, changing how security teams prioritize fixes. Tech professionals must weigh efficiency gains against elevated false positives, safety risks, and governance demands when deploying automated auditors.

Latest Changes

Mythos can stitch low-level findings into multi-step exploit chains and generate working proofs by iterating compile-and-run cycles

Project Glasswing tests report over ten thousand high- or critical-severity vulnerabilities found across partners in one month

Cloudflare trials over 50 internal repos produced useful prioritized reports but revealed high false-positive and hallucination risks

Testers observed emergent refusal behaviors that can impede legitimate security research

Operational needs include strict guardrails, detailed logging, human review, and least-privilege access

Timeline

2026-05-18 — Project Glasswing reports Mythos excels at linking low-level primitives into multi-step exploit chains

2026-05-18 — TechScan AI's Project Glasswing testing highlights Mythos's ability to generate working exploit proofs via iterate compile-and-run

2026-05-18 — Cloudflare publishes detailed findings after running Mythos Preview against 50+ internal repos

2026-05-22 — Project Glasswing initial update states ~50 partners found over ten thousand high- or critical-severity vulnerabilities in one month

Recent News (5)

Project Glasswing: An Initial Update

Anthropic reports that Project Glasswing partners using its new Mythos Preview model have discovered over ten thousand high- or critical-severity vulnerabilities in essential open-source and infrastructure software within a month, dramatically accelerating bug-finding rates. Major partners such as Cloudflare found thousands of bugs (including hundreds of high/critical) and external testers — the UK’s AI Security Institute, Mozilla, XBOW, and academic benchmarks ExploitBench/ExploitGym — all reported Mythos Preview outperformed prior models and conventional tooling. Anthropic says verification, coordinated disclosure, and patching are now the bottlenecks, and it will withhold full technical details until patches are widely deployed. The update signals a step-change in AI-assisted offensive and defensive cybersecurity capabilities and raises operational and disclosure challenges for the industry.

24pts

Zelilouiereederson3h ago

Project Glasswing: An Initial Update

Anthropic’s Project Glasswing reports that, after one month using its new Mythos Preview model, roughly 50 partners have found more than ten thousand high- or critical-severity vulnerabilities in widely used open-source and critical-infrastructure software. Partners including Cloudflare reported dramatic increases in bug-finding rates (Cloudflare: ~2,000 bugs, 400 high/critical), and external testers — the UK’s AI Security Institute, Mozilla, XBOW, and academic benchmarks ExploitBench/ExploitGym — rated Mythos Preview as significantly stronger than prior models at end-to-end exploit development and precision. Anthropic says disclosure and patching speed, not discovery, is now the bottleneck, and promises more detailed findings after coordinated disclosures and patches are broadly deployed.

255pts

Why It Matters

Latest Changes

Timeline

What to Watch

Recent News (5)