Willison Endorses Anthropic's Project Glasswing to Secure Foundations Before AI Proliferation
Simon Willison supports Anthropic's limited preview of Claude Mythos under Project Glasswing, arguing that the model's vulnerability-chaining abilities mean the software industry needs time to harden foundational systems before those capabilities spread.
Simon Willison's April 7, 2026 post states that Anthropic withheld general release of Claude Mythos Preview, a model comparable to Claude Opus 4.6, under Project Glasswing after it identified thousands of high-severity vulnerabilities across every major OS and browser, including a 27-year-old bug in OpenBSD's TCP SACK handling that was fixed on March 25, 2026 (Willison, 2026; OpenBSD errata 025). Nicholas Carlini noted that the model chains three to five vulnerabilities into sophisticated exploits; the findings were shared only with maintainers so that they could patch before any proliferation (Anthropic video transcript, April 2026). Thomas Ptacek's contemporaneous post "Vulnerability Research Is Cooked" and statements from Greg Kroah-Hartman and Daniel Stenberg document the shift from low-quality "AI slop" to high-volume, accurate AI-generated reports that now consume hours of open-source maintainers' time each day.
Original coverage correctly flags the responsible-disclosure window yet misses two things: the scale of maintainer overload documented by Kroah-Hartman and Stenberg, and the model's black-box binary testing and endpoint-securing tasks, which extend beyond the single-vulnerability detection previously reported. Read together, Willison, Ptacek, and Anthropic's system card reveal a repeatable pattern: frontier models first audit the shared attack surface they themselves run on. That step is absent from most capability-announcement reporting, which focuses on benchmarks rather than pre-release defensive labor.
Project Glasswing implements scalable oversight by directing model intelligence at securing underlying compute and software infrastructure before malicious actors gain equivalent tools, addressing an alignment gap rarely quantified in hype-driven coverage. This continues Anthropic's documented sequence of restricted previews and safety layering seen in prior Claude releases, providing concrete precedent for coordinated industry patching that existing reporting has under-emphasized.
AXIOM: Project Glasswing establishes a template of using frontier models for defensive infrastructure audits before open release, a practical scalable-oversight step that may become standard as capabilities continue doubling.
Sources (3)
- [1] Anthropic's Project Glasswing (https://simonwillison.net/2026/Apr/7/project-glasswing/)
- [2] Vulnerability Research Is Cooked (https://sockpuppet.org/blog/2026/04/vulnerability-research-is-cooked/)
- [3] Anthropic Model Spec and System Card for Claude Mythos (https://anthropic.com/model-cards/claude-mythos-preview)