Cybersecurity

Project Glasswing Found 10,000 Bugs in a Month—Here’s Why That Matters

What Project Glasswing Actually Is — And Why the Name Matters Anthropic launched Project Glasswing last month as a coordinated effort to find and fix vulnerabilities in the world’s most critical software before advanced AI systems can be turned against it. The framing is deliberate: this is a defensive operation, not an offensive capability showcase. ... Read more

Project Glasswing Found 10,000 Bugs in a Month—Here’s Why That Matters
Illustration · Newzlet

What Project Glasswing Actually Is — And Why the Name Matters

Anthropic launched Project Glasswing last month as a coordinated effort to find and fix vulnerabilities in the world’s most critical software before advanced AI systems can be turned against it. The framing is deliberate: this is a defensive operation, not an offensive capability showcase. Working with roughly 50 partner organizations, Anthropic used its Claude Mythos Preview model to surface more than ten thousand high- or critical-severity vulnerabilities across systemically important software — all within the first weeks of the project’s existence.

The fact that Anthropic is already publishing results this early is notable. Either the findings were urgent enough to demand immediate transparency, or the announcement is timed to plant a flag — positioning Anthropic as the responsible actor in AI-and-security before competitors or regulators define that role for them. Both can be true simultaneously.

The name carries weight that most coverage has left unexamined. The glasswing butterfly — Greta oto — is striking precisely because its wings are nearly transparent. It moves through its environment mostly unseen, yet its wings are structurally fragile, easily damaged. Anthropic chose this as a metaphor for critical software infrastructure: systems that billions of people depend on daily, largely invisible to the users running on top of them, and dangerously exposed once someone knows where to press. The name signals that the vulnerability isn’t exotic — it’s inherent to the transparency and ubiquity of the software itself.

What has already shifted, according to Anthropic’s own assessment, is the fundamental bottleneck in software security. Finding vulnerabilities used to be the hard part. AI has broken that constraint. The new bottleneck is verification, disclosure, and patching — the human-speed processes that follow discovery. Ten thousand critical vulnerabilities found in weeks is not a victory. It is a stress test of every downstream system that has to respond to them.

The 10,000 Vulnerability Number: Staggering or Expected?

Ten thousand. That is the number of high- or critical-severity vulnerabilities that Anthropic and roughly 50 partners discovered across the world’s most systemically important software in a matter of weeks under Project Glasswing. To put that in perspective, a rigorous traditional security audit of a single major codebase might surface dozens of critical flaws over several months. Project Glasswing produced orders of magnitude more findings across multiple critical systems in a fraction of the time.

The tool behind that output is Claude Mythos Preview — a model Anthropic has not released to the public. That fact deserves more scrutiny than it typically receives. Anthropic is deploying capabilities in controlled, partner-gated environments that the broader security community, and the broader public, cannot access or independently evaluate. The gap between what AI can do and what AI is allowed to do in open hands is already measurable in thousands of unpatched vulnerabilities.

The 10,000 number is striking, but the harder question is how old these vulnerabilities are. Critical software doesn’t develop thousands of high-severity flaws overnight. Most of these weaknesses almost certainly sat undetected for years, possibly decades, inside code that human security teams reviewed repeatedly and considered adequately hardened. That isn’t an indictment of individual researchers — it’s a structural verdict on the ceiling of human-only security review. Manual audits are bounded by analyst hours, cognitive load, and the sheer volume of code in production systems. AI is not.

Project Glasswing has already forced a redefinition of where the bottleneck in software security actually lives. Anthropic states the constraint is no longer finding vulnerabilities — it’s the speed at which humans can verify, disclose, and patch what the AI surfaces. That is a fundamental inversion of the decades-long security model, where discovery was the hard part. The pipeline is now flooded at the remediation end, not the detection end.

The Missing Context: Software Security Was Always a Bottleneck

The Project Glasswing update contains a sentence that most coverage will breeze past: “Progress on software security used to be limited by how quickly we could find new vulnerabilities.” That admission is not a throwaway line. It is a confession that the entire software security industry spent decades operating under a structural ceiling imposed by human capacity — and accepted that ceiling as normal.

The bottleneck was real and chronic. Security teams at major enterprises routinely operated with open headcount they couldn’t fill, because qualified vulnerability researchers were scarce and expensive. Critical infrastructure — power grids, financial systems, hospital networks — ran on legacy codebases millions of lines deep, written across decades by engineers long gone. Patch cycles stretched into months, sometimes years, not because organizations were careless, but because triaging, verifying, and deploying fixes required expert human judgment at every step. The backlog was permanent by design.

That economic reality shaped everything: how vendors priced security products, how governments staffed cyber agencies, how CISOs allocated budgets. Scarcity of human expertise was not a problem to solve — it was the load-bearing assumption the entire industry built around.

Glasswing’s first-month results break that assumption directly. With Claude Mythos Preview and roughly 50 partner organizations, the project identified more than 10,000 high- or critical-severity vulnerabilities in systemically important software in weeks. The discovery rate is no longer the constraint. Verification, disclosure, and patching — the human-dependent downstream steps — are now the bottleneck.

That shift matters more than the raw number. The scarce-expert model justified slow timelines, limited scope, and accepted risk. Once AI removes discovery as the limiting factor, every remaining delay in the pipeline becomes a choice, not an inevitability. Organizations, vendors, and regulators will face direct pressure to explain why patching still takes months when finding the vulnerability took minutes. The economic and institutional structures built around human-paced security work don’t automatically adapt. Glasswing has made them visibly inadequate.

The 50-Partner Coalition: Who’s In, Who’s Missing, and Why It Matters

Anthropic reports approximately 50 partners are actively collaborating on Project Glasswing, a number large enough to signal serious institutional commitment. But the update names none of them. That omission is not a minor detail — it is the central accountability gap in an initiative handling thousands of unpatched critical vulnerabilities in production software.

The identity of those partners determines everything about how this coalition actually functions. Government agencies bring legal authority over disclosure timelines but also classification instincts that can bury findings indefinitely. Cloud providers have direct remediation leverage but competing commercial interests in how vulnerabilities are characterized. Open-source foundations operate transparently but lack the enforcement mechanisms to compel fast patching. Private enterprises move quickly but answer to shareholders, not the public. Without knowing which category — or which mix — makes up this 50-partner group, there is no way to evaluate who controls the vulnerability data, who sets disclosure schedules, or who has veto power over publishing a finding.

The coordination risk compounds the opacity problem. Fifty separate organizations are now handling information on thousands of unpatched high- and critical-severity flaws in the world’s most widely deployed software. Each organization has its own security posture, its own insider threat profile, and its own legal jurisdiction. A single leak from any one of those partners — accidental or deliberate — hands adversaries a list of confirmed, exploitable vulnerabilities before patches exist. The Glasswing update does not address how data access is tiered, how partners are vetted, or what happens if one of the 50 suffers a breach.

Coalitions of this size have failed before. The more parties with access to sensitive findings, the more attack surface the coalition itself presents. Anthropic has built something with genuine scale. The missing transparency around who is inside that structure, and what rules govern them, is a serious question the project has not yet answered.

The Arms Race Framing: Defense Today, Offense Tomorrow?

Anthropic’s own language gives the game away. Project Glasswing exists, in the company’s words, to secure critical software “before increasingly capable AI models can be turned against it.” That is not a hypothetical. That is a company that builds frontier AI systems telling the public that offensive AI cyber capabilities are coming — and that the clock is already running.

The dual-use problem sits at the center of everything Glasswing represents. Claude Mythos Preview, the model powering the project’s 10,000-vulnerability sweep, does not distinguish between a researcher working under a responsible disclosure agreement and an adversary with no such constraints. The capability that scanned the world’s most systemically important software for critical flaws this month is the same capability that could be pointed at that software by anyone with equivalent access. Glasswing is simultaneously the solution and the clearest possible proof that the problem is real.

Most coverage of the project will stop at the headline number. Ten thousand vulnerabilities found and fixed is a genuine achievement, and the framing of a defensive race against a coming offensive wave is compelling enough that few journalists will push further. The harder question is this: Anthropic built the tool that makes this race necessary, is now running ahead in that race, and is asking the public to accept that a defensive head start is the same thing as safety.

That is not an argument against Project Glasswing. Patching critical infrastructure before adversaries exploit it is straightforwardly better than not patching it. But the logic Anthropic uses to justify the project also confirms that the AI-powered attacker it fears is not some distant, speculative threat. It is the near-term consequence of the same research program that produced Claude Mythos Preview. The 50 partner organizations now racing to verify and patch vulnerabilities faster than AI can find new ones are not solving the underlying tension. They are managing it, one disclosure at a time, inside a system that is accelerating faster than the patch cycle can absorb.

What Comes Next — And What We Should Be Demanding to Know

Anthropic called the Project Glasswing update an “initial update” — language that signals more phases are coming. No public roadmap exists. No timeline, no scope targets, no defined success metrics have been shared. That makes independent accountability nearly impossible. The public is being asked to trust a private company’s judgment on the security of the world’s most critical software, with no external benchmark to measure progress against.

The questions journalists should be pressing are specific. Are the 10,000-plus high- and critical-severity vulnerabilities being patched before any disclosure goes public, or are some sitting in a queue while the clock ticks? Does Anthropic and its roughly 50 partners operate under a coordinated disclosure policy, and if so, who enforces it? What happens to this data — a concentrated map of the worst flaws in globally critical software — if one of those 50 partners is breached? That last question alone represents a catastrophic risk that nobody in current coverage is seriously examining.

The deeper policy problem is structural. A private AI company has appointed itself the primary actor in a global critical infrastructure security effort. That is not a criticism of the ambition — the work may be genuinely valuable — but it is a governance problem that demands a real answer. Government cybersecurity agencies like CISA in the United States exist precisely because decisions about national infrastructure security carry public consequences. There is no indication that Project Glasswing operates under formal government oversight, mandatory reporting requirements, or any public transparency standard.

The scale AI enables in vulnerability discovery is real. Anthropic is correct that the bottleneck has shifted from finding flaws to patching them. But shifting the bottleneck does not reduce the stakes of who controls the pipeline. The more powerful the discovery tool, the more dangerous the concentration of that power becomes without oversight. Demanding answers about governance, disclosure policy, and partner security is not obstructionism — it is the minimum due diligence the public is owed.

AI-Assisted Content — This article was produced with AI assistance. Sources are cited below. Factual claims are verified automatically; uncertain claims are flagged for human review. Found an error? Contact us or read our AI Disclosure.

More in Cybersecurity

See all →