Claude Mythos Preview

Jun 2026

Assess

Claude Mythos Preview (now Claude Mythos 5) is Anthropic's restricted frontier model, part of Project Glasswing — a cybersecurity initiative for ~40 partner organisations. On June 9, 2026, Anthropic shipped Claude Fable 5 as the first publicly available Mythos-class model, and simultaneously named the Glasswing-exclusive variant Claude Mythos 5, now listed at $10/$50 per MTok for partners.

Why It's in Assess — not Adopt or Trial

Mythos 5 is not available to the general public. You cannot procure it, you cannot call it from the Anthropic API without Project Glasswing partner access, and the model's offensive cybersecurity capabilities remain the reason Anthropic restricts it to vetted infrastructure organisations. Engineering teams cannot adopt or trial something they cannot access.

It earns an Assess slot rather than being left off the radar because:

The benchmark delta is material. A jump from 80.8% (Claude Opus 4.6) to 93.9% on SWE-bench Verified was the largest single-release gain on that benchmark since it was introduced. Claude Fable 5 (the public Mythos-class release) now scores 95.0% on SWE-bench Verified — the Mythos architecture has been validated publicly.
Partner access is real. Anthropic's Project Glasswing partners include Amazon, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorgan Chase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks, plus ~40 additional critical-infrastructure organisations. That is a production-deployed cohort, even if narrow.
Fable 5 validates the Mythos ceiling. Fable 5 (public, June 9) outperforms the Mythos Preview benchmarks, confirming that the Mythos architecture is genuine frontier capability — not lab performance.
Financial-sector attention is real. On April 10, Federal Reserve Chair Powell and Treasury Secretary Bessent briefed major U.S. bank CEOs on Mythos's capabilities — an unusual step that signals the model's capabilities have moved beyond the tech industry.

Reported Benchmark Scores

Benchmark	Mythos Preview	Claude Opus 4.6 (reference)
SWE-bench Verified	93.9%	80.8%
SWE-bench Multilingual	87.3%	77.8%
SWE-bench Multimodal	59.0%	27.1%
SWE-bench Pro	77.8%	—
CyberGym	83.1%	66.6%
Terminal-Bench 2.0	82.0%	65.4%
USAMO 2026	97.6%	~42.3%
GPQA Diamond	94.5%	~91.3%
MMMLU	92.7%	—

All Mythos numbers are vendor-reported and have not been independently reproduced, because third-party evaluators do not have API access to the model. Treat as directional until corroborated. The sole benchmark where Mythos does not lead: MMMLU, where Gemini 3.1 Pro scores 92.6–93.6, essentially tied.

What Mythos Can Do (Vulnerability Research)

The capability that triggered the restricted release: Mythos autonomously identifies, exploits, and chains zero-day vulnerabilities without human guidance after the initial prompt. Specific confirmed examples:

17-year-old FreeBSD RCE (CVE-2026-4747): Unauthenticated remote code execution giving full server control. Fully discovered and exploited autonomously.
27-year-old OpenBSD crash: A remote crash vulnerability surviving three decades of expert review and fuzzing.
16-year-old FFmpeg bug: Missed across more than 5 million automated test runs; found by Mythos through code reasoning.
Linux kernel privilege escalation chain: A multi-step exploit chain requiring discovery of two cooperating bugs and their interaction — exactly the kind of vulnerability automated tools cannot find.

Vulnerability chaining is the key differentiator over earlier models: Mythos strings together three, four, or sometimes five independent vulnerabilities into a single sophisticated exploit path, producing outcomes that no single vulnerability would yield on its own. This shifts the threat model for defenders.

What Project Glasswing Is

Project Glasswing is a ~$100M initiative in model-usage credits and foundation donations ($2.5M to Alpha-Omega/OpenSSF via the Linux Foundation, $1.5M to the Apache Software Foundation). Partners receive Mythos Preview access specifically to scan first-party and open-source software for vulnerabilities.

The framing matters: Anthropic is positioning the restricted release as a safety measure, not as a commercial preview. There is no stated GA date. The Cloud Security Alliance, SANS Institute, and OWASP issued a joint report concluding that organisations are "likely to be overwhelmed" in the near term by threat actors using AI to find and exploit vulnerabilities faster than defenders can patch — though attackers "still face a heavier relative burden due to the inherent limitations of patching."

Impact Update — June 2026

Epoch AI's analysis of NVD data (Epoch AI — CVE Severity Spike) shows that organisations disclosed approximately 1,500 high- and critical-severity CVEs in June 2026 — more than 3.5× the previous monthly record. Epoch AI directly attributes the spike to Project Glasswing: partners used Mythos to conduct large-scale vulnerability scans, then disclosed findings in a compressed window. The initiative claims to have found over 10,000 high- or critical-severity vulnerabilities to date. OpenAI launched a comparable initiative called Daybreak in the same period.

The caveat Epoch AI flags: publicly disclosed CVEs are a fraction of total discovered vulnerabilities. The data shows an unprecedented disclosure wave, but does not indicate whether the same capabilities also accelerated attacker discovery. This is the core unresolved tension: Project Glasswing demonstrates that Mythos finds real vulnerabilities at scale — and those same capabilities, accessed by adversaries, would compress the defender timeline significantly.

When NOT to Plan Around Mythos

Procurement roadmaps. Do not write RFPs or budget forecasts that assume Mythos API access in the next two quarters. Treat it as unavailable until Anthropic announces otherwise.
Benchmark comparisons for model selection. Mythos scores should not anchor purchasing decisions between Sonnet 4.6, Opus 4.6, and GPT-5.4 today. Compare the models teams can actually deploy.
Vulnerability disclosure workflows. If your organisation is not a Glasswing partner, you cannot currently use Mythos for defensive security work — and independent evaluation of its false-positive rate is not public.

Key Characteristics

Property	Value
Provider	Anthropic
License	Proprietary
Pricing	$10/M input, $50/M output (Glasswing partners only)
Context window	Not publicly disclosed
Parameters	Not publicly disclosed
Architecture	Not publicly disclosed
API model ID	`claude-mythos-5` (Glasswing partners only)
Status	Restricted GA — Project Glasswing partners only
Website	anthropic.com/glasswing