Anthropic Leak Reveals 'Claude Mythos': A Super-Model Far Ahead in Cyber Capabilities
Anthropic is testing 'Claude Mythos', a next-gen model in the new 'Capybara' tier with dramatically higher scores in coding, reasoning, and cybersecurity.
Anthropic Leaks Reveal Claude Mythos Super Model with Unmatched Cybersecurity Capabilities
Multiple tech outlets reported in late March 2026 that Anthropic is internally testing a new AI model codenamed Mythos, belonging to a new Capybara tier. Leaked internal benchmarks show the model dramatically outperforms GPT-5, Gemini 3.1, and all mainstream competitors in coding, cybersecurity, and mathematical reasoning. Notably, its cybersecurity performance shows ~40% higher success rates in red-team testing and vulnerability discovery versus the current best models.
Key innovations include adaptive reasoning architecture that dynamically adjusts depth based on task complexity, near-perfect information retrieval across 200K token context windows, and architecture-level code comprehension enabling cybersecurity breakthroughs. The model's security training reportedly consumed 30%+ of total training time.
The strategic implications are significant: with 3.5M+ unfilled cybersecurity positions globally, an AI capable of autonomous vulnerability discovery could fundamentally change cyber defense economics. However, dual-use concerns remain paramount. The leak itself sparked debate about internal tensions between Anthropic's safety researchers and commercial teams regarding release strategy.
Against the backdrop of Anthropic's Pentagon contract dispute, a model with extreme cybersecurity capabilities could redefine military AI procurement. Expected formal release in Q2 2026.