Our evaluation of Claude Mythos Preview's cyber capabilities | AISI Work
This UK AI Security Institute (AISI) blog post evaluates Anthropic's Claude Mythos Preview (announced April 7, 2026), finding continued improvement on capture-the-flag challenges and significant progress on multi-step cyber-attack simulations. On expert-level CTF tasks (which no model could complete before April 2025), Mythos Preview succeeds 73% of the time. More notably, it is the first model to solve "The Last Ones" (TLO), a 32-step corporate network attack simulation spanning initial reconnaissance to full network takeover (estimated to require 20 human hours), completing it successfully in 3 out of 10 attempts and averaging 22 of 32 steps across all attempts. Claude Opus 4.6, the next best model, averaged 16 steps. The model could not complete an operational technology focused range ("Cooling Tower"), though it got stuck on IT sections rather than OT-specific tasks. The evaluation notes that performance scales with inference compute (tested up to 100M token budget, with improvements continuing beyond). However, the ranges lack real-world security features such as active defenders, defensive tooling, and alert penalties, so the model may not be able to attack well-defended systems. AISI concludes that as capabilities improve, evaluation environments without defenses will no longer be sufficient to discriminate between the most capable models. Recommendations for organizations include cybersecurity basics (updates, access controls, logging, NCSC's Cyber Essentials), and AISI notes that future frontier models will be more capable, making investment in cyber defense vital.
https://www.aisi.gov.uk/blog/our-evaluation-of-claude-mythos-previews-cyber-capabilities
Comments
Post a Comment