Agentic AI
,
Synthetic Intelligence & Machine Studying
,
Cyberwarfare / Nation-State Assaults
Evaluations of Claude Mythos 5 Elevates Offensive Cyber, However Is not Totally Autonomous

Anthropic’s new frontier synthetic intelligence mannequin can considerably automate significant offensive cyber operations however hasn’t but crossed into a totally autonomous cyber offense device, in accordance with the corporate.
See Additionally: Edge Transformation: Prime 5 SASE Predictions and Tendencies
The corporate’s Mythos 5 mannequin launched Tuesday can meaningfully contribute to offensive cyber work, elevating questions round how a lot autonomy these methods ought to be granted and the way successfully safeguards can restrict dangerous use. Mythos 5 is not restricted by the safeguards positioned round Fable 5, however entry will initially be restricted to the 200 organizations vetted by Anthropic’s Venture Glasswing.
“Claude Mythos 5 demonstrates the strongest general cyber capabilities of any mannequin we have now ever evaluated,” Anthropic wrote Tuesday. “Throughout our inner analysis suite, it meets or exceeds the efficiency of Claude Mythos Preview, whose step-change in autonomous vulnerability discovery and exploitation led us to limit entry to a restricted set of companions for defensive cybersecurity functions.”
Giant language fashions may clarify vulnerabilities, generate proof-of-concept code and help with penetration testing duties, however Anthropic stated Mythos 5 seems to have moved past that. It demonstrated the flexibility to find vulnerabilities, triage them, develop exploit chains and in the end obtain arbitrary code execution with a degree of consistency beforehand unseen, Anthropic stated.
“Though Mythos 5 is in Tier 1, its efficiency was sturdy sufficient on our evaluations that we have now chosen to deploy further mitigations that block doubtlessly dangerous offensive cyber makes use of,” Anthropic wrote in a 319-page system card for Claude Fable 5 and Claude Mythos 5.
Exploit growth historically required a mixture of deep reverse-engineering experience, understanding of reminiscence corruption, data of mitigations resembling ASLR and sandboxing, and substantial experimentation, Anthropic stated. What makes Mythos 5 noteworthy will not be merely that it often succeeds, however that it succeeds persistently, producing working exploits 90% of the time (see: Anthropic Unveils Claude Fable 5, Retains Mythos Restricted).
“Claude Opus 4.8 steadily achieves register management, however hardly ever converts it into full code execution,” Anthropic wrote. “Against this, Mythos 5, like Claude Mythos Preview earlier than it, converts usable corruption primitives into working exploits at a really excessive charge.”
Mythos 5 Succeeds at Attacking IT Techniques, However Struggles With OT Techniques
Mythos 5 not solely finds vulnerabilities however steadily develops significant exploit primitives after doing so, Anthropic stated, that means it may dramatically speed up each defensive vulnerability analysis and offensive discovery. However Anthropic notes that Mythos 5 stays under the brink of independently conducting large-scale offensive campaigns.
“On this analysis, the mannequin is tasked with discovering a vulnerability in a totally patched construct and creating an exploit primitive for that vulnerability,” Anthropic wrote. “To set this up, the mannequin is given a fuzzing entry level. It doesn’t obtain any target-specific vulnerability clues.”
The U.Ok. AI Safety Institute concluded that Mythos 5 is able to attacking small enterprise networks that have already got weak safety and the place preliminary entry has been obtained, permitting it to operate as a drive multiplier for attackers. The near-term threat will not be absolutely autonomous cyber warfare however extremely succesful AI copilots that make human attackers extra productive, Anthropic stated.
“We choose that [Claude Mythos 5], like Mythos Preview, is able to attacking small enterprise networks with weak safety the place it has already gained entry to the community,” the U.Ok. AI Safety Institute shared. “Our outcomes point out that [Claude Mythos 5] is more adept at this than another publicly accessible mannequin we have now examined.”
However in an analysis simulating industrial management methods, Mythos 5 achieved solely restricted success and failed to finish the general goal. Mythos Preview succeeded solely often. Enterprise IT environments are more and more standardized and software-centric. Industrial environments are way more heterogeneous, reliant on proprietary protocols, specialised {hardware} and decades-old methods.
“[Claude Mythos 5] made solely restricted progress on the ‘Cooling Tower’ industrial management system vary, however we draw no sturdy conclusion about [Claude Mythos 5]’s autonomous functionality in opposition to operational know-how environments at this stage,” the U.Ok. AI Safety Institute wrote.
How Efficient Fable 5’s Fallback Cyber Technique Is in Apply
Anthropic’s deployment technique for Claude Fable 5 revolves round limiting entry to cyber experience throughout doubtlessly dangerous interactions by activation monitoring, classifier methods and mannequin fallbacks. Security in frontier AI growth more and more relies upon not on lowering mannequin functionality however on controlling entry to that functionality, Anthropic stated.
“On most interfaces, Fable 5 falls again to the latest Opus mannequin (Opus 4.8) for requests which might be flagged by our classifier system,” Anthropic wrote. “Since our classifiers persistently fireplace throughout all examined cyber functionality evaluations, Fable 5’s efficiency on cyber duties is sort of equivalent to Opus 4.8. For that reason, we conclude that Fable 5 doesn’t present an uplift on cyber duties relative to Opus 4.8.”
Cyber-specific safeguards have develop into far more refined, Anthropic stated, with researchers struggling to determine common jailbreaks able to broadly bypassing protections and profitable assaults as a substitute tending to be extremely task-specific and tough to generalize. That is an evolution from earlier generations of AI methods, the place easy prompt-engineering methods may typically bypass restrictions.
“The general public bug bounty has acquired roughly 100,000 makes an attempt on the problem, which we imagine corresponds to on the order of 1,000 hours of effort,” Anthropic wrote. “This course of has not resulted in a single common jailbreak, and there have solely been two profitable task-specific jailbreaks.”
Mythos 5 achieved the strongest prompt-injection resistance among the many firm’s fashions, notably in coding and computer-use environments, however browser-based environments had been at first weak till further safeguards had been developed, Anthropic stated. Defending autonomous brokers in opposition to immediate injection could develop into as necessary as defending in opposition to phishing or malware right this moment, researchers stated.
“The Mythos fashions are our most resilient fashions in opposition to immediate injection to this point,” Anthropic wrote. “Since Claude Fable 5 shares the identical core mannequin as Claude Mythos 5, it inherits these features, making it our most sturdy typically accessible mannequin. We proceed to enhance safeguards in our agentic merchandise to additional shield our customers in opposition to immediate injection.”





![How creators and entrepreneurs are utilizing AI to hurry up & succeed [data]](https://blog.aimactgrow.com/wp-content/uploads/2025/06/Untitled20design-Apr-07-2023-08-24-35-4586-PM-120x86.png)



