
Image: Indian Express
AI tools like Anthropic’s Mythos show promise but fail at cybersecurity defense. Discover the alarming findings of a new study on AI's limitations against cyberattacks.
GlipzoIn today's digital landscape, the emergence of advanced artificial intelligence (AI) models, such as Anthropic’s Mythos, has transformed how we approach numerous industries, including cybersecurity. However, even as these tools become increasingly sophisticated, a stark reality emerges: the very AI technologies designed to enhance security can also be exploited by cybercriminals. A recent study by Simbian.ai, a prominent AI security operations platform, sheds light on a troubling phenomenon—AI's inability to effectively defend against cyberattacks.
Ambuj Kumar, the founder and CEO of Simbian.ai, revealed startling findings from their research, stating, "We tested 11 of the best AI models available, and none of them passed. None of them came even close to doing a good job on cyber defense." This revelation highlights a critical gap in our cybersecurity efforts, particularly as AI becomes a more prominent player in both offense and defense.
To better understand the capabilities of AI in cybersecurity, Kumar’s team developed the Cyber Defense Benchmark, a pioneering test aimed at evaluating how effectively AI can identify and neutralize cyber threats. The benchmark presented AI models with the daunting task of sifting through massive volumes of security logs, simulating a real-world scenario where malicious actors often hide among innocuous data.
The results were sobering. The models tested included notable names such as Claude Opus 4.6, GPT-5, and Gemini 3.1 Pro, and they faced 26 attack scenarios that encompassed 105 hacking techniques. Even the top performer, Claude Opus 4.6, only managed to identify a mere 4 to 5% of the actual malicious events, flagging just 44-45 out of 100 potential threats. This reveals a significant shortfall in the AI's ability to provide reliable cybersecurity defense.
The research conducted by Simbian.ai unearthed several fundamental issues surrounding AI's performance in cybersecurity defense:
1. Volume of Data: The sheer number of log entries overwhelmed the AI. With more than 100,000 entries to analyze while only processing 10 at a time, the AI struggled to formulate effective queries to uncover threats. 2. Detection vs. Reporting: Although AI models like Claude Opus 4.6 identified suspicious activities, they often failed to report them. For instance, it detected 159 malicious events but flagged only 113. 3. Subtlety of Attacks: Many hacking techniques leave minimal traces, making them nearly invisible to AI systems. The study found that the models frequently overlooked these faint signals, which could result in significant vulnerabilities.
As AI continues to advance in capabilities, cybercriminals are not standing still. Kumar pointed out that hackers are leveraging AI to enhance their attacks. He illustrated this point with a chilling example of a recent incident where scammers created a convincing AI-generated persona for a video call, ultimately duping a chief financial officer into authorizing a $25 million transfer.
This capability to impersonate individuals convincingly raises serious concerns about the effectiveness of current cybersecurity measures. As Kumar noted, “It’s very easy to impersonate people with AI because it can write nice emails.” The implications of such advancements in both offensive and defensive strategies are profound and troubling.
The findings from Simbian.ai's research prompt critical questions about the future of AI in cybersecurity. As AI tools become increasingly accessible, particularly open-source models, the potential for misuse grows exponentially. Kumar emphasized that these open-source models, while beneficial for innovation, can also pose significant risks, as they are often three steps behind in terms of security measures.
As organizations grapple with these challenges, it becomes imperative to develop more robust AI defenses capable of keeping pace with the evolving threat landscape. Here are some strategies to watch for in the coming months:
As organizations continue to embrace AI tools, the dual-use nature of these technologies—both as a means of defense and a weapon for attackers—must be acknowledged and addressed. The findings from Simbian.ai serve as a crucial reminder that while AI holds promise for enhancing cybersecurity, its current limitations in defending against cyber threats must be taken seriously. The road ahead will require concerted efforts from tech companies, cybersecurity experts, and regulatory bodies to ensure a safer digital environment.
In an era where cyber threats are becoming more sophisticated, the challenge remains: How can we harness the potential of AI without falling victim to its misuse?

Nvidia's new RTX Spark chip aims to transform personal computing with AI, marking a significant shift in technology. Discover what this means for consumers.
BBC Business
The explosion of Blue Origin's New Glenn rocket raises significant concerns over NASA's lunar ambitions and the future of Amazon's satellite projects. Discover the implications.
BBC Science
Discover how Meta's lack of engagement on user bans raises critical concerns about accountability and transparency in social media governance.
BBC Technology