Cyber-industry collaboration through AI

Issue 9 2020 Information Security

Sophos announced four new open artificial intelligence (AI) developments to help broaden and sharpen the industry’s defences against cyberattacks, including datasets, tools and methodologies designed to advance industry collaboration and cumulative innovation. This move accelerates a key Sophos objective to open its data science breakthroughs and make the use of AI in cybersecurity more transparent, all with the aim of better protecting organisations against all forms of cybercrime.

While it is common practice to share AI methodologies and findings in other industries, cybersecurity has lagged in this effort, creating a noisy understanding of how AI truly provides protection against cyber threats. Sophos and its team of SophosAI data scientists are catalysing this change toward openness, so that IT managers, security analysts, CFOs, CEOs, and others making security buying or management decisions, can discuss and assess AI benefits from a level and well-informed playing field.

“With SophosAI’s new initiative to open its research, we can help influence how AI is positioned and discussed in cybersecurity moving forward. Today’s cacophony of opaque or guarded claims about the capabilities or efficacy of AI in solutions makes it difficult to impossible for buyers to understand or validate these claims. This leads to buyer scepticism, creating headwinds to future progress at the very moment we’re starting to see great breakthroughs,” said Joe Levy, chief technology officer, Sophos. “Correcting this through external mechanisms like standards or regulation won’t happen quickly enough. Instead, it requires a grassroots effort and self-policing within our community to produce a set of practices and language that will advance the industry in a disruptive, open and transparent manner.”

It is difficult to overstate the criticality of this shift given the immense potential of how AI can benefit cybersecurity. Sophos evidence shows that defenders are increasingly facing human adversaries who are constantly upping their game, launching highly contextualised Business Email Compromise (BEC) forgery campaigns or relentlessly developing new ransomware attacks. Scalable and effective defences against these and most other types of cyberattacks require assistance from AI. Openness and peer review among those applying AI to address these security threats stimulate innovation and discoveries, driving the entire industry forward.

Sophos is providing datasets, tools and methodologies in four important areas:

SOREL-20M dataset for accelerating malware detection research

SOREL-20M, a joint project between SophosAI and ReversingLabs, is a production-scale dataset containing metadata, labels and features for 20 million Windows Portable Executable (PE) files. It includes 10 million disarmed malware samples available for download for the purpose of research on feature extraction to accelerate industry-wide improvements in security. This dataset is the first production scale malware research dataset available to the general public, with a curated and labelled set of samples and security-relevant metadata.

AI-powered impersonation protection method

SophosAI’s impersonation protection is designed to protect against email spear phishing attacks, where influential people are impersonated to trick recipients into taking some harmful action for the benefit of the attacker. This new protection compares the display name of inbound emails against high-level executive titles – those most likely to be spoofed in a spear phishing attack, such as a CEO, CFO or president – that are unique to specific organisations and flags these messages when they appear suspicious. Sophos has trained the AI working behind the scenes on a large sample set of millions of known attack emails. SophosAI has opened up this innovative new protection method, which it has also discussed publicly at Defcon 28 and in an Arxiv paper.

Digital epidemiology to determine undetected malware

SophosAI has also built a set of epidemiology-inspired statistical models for estimating the prevalence of malware infections in total, which enables Sophos to estimate – and in turn enabling a better chance to find – the needles in a PE file haystack. SophosAI has pioneered and made publicly available this method that helps to determine malicious ‘dark matter’, malware that might be missed or wrongly classified, and ‘future malware’ that is in development by attackers. The model is designed to be extensible to other classes of files and information system artefacts and is also discussed in the Sophos 2021 Threat Report.

YaraML automatic signature generation tools

Signature generation for the detection of malware families is a laborious, manual process. Over the years, researchers have proposed a variety of automatic signature generation methods, most of which have not found adoption because they underperform manual methods. SophosAI has developed a new method for automatic signature generation, called YaraML, that’s significantly different from previous options by taking an AI based approach to the problem.

SophosAI directly ‘compiles’ full-fledged, industrial strength machine learning models, the kinds used in commercial security products, into signature languages, essentially allowing AI to ‘write’ the signatures. This proves to be far more effective than previous approaches and represents a breakthrough for the security community. SophosAI has open-sourced YaraML.

These four advancements are the latest from SophosAI, which works creatively like a start-up incubator, but with the intellectual resources of a near billion-dollar global company, including SophosLabs, Sophos Managed Threat Response and hundreds of thousands of customers. Another advantage is that SophosAI can add new technology directly into shipping products. This model allows Sophos to react quickly to market needs, predict where the industry must head and advance openness for greater cybersecurity industry collaboration and innovation, all of which is essential when developing defences against fast-moving adversaries.

Find out more at www.sophos.com




Share this article:
Share via emailShare via LinkedInPrint this page



Further reading:

Who are you?
Access Control & Identity Management Information Security
Who are you? This question may seem strange, but it can only be answered accurately by implementing an Identity and Access Management (IAM) system, a crucial component of any company’s security strategy.

Read more...
Check Point launches African Perspectives on Cybersecurity report
News & Events Information Security
Check Point Software Technologies released its African Perspectives on Cybersecurity Report 2025, revealing a sharp rise in attacks across the continent and a major shift in attacker tactics driven by artificial intelligence

Read more...
What is your ‘real’ security posture?
BlueVision Editor's Choice Information Security Infrastructure AI & Data Analytics
Many businesses operate under the illusion that their security controls, policies, and incident response plans will hold firm when tested by cybercriminals, but does this mean you are really safe?

Read more...
What is your ‘real’ security posture? (Part 2)
BlueVision Editor's Choice Information Security Infrastructure
In the second part of this series of articles from BlueVision, we explore the human element: social engineering and insider threats and how red teaming can expose and remedy them.

Read more...
Sophos announces evolution of its security operations portfolio
Information Security
Sophos has announced significant enhancements to its security operations portfolio via Sophos XDR and Sophos MDR offerings, marking an important milestone in its integration journey following the acquisition of Secureworks in February 2025.

Read more...
Cybersecurity operations done right
LanDynamix SMART Security Solutions Technews Publishing Information Security
For smaller companies, the costs associated with acquiring the necessary skills and tools can be very high. So, how can these organisations establish and maintain their security profile amid constant attacks and evolving technology?

Read more...
AI security with AI Cloud Protect
Information Security
AI Cloud Protect is now available for on-premises enterprise deployments to secure AI model development, agentic AI applications, and inference workloads with zero impact on performance.

Read more...
Kaspersky finds security flaws that threaten vehicle safety.
News & Events Information Security Transport (Industry)
At its Security Analyst Summit 2025, Kaspersky presented the results of a security audit that exposed a significant security flaw enabling unauthorised access to all connected vehicles of one automotive manufacturer.

Read more...
The overlooked risks of everyday connectivity
Information Security
That free Wi-Fi you are using could end up costing you a lot more money than your hotspot data if it has been compromised, says Richard Frost, head of technology solutions and consulting at Armata Cyber Security.

Read more...
Syndicates exploit insider vulnerabilities in SA
Information Security Security Services & Risk Management
Today’s cyber criminals do not just exploit vulnerabilities in your systems; they exploit your people, turning trusted team members into unwitting accomplices or deliberate collaborators in their schemes.

Read more...










While every effort has been made to ensure the accuracy of the information contained herein, the publisher and its agents cannot be held responsible for any errors contained, or any loss incurred as a result. Articles published do not necessarily reflect the views of the publishers. The editor reserves the right to alter or cut copy. Articles submitted are deemed to have been cleared for publication. Advertisements and company contact details are published as provided by the advertiser. Technews Publishing (Pty) Ltd cannot be held responsible for the accuracy or veracity of supplied material.




© Technews Publishing (Pty) Ltd. | All Rights Reserved.