Unsupervised Anomaly-based Malware Detection using Hardware Features (1403.1631v2)

Published 7 Mar 2014 in cs.CR

Abstract: Recent works have shown promise in using microarchitectural execution patterns to detect malware programs. These detectors belong to a class of detectors known as signature-based detectors as they catch malware by comparing a program's execution pattern (signature) to execution patterns of known malware programs. In this work, we propose a new class of detectors - anomaly-based hardware malware detectors - that do not require signatures for malware detection, and thus can catch a wider range of malware including potentially novel ones. We use unsupervised machine learning to build profiles of normal program execution based on data from performance counters, and use these profiles to detect significant deviations in program behavior that occur as a result of malware exploitation. We show that real-world exploitation of popular programs such as IE and Adobe PDF Reader on a Windows/x86 platform can be detected with nearly perfect certainty. We also examine the limits and challenges in implementing this approach in face of a sophisticated adversary attempting to evade anomaly-based detection. The proposed detector is complementary to previously proposed signature-based detectors and can be used together to improve security.

Authors (3)

Adrian Tang (2 papers)
Simha Sethumadhavan (13 papers)
Salvatore Stolfo (3 papers)

Citations (232)

View on Semantic Scholar

Summary

The paper presents an unsupervised anomaly-based malware detection system that uses hardware performance counters (HPCs) to identify deviations from normal program execution.
The methodology employs one-class SVMs in non-temporal and temporal models, utilizing a selection of 19 hardware features to build baseline execution profiles.
Experiments showed high detection accuracy (nearly 100% sensitivity) against real-world exploits with minimal false positives, highlighting the potential of hardware-level features for robust anomaly detection.

Unsupervised Anomaly-based Malware Detection using Hardware Features: An Expert Overview

This paper presents a novel methodology for malware detection, leveraging unsupervised machine learning to establish anomaly-based detection paradigms using hardware features. Unlike traditional signature-based approaches that rely on known execution patterns, this paper introduces a detection system capable of identifying novel malware by establishing profiles of normal program execution via hardware performance counters (HPCs).

Key Contributions and Methodology

The authors propose a shift from signature-based detectors to anomaly-based detectors, exploring hardware performance counters as a source of detectable deviations in program execution due to malware activity. This unsupervised anomaly-based detection system constructs baseline models using a robust set of HPC events and identifies deviations that signal malware exploitation. The approach is tested on exploits targeting commonly attacked applications such as Internet Explorer and Adobe PDF Reader, showcasing nearly perfect detection sensitivity.

Detection Models:

The authors employ one-class SVMs to model execution characteristics based on event measurements from HPCs. They argue that this approach can differentiate between normal execution and deviations caused by malware, even in the face of sophisticated modifications intended to evade detection.
Two modeling paradigms are considered: non-temporal (single-epoch feature vectors) and temporal (consolidating multiple epochs). Both approaches utilize a selection of 19 candidate hardware features, encompassing both architectural and microarchitectural events.

Experimental Setup:

Experiments were conducted using real-world exploit scenarios validated on virtualized Windows/x86 environments. Collections from clean systems were contrasted with those under attack to build and test the models’ efficacy.

Evaluation and Results

The anomaly-based hardware detectors presented in the paper demonstrated remarkably high detection capabilities, achieving 100% identification of exploitation epochs with minimal false positives when using certain sampling strategies. Temporal models, which analyze extended execution contexts, exhibited even greater accuracy compared to non-temporal counterparts.

Sampling Overhead: The analysis indicates that finer-grained sampling improves detection accuracy but also increases performance overhead, with 512k instruction epochs striking a pragmatic balance.
Detection Resilience: The methodology’s resilience to mimicry attacks, where adversaries craft exploits to blend with normal execution profiles, further emphasizes its robustness.

Mixed event modeling strategies employing both architectural and microarchitectural signals yielded optimal results, underscoring the value of multispectral data in capturing execution anomalies. However, detection performance against ROP-based exploits was constrained by sampling granularity.

Implications and Future Directions

This research signifies an important evolution in malware detection strategy, emphasizing that deeper hardware-level observability can significantly enhance anomaly detection systems. The theoretical implications underscore the potential for more effective detection systems that do not rely solely on software-level cues, potentially leading to improved security in the computing environment.

In practical terms, hardware-enhanced security measures could become an integral part of future computing architectures. The suggestion for architectural enhancements, such as offering more performance counters and custom accelerators, aligns with the theoretical requirement for more robust detection models.

Looking ahead, integrating such hardware-driven anomaly detection systems with existing protective measures could establish a layered defense strategy against evolving malware threats. Further research may focus on optimizing performance overheads, refining detection algorithms with advanced machine learning methods, and exploring additional hardware features to offer even greater granularity and fidelity in anomaly detection.

In conclusion, the paper sets an academic precedent for employing hardware-level anomalies as a rich vein for mining malware indicators, suggesting a productive avenue for future security research and development.

PDF Markdown