Deep Learning for Anomaly Detection: A Review (2007.02500v3)

Published 6 Jul 2020 in cs.LG, cs.CV, and stat.ML

Abstract: Anomaly detection, a.k.a. outlier detection or novelty detection, has been a lasting yet active research area in various research communities for several decades. There are still some unique problem complexities and challenges that require advanced approaches. In recent years, deep learning enabled anomaly detection, i.e., deep anomaly detection, has emerged as a critical direction. This paper surveys the research of deep anomaly detection with a comprehensive taxonomy, covering advancements in three high-level categories and 11 fine-grained categories of the methods. We review their key intuitions, objective functions, underlying assumptions, advantages and disadvantages, and discuss how they address the aforementioned challenges. We further discuss a set of possible future opportunities and new perspectives on addressing the challenges.

Authors (4)

Guansong Pang (82 papers)
Chunhua Shen (404 papers)
Longbing Cao (85 papers)
Anton van den Hengel (188 papers)

Citations (771)

View on Semantic Scholar

Summary

The paper categorizes deep learning anomaly detection methods into three frameworks: feature extraction, normality representation, and end-to-end score learning.
It demonstrates that deep models can reduce dimensionality and capture intricate patterns in high-dimensional data, thereby enhancing detection performance.
The review highlights promising future directions, including weakly-supervised approaches and integrating domain-specific knowledge for robust anomaly detection.

Overview of "Deep Learning for Anomaly Detection: A Review"

The paper by Pang et al. offers an in-depth analysis of deep learning methods applied to anomaly detection, a prominent research domain with applications across various fields such as security, health, and risk management. The authors present a comprehensive taxonomy categorizing existing methods into three high-level frameworks: feature extraction, normality feature learning, and end-to-end anomaly score learning.

Deep Learning as Feature Extractors

In the first category, deep learning models serve primarily as feature extractors, decoupling anomaly scoring from feature learning. While straightforward and leveraging pre-trained models, this approach can yield suboptimal results since the extracted features are often not optimized for anomaly detection tasks. Nevertheless, these models excel in processing high-dimensional data and reducing dimensionality, which is beneficial for traditional anomaly detection methods struggling with these challenges.

Learning Normality Representations

The second category explores methods that integrate feature learning with existing anomaly detection measures, aiming to create representations optimized for specific detection frameworks. This category includes diverse approaches, such as autoencoders and generative adversarial networks (GANs), each with its strengths and limitations. Autoencoders, for example, reconstruct normal instances more accurately, though they can be biased by outliers in training data. GAN-based approaches are powerful in representation learning but face challenges like mode collapse and complex training.

These methods focus on learning normality features that emphasize underlying data regularities. This amalgamation allows for improved handling of high-dimensional data and complex dependencies, addressing "curse of dimensionality" issues typically encountered by traditional methods.

End-to-end Anomaly Score Learning

The third category involves direct learning of anomaly scores through deep models with novel loss functions. These methods unify the process by jointly learning feature representations and scoring, thus optimizing the anomaly scores themselves. Techniques such as anomaly ranking and one-class classification via adversarial networks fall into this category.

This approach is flexible, accommodating various data modalities and domains while providing an avenue for inherent interpretability essential for decision-making processes in practical applications. Moreover, these methods can leverage limited labeled data, enhancing model accuracy and robustness in detecting novel anomalies, a significant advancement over conventional methods.

Numerical Results and Implications

The paper does not focus extensively on providing specific numerical comparisons across different methods but emphasizes the categorization's value in identifying effective anomalies handling. It highlights the flexibility of deep models in learning intricate patterns from multimodal data sources, significantly improving accuracy over many traditional approaches.

Future Directions

The authors identify several promising research directions, including weakly-supervised settings and integrating domain-specific knowledge. They argue for large-scale normality learning to support generalizable models and suggest deep methods' potential in advanced applications like out-of-distribution detection and curiosity-driven learning. Furthermore, there is an emphasis on developing interpretable and actionable models that can provide insights into anomalies as well as suggest mitigation strategies, addressing critical needs in high-stakes domains.

Overall, the paper presents a thorough exploration into the application of deep learning for anomaly detection, providing a valuable resource for researchers focused on advancing this critical field by leveraging the unique capabilities of neural networks. Future research inspired by this work may lead to more robust, efficient, and interpretable anomaly detection methodologies.

PDF Markdown