- The paper systematically compares 20 methods, revealing that traditional statistical models often outperform newer ML and deep learning approaches in standard anomaly scenarios.
- The study shows that while deep learning, especially LSTM, excels at detecting contextual anomalies, it demands considerably more computational resources than statistical baselines.
- The paper emphasizes the trade-off between prediction accuracy and speed, guiding researchers in choosing optimal approaches based on anomaly type.
An Insightful Overview of Anomaly Detection in Univariate Time-series
The paper "Anomaly Detection in Univariate Time-series: A Survey on the State-of-the-Art" presents a comprehensive evaluation of anomaly detection methods within the landscape of univariate time-series. The authors, Mohammad Braei and Dr.-Ing. Sebastian Wagner, provide an in-depth comparison of statistical, machine learning, and deep learning methodologies, analyzing their performance on benchmark datasets.
Summary of Key Findings
The advent of digitization has substantially increased the volume of time-series data, elevating the importance of automated anomaly detection. The paper outlines the historical reliance on statistical approaches and highlights the burgeoning interest in machine learning and neural networks for detecting anomalies. Despite the popularity and 'hype' surrounding deep learning, the authors find that statistical methods often outperform both machine learning and deep learning approaches in terms of accuracy and computational efficiency when detecting point and collective anomalies.
Methods Evaluated
The paper examines 20 univariate anomaly detection methods across three primary categories:
Statistical Approaches
- The statistical models include autoregressive (AR), moving average (MA), ARIMA, and exponential smoothing techniques.
- These models are praised for their computational efficiency and robust performance in detecting point and collective anomalies.
- Results indicate that AR and MA models consistently achieve high AUC values while maintaining a balance between prediction accuracy and speed; a minimal prediction-based sketch follows this list.
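As a concrete illustration, here is a minimal sketch of prediction-based detection with an AR model, the general recipe behind the paper's statistical methods: fit the model, then score each point by its one-step-ahead forecast error. The lag order, injected anomaly, and three-sigma threshold below are illustrative assumptions, not the authors' exact configuration (the paper ranks methods by AUC over such scores rather than by a fixed threshold).

```python
# Minimal AR-based anomaly detection: fit an autoregressive model,
# then score each point by its absolute one-step-ahead prediction error.
# Lag order and threshold are illustrative, not the paper's settings.
import numpy as np
from statsmodels.tsa.ar_model import AutoReg

rng = np.random.default_rng(0)
series = np.sin(np.linspace(0, 20 * np.pi, 1000)) + rng.normal(0, 0.1, 1000)
series[500] += 3.0  # inject a point anomaly

p = 10  # AR lag order (assumed)
model = AutoReg(series, lags=p).fit()
pred = model.predict(start=p, end=len(series) - 1)  # in-sample one-step forecasts
errors = np.abs(series[p:] - pred)  # anomaly score per point

# Flag points whose error exceeds mean + 3 std (a common heuristic)
threshold = errors.mean() + 3 * errors.std()
anomalies = np.where(errors > threshold)[0] + p
print("Flagged indices:", anomalies)
```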
Machine Learning Approaches
- The evaluated algorithms include K-Means, DBSCAN, and Isolation Forest (iForest).
- iForest achieved competitive AUC values but at significantly greater computational cost, raising scalability concerns for real-time applications; a windowed iForest sketch follows this list.
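Below is a hedged sketch of how iForest can be applied to a univariate series by embedding it into sliding windows, so each window becomes one feature vector. The window length, injected anomaly, and forest size are assumptions for illustration, not the paper's setup.

```python
# Isolation Forest on sliding windows of a univariate series: each
# window becomes one sample; window length and forest size are assumed.
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(1)
series = rng.normal(0, 1, 2000)
series[1200:1210] += 6.0  # inject a collective anomaly

w = 20  # sliding-window length (assumed)
windows = np.lib.stride_tricks.sliding_window_view(series, w)

forest = IsolationForest(n_estimators=100, random_state=0)
forest.fit(windows)
# score_samples: higher means more normal, so negate for an anomaly score
scores = -forest.score_samples(windows)

top = np.argsort(scores)[-5:]  # window start indices with highest scores
print("Most anomalous windows start at:", np.sort(top))
```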
Deep Learning Approaches
- Approaches include multilayer perceptron (MLP), convolutional neural network (CNN), and LSTM architectures, among others.
- While these methods did not generally outperform statistical models on standard benchmark datasets, they performed best on datasets where contextual anomalies were prevalent.
- Notably, LSTM networks excelled at detecting contextual anomalies, highlighting deep learning's adaptability in complex scenarios; a forecasting-based LSTM sketch follows this list.
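To make the forecasting-based deep learning recipe concrete, here is a minimal LSTM sketch in the same spirit: train a one-step forecaster and use the absolute forecast error as the anomaly score. The architecture, window length, injected anomaly, and training settings are illustrative assumptions rather than the paper's exact configuration.

```python
# LSTM one-step forecaster for anomaly scoring: train on the series,
# then score each point by its absolute forecast error. Architecture,
# window length, and training settings are illustrative assumptions.
import numpy as np
import tensorflow as tf

t = np.linspace(0, 40 * np.pi, 4000)
series = np.sin(t).astype("float32")
series[3000:3020] = series[3000:3020][::-1]  # contextual anomaly: reversed motif

w = 30  # input window length (assumed)
X = np.stack([series[i:i + w] for i in range(len(series) - w)])[..., None]
y = series[w:]

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(w, 1)),
    tf.keras.layers.LSTM(32),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")
model.fit(X, y, epochs=3, batch_size=64, verbose=0)

errors = np.abs(model.predict(X, verbose=0).ravel() - y)  # anomaly scores
print("Peak error near index:", int(np.argmax(errors)) + w)
```

The reversed motif stays within the normal value range, so a pure magnitude check would miss it; the forecaster flags it because the local pattern contradicts the learned dynamics, which is precisely the contextual-anomaly case where the survey found LSTM strongest.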
Implications and Future Directions
The findings suggest a prominent role for statistical methods in typical anomaly detection scenarios, particularly where point and collective anomalies are the primary focus. However, in situations involving contextual anomalies, neural networks can offer distinct advantages due to their flexibility and adaptability.
The paper underscores the need for continued research into methods that dynamically balance computational efficiency and accuracy across anomaly detection contexts. It also points to multivariate anomaly detection and real-time data stream analysis as promising directions that fall outside the scope of this paper.
Conclusion
The research offers valuable insights into the strengths and limitations of different anomaly detection methods in univariate time-series. It encourages practitioners to carefully consider the type of anomalies and data attributes present when selecting an appropriate detection approach. This paper sets a foundation for future studies to explore hybrid models that combine the robustness of statistical approaches with the adaptive capabilities of deep learning, aiming to overcome the weaknesses identified in each category of methodologies.