Understanding Time Series Anomaly State Detection through One-Class Classification (2402.02007v1)
Abstract: Research on time series anomaly detection has long focused on finding outliers within a given time series. This matches some practical problems, but in other application scenarios the question is different: given a standard time series, how can one judge whether another test time series deviates from it? This question is closer to the problem studied in one-class classification (OCC). In this article, we therefore re-examine and redefine the time series anomaly detection problem through OCC, and call the result the 'time series anomaly state detection problem'. We first use stochastic processes and hypothesis testing to rigorously define this problem and its corresponding anomalies. We then use time series classification data to construct an artificial dataset for the problem. We compile 38 anomaly detection algorithms and modify some of them so that they can handle this problem. Finally, through extensive experiments, we fairly compare the performance of these time series anomaly detection algorithms, providing insights and directions for future research.
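To make the problem setting concrete, the following is a minimal sketch of judging whether a test series deviates from a standard series in a one-class fashion. It is not the paper's formal definition, nor any of the 38 algorithms it compares. The nearest-neighbour window scoring, the function names (`fit_one_class`, `deviates`), the window width, the calibration split, the thresholds, and the synthetic sine data are all assumptions chosen for illustration.

```python
import numpy as np

def sliding_windows(series, width):
    # View the series as overlapping windows of shape (n_windows, width).
    series = np.asarray(series, dtype=float)
    return np.lib.stride_tricks.sliding_window_view(series, width)

def nn_distances(queries, reference):
    # Euclidean distance from each query window to its nearest reference window.
    diff = queries[:, None, :] - reference[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=2).min(axis=1))

def fit_one_class(standard, width=20, calib_fraction=0.3, quantile=0.99):
    # Only the standard (normal) series is seen here, as in OCC.
    # Its tail is held out to calibrate a distance threshold.
    standard = np.asarray(standard, dtype=float)
    split = int(len(standard) * (1 - calib_fraction))
    reference = sliding_windows(standard[:split], width)
    calibration = sliding_windows(standard[split:], width)
    threshold = np.quantile(nn_distances(calibration, reference), quantile)
    return reference, threshold

def deviates(test, reference, threshold, width=20, max_fraction=0.2):
    # Flag the whole test series when too many of its windows lie
    # farther from the reference windows than the calibrated threshold.
    scores = nn_distances(sliding_windows(test, width), reference)
    fraction = float(np.mean(scores > threshold))
    return fraction > max_fraction, fraction

# Hypothetical data: a noisy sine as the standard series, one test series
# from the same process and one with a different period.
rng = np.random.default_rng(0)
t_std = np.arange(1000)
t_test = np.arange(1000, 1300)
standard = np.sin(2 * np.pi * t_std / 50) + 0.1 * rng.standard_normal(t_std.size)
normal_test = np.sin(2 * np.pi * t_test / 50) + 0.1 * rng.standard_normal(t_test.size)
abnormal_test = np.sin(2 * np.pi * t_test / 10) + 0.1 * rng.standard_normal(t_test.size)

reference, threshold = fit_one_class(standard)
normal_flag, normal_fraction = deviates(normal_test, reference, threshold)
abnormal_flag, abnormal_fraction = deviates(abnormal_test, reference, threshold)
print("normal test deviates:", normal_flag, "fraction:", normal_fraction)
print("abnormal test deviates:", abnormal_flag, "fraction:", abnormal_fraction)
```

The decision is made for the test series as a whole rather than for individual points, which is the distinction the abstract draws between anomaly state detection and conventional outlier detection.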