Temporal Convolution-based Hybrid Model Approach with Representation Learning for Real-Time Acoustic Anomaly Detection (2410.19722v1)
Abstract: The early detection of potential failures in industrial machinery components is paramount for ensuring the reliability and safety of operations, thereby preserving Machine Condition Monitoring (MCM). This research addresses this imperative by introducing an innovative approach to Real-Time Acoustic Anomaly Detection. Our method combines semi-supervised temporal convolution with representation learning and a hybrid model strategy with Temporal Convolutional Networks (TCN) to handle various intricate anomaly patterns found in acoustic data effectively. The proposed model demonstrates superior performance compared to established research in the field, underscoring the effectiveness of this approach. Not only do we present quantitative evidence of its superiority, but we also employ visual representations, such as t-SNE plots, to further substantiate the model's efficacy.
- Charu C. Aggarwal. 2016. Outlier Analysis (2nd ed.). Springer Publishing Company, Incorporated.
- An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling. CoRR abs/1803.01271 (2018). arXiv:1803.01271 http://arxiv.org/abs/1803.01271
- Representation Learning: A Review and New Perspectives. arXiv:1206.5538Â [cs.LG]
- A Review on Outlier/Anomaly Detection in Time Series Data. Comput. Surveys 54, 3 (Jun 2021), 1–33. https://doi.org/10.1145/3444690
- Raghavendra Chalapathy and Sanjay Chawla. 2019. Deep Learning for Anomaly Detection: A Survey. CoRR abs/1901.03407 (2019). arXiv:1901.03407 http://arxiv.org/abs/1901.03407
- Anomaly detection. Comput. Surveys 41, 3 (Jul 2009), 1–58. https://doi.org/10.1145/1541880.1541882
- Paweł Daniluk. 2020. Ensemble Of Auto-encoder Based And Wavenet Like Systems For Unsupervised Anomaly Detection Technical Report. In DCASE Challenge 2020 Tech Report. DCASE, DCASE.
- Self-supervised classification for detecting anomalous sounds. In Detection and Classification of Acoustic Scenes and Events Workshop 2020. Amazon, Amazon Science. https://www.amazon.science/publications/self-supervised-classification-for-detecting-anomalous-sounds
- Self-Supervised Anomaly Detection: A Survey and Outlook. https://doi.org/10.48550/ARXIV.2205.05173
- Rob Hyndman and G. Athanasopoulos. 2021. Forecasting: Principles and Practice (3rd ed.). OTexts, Australia.
- DCASE Challenge 2020: Unsupervised Anomalous Sound Detection of Machinery with Deep Autoencoders. In DCASE Challenge 2020 Tech Report. DCASE, DCASE.
- Diederik P. Kingma and Jimmy Ba. 2017. Adam: A Method for Stochastic Optimization. arXiv:1412.6980 [cs.LG]
- Statistical and Machine Learning forecasting methods: Concerns and ways forward. PLOS ONE 13, 3 (Mar 2018), e0194889. https://doi.org/10.1371/journal.pone.0194889
- The M4 Competition: 100,000 time series and 61 forecasting methods. International Journal of Forecasting 36, 1 (Jan 2020), 54–74. https://doi.org/10.1016/j.ijforecast.2019.04.014
- Ayoub Malek. 2023. Spafe: Simplified python audio features extraction. Journal of Open Source Software 8, 81 (2023), 4739. https://doi.org/10.21105/joss.04739
- librosa: Audio and music signal analysis in python. In Proceedings of the 14th python in science conference, Vol. 8.
- Deep Learning for Anomaly Detection. Comput. Surveys 54, 2 (Mar 2021), 1–38. https://doi.org/10.1145/3439950
- MIMII Dataset: Sound Dataset for Malfunctioning Industrial Machine Investigation and Inspection. https://doi.org/10.5281/zenodo.3384388
- NTT Technical Review. 2017. Anomaly Detection Technique in Sound to Detect Faulty Equipment — NTT Technical Review. https://www.ntt-review.jp/archive/ntttechnical.php?contents=ntr201708fa5.html
- Deep Dense and Convolutional Autoencoders for Unsupervised Anomaly Detection in Machine Condition Sounds. Technical Report. DCASE2020 Challenge.
- A Unifying Review of Deep and Shallow Anomaly Detection. arxiv.org (Sep 2020). https://doi.org/10.1109/JPROC.2021.3052449
- Sound Spectrum Influences Auditory Distance Perception of Sound Sources Located in a Room Environment. Frontiers in Psychology 8 (Jun 2017). https://doi.org/10.3389/fpsyg.2017.00969
- Sequence to Sequence Learning with Neural Networks. CoRR abs/1409.3215 (2014). arXiv:1409.3215 http://arxiv.org/abs/1409.3215
- Laurens van der Maaten and Geoffrey Hinton. 2008. Visualizing Data using t-SNE. Journal of Machine Learning Research 9 (2008), 2579–2605. http://www.jmlr.org/papers/v9/vandermaaten08a.html
- A Review of Dimensionality Reduction Techniques for Efficient Computation. Procedia Computer Science 165 (2019), 104–111. https://doi.org/10.1016/j.procs.2020.01.079
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.