Papers
Topics
Authors
Recent
Search
2000 character limit reached

Temporal Convolutional Explorer Helps Understand 1D-CNN's Learning Behavior in Time Series Classification from Frequency Domain

Published 9 Oct 2023 in cs.LG | (2310.05467v1)

Abstract: While one-dimensional convolutional neural networks (1D-CNNs) have been empirically proven effective in time series classification tasks, we find that there remain undesirable outcomes that could arise in their application, motivating us to further investigate and understand their underlying mechanisms. In this work, we propose a Temporal Convolutional Explorer (TCE) to empirically explore the learning behavior of 1D-CNNs from the perspective of the frequency domain. Our TCE analysis highlights that deeper 1D-CNNs tend to distract the focus from the low-frequency components leading to the accuracy degradation phenomenon, and the disturbing convolution is the driving factor. Then, we leverage our findings to the practical application and propose a regulatory framework, which can easily be integrated into existing 1D-CNNs. It aims to rectify the suboptimal learning behavior by enabling the network to selectively bypass the specified disturbing convolutions. Finally, through comprehensive experiments on widely-used UCR, UEA, and UCI benchmarks, we demonstrate that 1) TCE's insight into 1D-CNN's learning behavior; 2) our regulatory framework enables state-of-the-art 1D-CNNs to get improved performances with less consumption of memory and computational overhead.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (44)
  1. On the optimization of deep networks: Implicit acceleration by overparameterization. In International Conference on Machine Learning. PMLR, 244–253.
  2. Arthur Asuncion and David Newman. 2007. UCI machine learning repository.
  3. The UEA multivariate time series classification archive, 2018. arXiv preprint arXiv:1811.00075 (2018).
  4. The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data mining and knowledge discovery 31, 3 (2017), 606–660.
  5. Correlative channel-aware fusion for multi-view time series classification. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 6714–6722.
  6. Should we really use post-hoc tests based on mean-ranks? The Journal of Machine Learning Research 17, 1 (2016), 152–161.
  7. James W Cooley and John W Tukey. 1965. An algorithm for the machine calculation of complex Fourier series. Mathematics of computation 19, 90 (1965), 297–301.
  8. The UCR time series archive. IEEE/CAA Journal of Automatica Sinica 6, 6 (2019), 1293–1305.
  9. ROCKET: exceptionally fast and accurate time series classification using random convolutional kernels. Data Mining and Knowledge Discovery 34, 5 (2020), 1454–1495.
  10. Minirocket: A very fast (almost) deterministic transform for time series classification. In Proceedings of the 27th ACM SIGKDD conference on knowledge discovery & data mining. 248–257.
  11. Janez Demšar. 2006. Statistical comparisons of classifiers over multiple data sets. The Journal of Machine learning research 7 (2006), 1–30.
  12. Deep learning for time series classification: a review. Data Mining and Knowledge Discovery 33, 4 (2019), 917–963.
  13. Ben D Fulcher and Nick S Jones. 2017. hctsa: A computational framework for automated time-series phenotyping using massive feature extraction. Cell systems 5, 5 (2017), 527–531.
  14. Xavier Glorot and Yoshua Bengio. 2010. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the thirteenth international conference on artificial intelligence and statistics. JMLR Workshop and Conference Proceedings, 249–256.
  15. Learning time-series shapelets. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. 392–401.
  16. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
  17. Sture Holm. 1979. A simple sequentially rejective multiple test procedure. Scandinavian journal of statistics (1979), 65–70.
  18. Inceptiontime: Finding alexnet for time series classification. Data Mining and Knowledge Discovery 34, 6 (2020), 1936–1962.
  19. Adaptive activation functions accelerate convergence in deep and physics-informed neural networks. J. Comput. Phys. 404 (2020), 109136.
  20. Multivariate LSTM-FCNs for time series classification. Neural networks 116 (2019), 237–245.
  21. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
  22. Multi-scale deep neural network (mscalednn) for solving poisson-boltzmann equation in complex domains. arXiv preprint arXiv:2007.11207 (2020).
  23. Taking ROCKET on an efficiency mission: Multivariate time series classification with LightWaves. arXiv preprint arXiv:2204.01379 (2022).
  24. The IEEE standard on transitions, pulses, and related waveforms, Std-181-2003. IEEE Transactions on Instrumentation and Measurement 53, 4 (2004), 1209–1217.
  25. On the spectral bias of neural networks. In International Conference on Machine Learning. PMLR, 5301–5310.
  26. The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Mining and Knowledge Discovery 35, 2 (2021), 401–449.
  27. Patrick Schäfer. 2015. The BOSS is concerned with time series classification in the presence of noise. Data Mining and Knowledge Discovery 29, 6 (2015), 1505–1530.
  28. Patrick Schäfer. 2016. Scalable time series classification. Data Mining and Knowledge Discovery 30, 5 (2016), 1273–1298.
  29. Patrick Schäfer and Mikael Högqvist. 2012. SFA: a symbolic fourier approximation and index for similarity search in high dimensional datasets. In Proceedings of the 15th international conference on extending database technology. 516–527.
  30. Patrick Schäfer and Ulf Leser. 2017. Multivariate time series classification with WEASEL+ MUSE. arXiv preprint arXiv:1711.11343 (2017).
  31. Grad-cam: Visual explanations from deep networks via gradient-based localization. In Proceedings of the IEEE international conference on computer vision. 618–626.
  32. Generalizing DTW to the multi-dimensional case requires an adaptive approach. Data mining and knowledge discovery 31, 1 (2017), 1–31.
  33. Utkarsh Singh and Shyam Narain Singh. 2017. Application of fractional Fourier transform for classification of power quality disturbances. IET Science, Measurement & Technology 11, 1 (2017), 67–76.
  34. Going deeper with convolutions. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1–9.
  35. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE conference on computer vision and pattern recognition. 2818–2826.
  36. MultiRocket: multiple pooling operators and transformations for fast and effective time series classification. Data Mining and Knowledge Discovery 36, 5 (2022), 1623–1646.
  37. High-frequency component helps explain the generalization of convolutional neural networks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 8684–8694.
  38. Time series classification from scratch with deep neural networks: A strong baseline. In 2017 International joint conference on neural networks (IJCNN). IEEE, 1578–1585.
  39. Zhiqin John Xu and Hanxu Zhou. 2021. Deep frequency principle towards understanding why deeper learning is faster. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 10541–10550.
  40. Frequency principle: Fourier analysis sheds light on deep neural networks. arXiv preprint arXiv:1901.06523 (2019).
  41. Training behavior of deep neural network in frequency domain. In International Conference on Neural Information Processing. Springer, 264–274.
  42. Ling Yang and Shenda Hong. 2022. Unsupervised time-series representation learning with iterative bilinear temporal-spectral fusion. In International Conference on Machine Learning. PMLR, 25038–25054.
  43. A transformer-based framework for multivariate time series representation learning. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2114–2124.
  44. A type of generalization error induced by initialization in deep neural networks. In Mathematical and Scientific Machine Learning. PMLR, 144–164.
Citations (4)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.