
Interpretable Spectral Variational AutoEncoder (ISVAE) for time series clustering (2310.11940v1)

Published 18 Oct 2023 in stat.ML and cs.LG

Abstract: The best encoding is the one that is interpretable in nature. In this work, we introduce a novel model that incorporates an interpretable bottleneck, termed the Filter Bank (FB), at the outset of a Variational Autoencoder (VAE). This arrangement compels the VAE to attend to the most informative segments of the input signal, fostering the learning of a novel encoding $f_0$ that offers greater interpretability and clusterability than traditional latent spaces. By constraining the VAE with this FB, we deliberately restrict its access to broad input-domain information, promoting an encoding that is discernible, separable, and of reduced dimensionality. The evolutionary learning trajectory of $f_0$ further manifests as a dynamic hierarchical tree, offering insights into cluster similarities. Additionally, for handling intricate data configurations, we propose a tailored decoder structure that is symmetrically aligned with the FB's architecture. Empirical evaluations show that ISVAE compares favorably to state-of-the-art results in clustering metrics across real-world datasets.
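To make the idea of a filter-bank bottleneck more concrete, here is a minimal sketch of how a set of band-pass filters could restrict what a downstream encoder sees. It is an illustration only, not the authors' implementation: the abstract does not specify the spectral transform or the filter shape, so the FFT magnitude spectrum, the Gaussian masks, the fixed `width`, and the hand-picked `centers` (standing in for the learned encoding $f_0$) are all assumptions, and the function names are hypothetical.

```python
import numpy as np


def gaussian_filter_bank(n_bins, centers, width):
    """Build Gaussian band-pass masks, one row per filter (assumed shape)."""
    bins = np.arange(n_bins)[None, :]                       # (1, n_bins)
    centers = np.asarray(centers, dtype=float)[:, None]     # (n_filters, 1)
    return np.exp(-0.5 * ((bins - centers) / width) ** 2)   # (n_filters, n_bins)


def apply_filter_bank(signal, centers, width=2.0):
    """Mask the magnitude spectrum of `signal` with each filter in the bank.

    `centers` plays the role of the central frequencies f_0; here they are
    given by hand, whereas in a trained model they would be learned.
    """
    spectrum = np.abs(np.fft.rfft(signal))                  # (n_bins,)
    masks = gaussian_filter_bank(spectrum.shape[0], centers, width)
    return masks * spectrum[None, :]                        # (n_filters, n_bins)


# Toy signal with two sinusoidal components at 10 and 40 cycles per window.
t = np.arange(256) / 256.0
signal = np.sin(2 * np.pi * 10 * t) + 0.5 * np.sin(2 * np.pi * 40 * t)

# Each filter passes only a narrow band around its own center.
filtered = apply_filter_bank(signal, centers=[10, 40])
peak_bins = filtered.argmax(axis=1)
```

In this toy setting each row of `filtered` retains only the spectral content near one center, so an encoder fed these rows would have access to a narrow slice of the input rather than the full spectrum, which is the kind of capacity restriction the abstract describes.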
