Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fuzzy clustering of circular time series based on a new dependence measure with applications to wind data (2402.08687v1)

Published 26 Jan 2024 in stat.AP and cs.LG

Abstract: Time series clustering is an essential machine learning task with applications in many disciplines. While the majority of the methods focus on time series taking values on the real line, very few works consider time series defined on the unit circle, although the latter objects frequently arise in many applications. In this paper, the problem of clustering circular time series is addressed. To this aim, a distance between circular series is introduced and used to construct a clustering procedure. The metric relies on a new measure of serial dependence considering circular arcs, thus taking advantage of the directional character inherent to the series range. Since the dynamics of the series may vary over the time, we adopt a fuzzy approach, which enables the procedure to locate each series into several clusters with different membership degrees. The resulting clustering algorithm is able to group series generated from similar stochastic processes, reaching accurate results with series coming from a broad variety of models. An extensive simulation study shows that the proposed method outperforms several alternative techniques, besides being computationally efficient. Two interesting applications involving time series of wind direction in Saudi Arabia highlight the potential of the proposed approach.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (41)
  1. T. W. Liao, Clustering of time series data—a survey, Pattern recognition 38 (2005) 1857–1874.
  2. Time-series clustering–a decade review, Information systems 53 (2015) 16–38.
  3. Fuzzy clustering of time series data using dynamic time warping distance, Engineering Applications of Artificial Intelligence 39 (2015) 235–244.
  4. M. Łuczak, Hierarchical clustering of time series data with parametric derivative dynamic time warping, Expert Systems with Applications 62 (2016) 116–130.
  5. M. Corduas, D. Piccolo, Time series clustering and classification by the autoregressive metric, Computational statistics & data analysis 52 (2008) 1860–1872.
  6. Autoregressive model-based fuzzy clustering and its application for detecting information redundancy in air pollution monitoring networks, Soft Computing 17 (2013) 83–131.
  7. Garch-based robust clustering of time series, Fuzzy Sets and Systems 305 (2016) 1–28.
  8. P. D’Urso, E. A. Maharaj, Autocorrelation-based fuzzy clustering of time series, Fuzzy Sets and Systems 160 (2009) 3565–3589.
  9. P. D’Urso, E. A. Maharaj, Wavelets-based clustering of multivariate time series, Fuzzy Sets and Systems 193 (2012) 33–61.
  10. Quantile-based fuzzy clustering of multivariate time series in the frequency domain, Fuzzy Sets and Systems 443 (2022a) 115–154.
  11. Quantile-based fuzzy c-means clustering of multivariate time series: Robust techniques, International Journal of Approximate Reasoning 150 (2022b) 55–82.
  12. A. Singhal, D. E. Seborg, Clustering multivariate time-series data, Journal of Chemometrics: A Journal of the Chemometrics Society 19 (2005) 427–438.
  13. Improved time-series clustering with umap dimension reduction method, in: 2020 25th International Conference on Pattern Recognition (ICPR), IEEE, 2021, pp. 5658–5665.
  14. C. Etienne, O. Latifa, Model-based count series clustering for bike sharing system usage mining: a case study with the vélib’system of paris, ACM Transactions on Intelligent Systems and Technology (TIST) 5 (2014) 1–21.
  15. Ingarch-based fuzzy clustering of count time series with a football application, Machine Learning with Applications 10 (2022) 100417.
  16. C. Pamminger, Frühwirth-Schnatter, Model-based clustering of categorical time series, Bayesian Analysis 5 (2010) 345–368.
  17. M. García-Magariños, J. A. Vilar, A framework for dissimilarity-based partitioning clustering of categorical time series, Data mining and knowledge discovery 29 (2015) 466–502.
  18. N. Fisher, A. Lee, Time series analysis of circular data, Journal of the Royal Statistical Society: Series B (Methodological) 56 (1994) 327–339.
  19. Hidden markov models for circular and linear-circular time series, Environmental and Ecological Statistics 13 (2006) 325–347.
  20. Modelling circular time series, Journal of Econometrics (2023).
  21. Offshore wind data assessment near the iberian peninsula over the last 25 years, Environmental Research: Climate 2 (2023) 025008.
  22. R. Y. Liu, K. Singh, Ordering directional data: concepts of data depth on circles and spheres, The Annals of Statistics 20 (1992) 1468–1484.
  23. A boxplot for circular data, Biometrics 74 (2018) 1492–1501.
  24. R. Koenker, Z. Xiao, Quantile autoregression, Journal of the American statistical association 101 (2006) 980–990.
  25. N. I. Fisher, A. J. Lee, A correlation coefficient for circular data, Biometrika 70 (1983) 327–332.
  26. S. R. Jammalamadaka, Y. R. Sarma, A correlation coefficient for angular variables, Statistical theory and data analysis II (1988) 349–364.
  27. A fuzzy relative of the k-medoids algorithm with application to web document and snippet clustering, in: FUZZ-IEEE’99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No. 99CH36315), volume 3, IEEE, 1999, pp. 1281–1286.
  28. J. C. Dunn, A fuzzy relative of the isodata process and its use in detecting compact well-separated clusters (1973).
  29. Two novel distances for ordinal time series and their application to fuzzy clustering, Fuzzy Sets and Systems (2023) 108590.
  30. X. L. Xie, G. Beni, A validity measure for fuzzy clustering, IEEE Transactions on Pattern Analysis & Machine Intelligence 13 (1991) 841–847.
  31. Á. López-Oriona, J. A. Vilar, Quantile cross-spectral density: A novel and effective tool for clustering multivariate time series, Expert Systems with Applications 185 (2021) 115677.
  32. Hard and soft clustering of categorical time series based on two novel distances with an application to biological sequences, Information Sciences 624 (2023) 467–492.
  33. B. Lafuente-Rego, J. A. Vilar, Clustering of time series using quantile autocovariances, Advances in Data Analysis and classification 10 (2016) 391–415.
  34. Quantile autocovariances: a powerful tool for hard and soft partitional clustering of time series, Fuzzy Sets and Systems 340 (2018) 38–72.
  35. Robust fuzzy clustering based on quantile autocovariances, Statistical papers 61 (2020) 2393–2448.
  36. T. Bollerslev, Generalized autoregressive conditional heteroskedasticity, Journal of econometrics 31 (1986) 307–327.
  37. E. A. Maharaj, P. D’Urso, Fuzzy clustering of time series in the frequency domain, Information Sciences 181 (2011) 1187–1211.
  38. Overlapping clustering: A new method for product positioning, Journal of Marketing Research 18 (1981) 310–317.
  39. Spatial weighted robust clustering of multivariate time series based on quantile dependence with an application to mobility during covid-19 pandemic, IEEE Transactions on Fuzzy Systems 30 (2022) 3990–4004. doi:10.1109/TFUZZ.2021.3136005.
  40. R. J. Campello, A fuzzy extension of the rand index and other related indexes for clustering and classification assessment, Pattern Recognition Letters 28 (2007) 833–841.
  41. A fuzzy clustering model for multivariate spatial time series, Journal of Classification 27 (2010) 54–88.

Summary

We haven't generated a summary for this paper yet.