Adaptive Quantum Optimized Centroid Initialization (2401.11258v1)
Abstract: One of the major benefits of quantum computing is the potential to resolve complex computational problems faster than can be done by classical methods. There are many prototype-based clustering methods in use today, and selection of the starting nodes for the center points is often done randomly. For prototype-based clustering algorithms, this could lead to much slower convergence times. One of the causes of this may be prototype-based clustering accepting a local minima as a valid solution when there are possibly better solutions. Quantum computing, specifically quantum annealing, offers a solution to these problems by mapping the initial centroid problem to an Ising Hamiltonian where over time the lowest energy in the spectrum correlates to a valid, but better solution. A first approach to this problem utilizing quantum annealing was known as Quantum Optimized Centroid Initialization (QOCI), but this approach has some limitations both in results and performance. We will present a modification of QOCI known as Adaptive Quantum Optimized Centroid Initialization (AQOCI) which addresses many of the limitations in QOCI. The results presented are comparable to those obtained using classical techniques as well as being superior to those results found using QOCI.
- Nicholas R. Allgood, Ajinkya Borle & Charles K. Nicholas (2023): Quantum Optimized Centroid Initialization. In Kohei Arai, editor: Proceedings of the Future Technologies Conference (FTC) 2023, Volume 2, Springer Nature Switzerland, Cham, pp. 71–85.
- Christian Bauckhage (2015): k-Means Clustering Is Matrix Factorization, 10.48550/ARXIV.1512.07548.
- Ajinkya Borle & Samuel J. Lomonaco (2018): Analyzing the Quantum Annealing Approach for Solving Linear Least Squares Problems. arXiv 1809.07649.
- Elżbieta Burek, Michał Misztal & Michał Wroński (2021): Algebraic attacks on block ciphers using quantum annealing. Cryptology ePrint Archive, Report 2021/620. https://ia.cr/2021/620.
- Knowledge-Based Systems 194, p. 105567, https://doi.org/10.1016/j.knosys.2020.105567. Available at https://www.sciencedirect.com/science/article/pii/S0950705120300587.
- Scientific Reports 9(1), p. 10258, 10.1038/s41598-019-46729-0. Available at https://doi.org/10.1038/s41598-019-46729-0.
- D-Wave: D-wave. Available at https://www.dwave.com.
- Edward W. Forgy (1965): Cluster analysis of multivariate data: efficiency versus interpretability of classifications. Biometrics 21(3), pp. 768–769.
- Computers and Security 124, Issue C.
- Jain A. K. & Dubes R. C. (1988): Algorithms for Clustering Data. Prentice-Hall.
- Rousseeuw PJ Kaufman L (1987): Clustering by means of medoids. Proc. Statistical Data Analysis Based on the L1 Norm Conference, Neuchatel, 1987, pp. 405–416.
- Jingu Kim & Haesun Park (2008): Sparse Nonnegative Matrix Factorization for Clustering. Technical Report GT-CSE-08-01, Georgia Institute of Technology. Available at https://smartech.gatech.edu/handle/1853/20058.
- Daniel D. Lee & H. Sebastian Seung (2000): Algorithms for Non-Negative Matrix Factorization. In: Proceedings of the 13th International Conference on Neural Information Processing Systems, NIPS’00, MIT Press, Cambridge, MA, USA, p. 535–541.
- S Lloyd (1982): Least squares quantization in PCM. IEEE Transactions on Information Theory 28(2), pp. 129–137.
- J. B. MacQueen (1967): Some Methods for classification and Analysis of Multivariate Observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability: 281-297, University of California Press.
- Michael A. Nielsen & Isaac L. Chuang (2000): Quantum Computation and Quantum Information. Cambridge University Press.
- Gintaras Palubeckis (2004): Multistart Tabu Search Strategies for the Unconstrained Binary Quadratic Optimization Problem. In: Annals of Operations Research. Available at https://doi.org/10.1023/B:ANOR.0000039522.58036.68.
- Physical Review A 104(3), 10.1103/physreva.104.032426. Available at https://doi.org/10.1103%2Fphysreva.104.032426.
- Expert Systems with Applications 37(7), pp. 4966–4973, https://doi.org/10.1016/j.eswa.2009.12.017. Available at https://www.sciencedirect.com/science/article/pii/S095741740901063X.