2000 character limit reached
Finding a Cluster in Incomplete Data (2312.07628v1)
Published 12 Dec 2023 in cs.DS
Abstract: We study two variants of the fundamental problem of finding a cluster in incomplete data. In the problems under consideration, we are given a multiset of incomplete $d$-dimensional vectors over the binary domain and integers $k$ and $r$, and the goal is to complete the missing vector entries so that the multiset of complete vectors either contains (i) a cluster of $k$ vectors of radius at most $r$, or (ii) a cluster of $k$ vectors of diameter at most $r$. We give tight characterizations of the parameterized complexity of the problems under consideration with respect to the parameters $k$, $r$, and a third parameter that captures the missing vector entries.
- kđkitalic_k-subspaces with missing data. 2012 IEEE Statistical Signal Processing Workshop (SSP), pages 612â615, 2012.
- Closest string with outliers. BMC Bioinformatics, 12(S-1):S55, 2011.
- Consensus strings with small maximum distance and small distance sum. Algorithmica, 82(5):1378â1409, 2020.
- Geometric clustering: Fixed-parameter tractability and lower bounds with respect to the dimension. ACM Trans. Algorithms, 7(4):43:1â43:27, 2011.
- Clustering to minimize the sum of cluster diameters. Journal of Computer and System Sciences, 68(2):417 â 441, 2004.
- Parameterized Algorithms. Springer, 2015.
- Fundamentals of Parameterized Complexity. Texts in Computer Science. Springer, 2013.
- A simple heuristic for the pđpitalic_p-centre problem. Oper. Res. Lett., 3(6):285â288, 1985.
- The parameterized complexity of clustering incomplete data. In The Thirty-Fifth AAAI Conference on Artificial Intelligence, AAAI 2021, pages 7296â7304. AAAI Press, 2021.
- Ehsan Elhamifar. High-rank matrix completion and clustering under self-expressive models. In D. D. Lee, M. Sugiyama, U. V. Luxburg, I. Guyon, and R. Garnett, editors, Advances in Neural Information Processing Systems 29, pages 73â81. Curran Associates, Inc., 2016.
- Sparse subspace clustering: Algorithm, theory, and applications. IEEE Trans. Pattern Anal. Mach. Intell., 35(11):2765â2781, 2013.
- Optimal algorithms for approximate clustering. In Proceedings of the Twentieth Annual ACM Symposium on Theory of Computing, STOC â88, pages 434â444. ACM, 1988.
- Parameterized Complexity Theory, volume XIV of Texts in Theoretical Computer Science. An EATCS Series. Springer, Berlin, 2006.
- M. Frances and A. Litman. On covering problems of codes. Theory of Computing Systems, 30(2):113â119, 1997.
- Parameterized algorithms for the matrix completion problem. In ICML, volume 80 of JMLR Workshop and Conference Proceedings, pages 1642â1651, 2018.
- Efficient approximation algorithms for the Hamming center problem. In Proceedings of the Tenth Annual ACM-SIAM Symposium on Discrete Algorithms, pages 905â906, 1999.
- Approximation algorithms for Hamming clustering problems. Journal of Discrete Algorithms, 2(2):289 â 301, 2004.
- Teofilo F. Gonzalez. Clustering to minimize the maximum intercluster distance. Theoretical Computer Science, 38:293 â 306, 1985.
- Fixed-parameter algorithms for CLOSEST STRING and related problems. Algorithmica, 37(1):25â42, 2003.
- Parameterized complexity analysis for the closest string with wildcards problem. Theoretical Computer Science, 600:11â18, 2015.
- Parameterized algorithms for matrix completion with radius constraints. In Inge Li Gørtz and Oren Weimann, editors, 31st Annual Symposium on Combinatorial Pattern Matching, CPM 2020, June 17-19, 2020, Copenhagen, Denmark, volume 161 of LIPIcs, pages 20:1â20:14. Schloss Dagstuhl - Leibniz-Zentrum fĂźr Informatik, 2020.
- Binary matrix completion under diameter constraints. In Markus Bläser and Benjamin Monmege, editors, 38th International Symposium on Theoretical Aspects of Computer Science, STACS 2021, March 16-19, 2021, SaarbrĂźcken, Germany (Virtual Conference), volume 187 of LIPIcs, pages 47:1â47:14. Schloss Dagstuhl - Leibniz-Zentrum fĂźr Informatik, 2021.
- Parameterized complexity and kernelizability of max ones and exact ones problems. TOCT, 8(1):1:1â1:28, 2016.
- H. W. Lenstra and Jr. Integer programming with a fixed number of variables. Math. Oper. Res., 8(4):538â548, 1983.
- On the closest string and substring problems. J. ACM, 49(2):157â171, 2002.
- DĂĄniel Marx. Parameterized complexity of constraint satisfaction problems. Computational Complexity, 14(2):153â183, 2005.
- DĂĄniel Marx. Closest substring problems with small distances. SIAM J. Comput., 38(4):1382â1410, 2008.
- Robust ensemble clustering by matrix completion. In 2012 IEEE 12th International Conference on Data Mining, pages 1176â1181, 2012.