On Clustering Induced Voronoi Diagrams (2404.18906v1)
Abstract: In this paper, we study a generalization of the classical Voronoi diagram, called clustering induced Voronoi diagram (CIVD). Different from the traditional model, CIVD takes as its sites the power set $U$ of an input set $P$ of objects. For each subset $C$ of $P$, CIVD uses an influence function $F(C,q)$ to measure the total (or joint) influence of all objects in $C$ on an arbitrary point $q$ in the space $\mathbb{R}d$, and determines the influence-based Voronoi cell in $\mathbb{R}d$ for $C$. This generalized model offers a number of new features (e.g., simultaneous clustering and space partition) to Voronoi diagram which are useful in various new applications. We investigate the general conditions for the influence function which ensure the existence of a small-size (e.g., nearly linear) approximate CIVD for a set $P$ of $n$ points in $\mathbb{R}d$ for some fixed $d$. To construct CIVD, we first present a standalone new technique, called approximate influence (AI) decomposition, for the general CIVD problem. With only $O(n\log n)$ time, the AI decomposition partitions the space $\mathbb{R}{d}$ into a nearly linear number of cells so that all points in each cell receive their approximate maximum influence from the same (possibly unknown) site (i.e., a subset of $P$). Based on this technique, we develop assignment algorithms to determine a proper site for each cell in the decomposition and form various $(1-\epsilon)$-approximate CIVDs for some small fixed $\epsilon>0$. Particularly, we consider two representative CIVD problems, vector CIVD and density-based CIVD, and show that both of them admit fast assignment algorithms; consequently, their $(1-\epsilon)$-approximate CIVDs can be built in $O(n \log{\max{3,d+1}}n)$ and $O(n \log{2} n)$ time, respectively.
- R. Andersen, D.F. Gleich, and V. Mirrokni, “Overlapping Clusters for Distributed Computation,” Proc. 5th ACM International Conference in Web Search and Data Mining, 2012, pp. 273-282.
- S. Arya and T. Malamatos, “Linear-Size Approximate Voronoi Diagrams,” Proceedings of the 13th annual ACM-SIAM symposium on Discrete algorithms (SODA’02), pp. 147–155, 2002.
- S. Arya, T. Malamatos, and D. M. Mount, “Space-Efficient Approximate Voronoi Diagrams,” Proc. 34th ACM Symp. on Theory of Computing (STOC 2002), pp. 721–730, 2002.
- S. Arya, D. M. Mount, N. S. Netanyahu, R. Silverman, and A. Wu, “An Optimal Algorithm for Approximate Nearest Neighbor Searching,” Journal of the ACM, 45 (1998), pp. 891–923.
- F. Aurenhammer, “Power Diagrams: Properties, Algorithms and Applications,” SIAM J. on Computing, 16(1)(1987), 78-96.
- F. Aurenhammer, “Voronoi Diagrams – A Survey of a Fundamental Geometric Data Structure,” ACM Computing Surveys, 23(1991), 345-405.
- A. Banerjee, C. Krumpelman, S. Basu, R.J. Mooney, and J. Ghosh, “Model-based Overlapping Clustering,” Proc. 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2005, pp. 532–537.
- G. Barequet, M.T. Dickerson, and R.L.S. Drysdale III, “2-Point Site Voronoi Diagrams,” Discrete Applied Mathematics, 122(1-3)(2002), 37-54.
- G. Barequet, M.T. Dickerson, D. Eppstein, D. Hodorkovsky, and K. Vyatkina, “On 2-Site Voronoi Diagrams under Geometric Distance Functions,” Proc. 8th International Symp. on Voronoi Diagrams in Science and Engineering, 2011, pp. 31-38.
- F. Bonchi, A. Gionis, and A. Ukkonen, “Overlapping Correlation Clustering,” Proc. IEEE 11th International Conference on Data Mining, 2011, pp. 51-60.
- P. Callahan and R. Kosaraju, “A Decomposition of Multidimensional Point Sets with Applications to k𝑘kitalic_k-nearest-neighbors and n𝑛nitalic_n-body Potential Fields,” JACM, 42(1)(1995), 67-90.
- F. Cao, M. Ester, W. Qian, and A. Zhou, “Density-based Clustering over an Evolving Data Stream with Noise,” Proceedings of the 6th SIAM International Conference on Data Mining, 2006, pp. 328-339.
- D.Z. Chen, M.H.M. Smid, and Bin Xu, “Geometric Algorithms for Density-based Data Clustering,” Int. J. Comput. Geometry Appl., 15(3)(2005), 239-260.
- N. Chen, J. Zhu, F. Sun, and E.P. Xing, “Large-margin Predictive Latent Subspace Learning for Multi-view Data Analysis,” IEEE Transaction on Pattern Analysis and Machine Intelligence, 34(12)(2012), 2365-2378.
- Y. Chen and L. Tu, “Density-based Clustering for Real-time Stream Data,” Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2007, pp. 133-142.
- C.M. Christoudias, R. Urtasun, and T. Darrell, “Multi-view Learning in the Presence of View Disagreement,” arXiv:1206.3242, June 2012.
- G. Cleuziou, L. Martin, and C. Vrain, “Poboc: An Overlapping Clustering Algorithm,” Proc. 16th European Conference on Artificial Intelligence, 2004, pp. 440-444.
- M.T. Dickerson and D. Eppstein, “Animating a Continuous Family of Two-site Voronoi Diagrams (and a Proof of a Bound on the Number of Regions),” Proc. 25th ACM Symp. Computational Geometry, 2009, pp. 92-93.
- M.T. Dickerson and M.T. Goodrich, “Two-site Voronoi Diagrams in Geographic Networks”, Proc. 16th ACM SIGSPATIAL International Conf. Advances in Geographic Information Systems, 2008, doi:10.1145/1463434.1463504.
- H. Ding and J. Xu, “Solving Chromatic Cone Clustering via Minimum Spanning Sphere,” Proc. 38th International Colloquium on Automata, Languages and Programming (ICALP), 2011, pp. 773-784.
- D. Greene and P. Cunningham, “Multi-view Clustering for Mining Heterogeneous Social Network Data,” Proc. 31st European Conference on Information Retrieval, Workshop on Information Retrieval over Social Networks, LNCS, Vol. 5478, 2009.
- L. Greengard. “The Numerical Solution of the N-body Problem,” Computers in Physics, 4, pp. 142–152 (1990).
- L. Greengard. “Fast Algorithms for Classical Physics.” Science 265, 909–914 (1994).
- I. Hanniel and G. Barequet, “On the Triangle-Perimeter Two-site Voronoi Diagram,” Trans. on Computational Science, 9(2010), 54-75.
- S. Har-Peled, “A Replacement for Voronoi Diagrams of Near Linear Size,” Proc. 42nd Annu. IEEE Sympos. Found. Comput. Sci. (FOCS 2001), pp. 94–103, 2001.
- Sariel Har-Peled and Nirman Kumar, “Down the Rabbit Hole: Robust Proximity Search and Density Estimation in Sublinear Space.” FOCS 2012: 430-439.
- D. Hodorkovsky, “2-Point Site Voronoi Diagrams,” M.Sc. Thesis, Technion, Haifa, Israel, 2005.
- P. Indyk and R. Motwani, “Approximate Nearest Neighbors: Towards Removing the Curse of Dimensionality,” Proc. 30th ACM Symp. on Theory of Computing (STOC 1998), pp. 604–613, 1998.
- H.-P. Kriegel and M. Pfeifle, “Density-based Clustering of Uncertain Data,” Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery in Data Mining, 2005, pp. 672-677.
- D.T. Lee and R.L.S. Drysdale, III, “Generalization of Voronoi Diagrams in the Plane,” SIAM J. Comput., 10(1)(1981), 73-87.
- A.Y. Liu and D.N. Lam, “Using Consensus Clustering for Multi-view Anomaly Detection,” Proc. IEEE CS Security and Privacy Workshops, 2012, pp. 117-125.
- E. Papadopoulou, “The Hausdorff Voronoi Diagram of Point Clusters in the Plane,” Algorithmica, 40(2004), 63-82.
- J. Sander, M. Ester, H.-P. Kriegel, and X. Xu, “Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications,” Data Mining and Knowledge Discovery, 2(2)(1998), 169-194.
- K. Vyatkina and G. Barequet, “On 2-Site Voronoi Diagrams under Arithmetic Combinations of Point-to-Point Distances,” Proc. 7th International Symp. Voronoi Diagrams in Science and Engineering, 2010, pp. 33-41.