Efficient $k$-Clique Listing: An Edge-Oriented Branching Strategy (2311.13798v2)
Abstract: $k$-clique listing is a vital graph mining operator with diverse applications in various networks. The state-of-the-art algorithms all adopt a branch-and-bound (BB) framework with a vertex-oriented branching strategy (called VBBkC), which forms a sub-branch by expanding a partial $k$-clique with a vertex. These algorithms have the time complexity of $O(k m (\delta/2){k-2})$, where $m$ is the number of edges in the graph and $\delta$ is the degeneracy of the graph. In this paper, we propose a BB framework with a new edge-oriented branching (called EBBkC), which forms a sub-branch by expanding a partial $k$-clique with two vertices that connect each other (which correspond to an edge). We explore various edge orderings for EBBkC such that it achieves a time complexity of $O(\delta m + k m (\tau/2){k-2})$, where $\tau$ is an integer related to the maximum truss number of the graph and we have $\tau < \delta$. The time complexity of EBBkC is better than that of VBBkC algorithms for $k>3$ since both $O(\delta m)$ and $O(k m (\tau/2){k-2})$ are bounded by $O(k m (\delta/2){k-2})$. Furthermore, we develop specialized algorithms for sub-branches on dense graphs so that we can early-terminate them and apply the specialized algorithms. We conduct extensive experiments on 19 real graphs, and the results show that our newly developed EBBkC-based algorithms with the early termination technique consistently and largely outperform the state-of-the-art (VBBkC-based) algorithms.
- Real Graphs. http://lcs.ios.ac.cn/~caisw/Resource/realworld%20graphs.tar.gz.
- CFinder: locating cliques and overlapping modules in biological networks. Bioinformatics 22, 8 (2006), 1021–1023.
- Dense subgraph maintenance under streaming edge weight updates for real-time story identification. The VLDB journal 23, 2 (2014), 175–199.
- Vladimir Batagelj and Matjaz Zaversnik. 2003. An O (m) algorithm for cores decomposition of networks. arXiv preprint cs/0310049 (2003).
- Higher-order organization of complex networks. Science 353, 6295 (2016), 163–166.
- Coen Bron and Joep Kerbosch. 1973. Algorithm 457: finding all cliques of an undirected graph. Commun. ACM 16, 9 (1973), 575–577.
- Lijun Chang. 2019. Efficient maximum clique computation over large sparse graphs. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 529–538.
- Lijun Chang. 2020. Efficient maximum clique computation and enumeration over large sparse graphs. The VLDB Journal 29, 5 (2020), 999–1022.
- Accelerating truss decomposition on heterogeneous processors. Proceedings of the VLDB Endowment 13, 10 (2020), 1751–1764.
- Norishige Chiba and Takao Nishizeki. 1985. Arboricity and subgraph listing algorithms. SIAM Journal on computing 14, 1 (1985), 210–223.
- Jonathan Cohen. 2008. Trusses: Cohesive subgraphs for social network analysis. National security agency technical report 16, 3.1 (2008).
- Listing k-cliques in sparse real-world graphs. In Proceedings of the 2018 World Wide Web Conference. 589–598.
- Listing all maximal cliques in sparse graphs in near-optimal time. In Algorithms and Computation: 21st International Symposium, ISAAC 2010, Jeju Island, Korea, December 15-17, 2010, Proceedings, Part I 21. Springer, 403–414.
- Paul Erdös and George Szekeres. 1935. A combinatorial problem in geometry. Compositio mathematica 2 (1935), 463–470.
- Efficient algorithms for densest subgraph discovery. Proceedings of the VLDB Endowment 12, 11 (2019), 1719–1732.
- Clique counting in mapreduce: Algorithms and experiments. Journal of Experimental Algorithmics (JEA) 20 (2015), 1–20.
- Parallel k-clique community detection on large-scale networks. IEEE Transactions on Parallel and Distributed Systems 24, 8 (2012), 1651–1660.
- Ordering heuristics for parallel graph coloring. In Proceedings of the 26th ACM symposium on Parallelism in algorithms and architectures. 166–177.
- Pan Hui and Jon Crowcroft. 2008. Human mobility models and opportunistic communications system design. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 366, 1872 (2008), 2005–2016.
- SK Jayanthi. 2012. Clique-attacks detection in web search engine for spamdexing using k-clique percolation technique. International Journal of Machine Learning and Computing 2, 5 (2012), 648.
- On fast enumeration of maximal cliques in large graphs. Expert Systems with Applications 187 (2022), 115915.
- Richard M Karp. 2010. Reducibility among combinatorial problems. Springer.
- Matthieu Latapy. 2008. Main-memory triangle computations for very large (sparse (power-law)) graphs. Theoretical computer science 407, 1-3 (2008), 458–473.
- Ordering Heuristics for k-clique Listing. Proc. VLDB Endow. (2020).
- Finding the maximum clique in massive graphs. Proceedings of the VLDB Endowment 10, 11 (2017), 1538–1549.
- Kazuhisa Makino and Takeaki Uno. 2004. New algorithms for enumerating all maximal cliques. In Scandinavian workshop on algorithm theory. Springer, 260–272.
- Kevin A Naudé. 2016. Refined pivot selection for maximal clique enumeration in graphs. Theoretical Computer Science 613 (2016), 28–37.
- Mark Ortmann and Ulrik Brandes. 2014. Triangle listing algorithms: Back from the diversion. In 2014 Proceedings of the Sixteenth Workshop on Algorithm Engineering and Experiments (ALENEX). SIAM, 1–8.
- Patric RJ Östergård. 2002. A fast algorithm for the maximum clique problem. Discrete Applied Mathematics 120, 1-3 (2002), 197–207.
- Uncovering the overlapping community structure of complex networks in nature and society. nature 435, 7043 (2005), 814–818.
- Fast algorithms for the maximum clique problem on massive graphs with applications to overlapping community detection. Internet Mathematics 11, 4-5 (2015), 421–448.
- Ryan Rossi and Nesreen Ahmed. 2015. The network data repository with interactive graph analytics and visualization. In Twenty-Ninth AAAI Conference on Artificial Intelligence.
- A large-scale study of link spam detection by graph algorithms. In Proceedings of the 3rd international workshop on Adversarial information retrieval on the web. 45–48.
- Finding the hierarchy of dense subgraphs using nucleus decompositions. In Proceedings of the 24th International Conference on World Wide Web. 927–937.
- A scalable, parallel algorithm for maximal clique enumeration. Journal of parallel and distributed computing 69, 4 (2009), 417–428.
- UNO Takeaki. 2012. Implementation issues of clique enumeration algorithm. Special issue: Theoretical computer science and discrete mathematics, Progress in Informatics 9 (2012), 25–30.
- The worst-case time complexity for generating all maximal cliques and computational experiments. Theoretical computer science 363, 1 (2006), 28–42.
- Charalampos Tsourakakis. 2015. The k-clique densest subgraph problem. In Proceedings of the 24th international conference on world wide web. 1122–1132.
- Denser than the densest subgraph: extracting optimal quasi-cliques with quality guarantees. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining. 104–112.
- Jia Wang and James Cheng. 2012. Truss decomposition in massive networks. Proceedings of the VLDB Endowment 5, 9 (2012), 812–823.
- Efficient k-Clique Listing: An Edge-Oriented Branching Strategy (Technical Report). https://github.com/wangkaixin219/EBBkC/blob/main/EBBkC_TR.pdf.
- Local search for diversified top-k clique search problem. Computers & Operations Research 116 (2020), 104867.
- Diversified top-k clique search. The VLDB Journal 25, 2 (2016), 171–196.
- Effective and efficient dynamic graph coloring. Proceedings of the VLDB Endowment 11, 3 (2017), 338–351.
- Efficient k-clique listing with set intersection speedup. ICDE. IEEE (2022).
- Xiao Zhou and Takao Nishizeki. 1994. Edge-coloring and f-coloring for various classes of graphs. In Algorithms and Computation: 5th International Symposium, ISAAC’94 Beijing, PR China, August 25–27, 1994 Proceedings 5. Springer, 199–207.