Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

GuP: Fast Subgraph Matching by Guard-based Pruning (2306.06557v1)

Published 11 Jun 2023 in cs.DB, cs.DS, and cs.IR

Abstract: Subgraph matching, which finds subgraphs isomorphic to a query, is the key to information retrieval from data represented as a graph. To avoid redundant exploration in the data, existing methods restrict the search space by extracting candidate vertices and candidate edges that may constitute isomorphic subgraphs. However, it still requires expensive computation because candidate vertices induce many subgraphs that are not isomorphic to the query. In this paper, we propose GuP, a subgraph matching algorithm with pruning based on guards. Guards are a pattern of intermediate search states that never find isomorphic subgraphs. GuP attaches a guard on each candidate vertex and edge and filters out them adaptively to the search state. The experimental results showed that GuP can efficiently solve various queries, including those that the state-of-the-art methods could not solve in practical time.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (40)
  1. EmptyHeaded: A Relational Engine for Graph Processing. ACM Transactions on Database Systems 42, 4 (2017).
  2. CECI: Compact Embedding Cluster Index for Scalable Subgraph Matching. In Proceedings of the 2019 International Conference on Management of Data. 1447–1462.
  3. Efficient Subgraph Matching by Postponing Cartesian Products. In Proceedings of the 2016 International Conference on Management of Data, Vol. 1. 1199–1214.
  4. Enhancing Graph Database Indexing by Suffix Tree Structure. In Proceedings of the 5th IAPR International Conference on Pattern Recognition in Bioinformatics. 195–203.
  5. A subgraph isomorphism algorithm and its application to biochemical data. BMC Bioinformatics 14, 7 (2013), S13.
  6. Fg-index: Towards Verification-free Query Processing on Graph Databases. In Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data. 857–872.
  7. Fan Chung. 2010. Graph theory in the information age. Notices of the American Mathematical Society 57 (2010).
  8. A (Sub)Graph Isomorphism Algorithm for Matching Large Graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence 26, 10 (2004), 1367–1372.
  9. Introduction to Algorithms, Third Edition (3rd ed.).
  10. Eugene C. Freuder and Richard J. Wallace. 1995. Generalizing Inconsistency Learning for Constraint Satisfaction. In Proceedings of the 14th International Joint Conference on Artificial Intelligence - Volume 1. 563–569.
  11. J. Gaschnig. 1978. Experimental case studies of backtrack vs. Waltz-type vs. new algorithms for satisficing assignment problems. Proceedings of the Canadian Artificial Intelligence Conference (1978), 268–277.
  12. Matthew L. Ginsberg. 1993. Dynamic Backtracking. Journal of Artificial Intelligence Research 1, 1 (1993), 25–46.
  13. GRAPES: A Software for Parallel Searching on Biological Graphs Targeting Multi-Core Architectures. PLoS One 8, 10 (2013).
  14. Efficient Subgraph Matching: Harmonizing Dynamic Programming, Adaptive Matching Order, and Failing Set Together. In Proceedings of the 2019 International Conference on Management of Data. 1429–1446.
  15. TurboISO: Towards Ultrafast and Robust Subgraph Isomorphism Search in Large Graph Databases. In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data. 337–348.
  16. Huahai He and Ambuj K. Singh. 2008. Graphs-at-a-time: Query Language and Access Methods for Graph Databases. In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data. 405–418.
  17. Scalable SPARQL Querying of Large RDF Graphs. Proceedings of the VLDB Endowment 4, 11 (2011), 1123–1134.
  18. Maintaining Arc-Consistency within Dynamic Backtracking. In Principles and Practice of Constraint Programming – CP 2000, Rina Dechter (Ed.). 249–261.
  19. Subgraph Querying with Parallel Use of Query Rewritings and Alternative Algorithms. In Proceedings of 20th International Conference on Extending Database Technology. 25–36.
  20. Fast subgraph query processing and subgraph matching via static and dynamic equivalences. The VLDB Journal (2022).
  21. DUALSIM: Parallel Subgraph Enumeration in a Massive Graph on a Single Machine. In Proceedings of the 2016 International Conference on Management of Data. 1231–1245.
  22. Taming Subgraph Isomorphism for RDF Query Processing. Proceedings of the VLDB Endowment 8, 11 (2015), 1238–1249.
  23. Scalable Subgraph Enumeration in MapReduce. Proceedings of the VLDB Endowment 8, 10 (2015), 974–985.
  24. Scalable Distributed Subgraph Enumeration. Proc. VLDB Endow. 10, 3 (2016), 217–228.
  25. An In-depth Comparison of Subgraph Isomorphism Algorithms in Graph Databases. Proceedings of the VLDB Endowment 6, 2 (2012), 133–144.
  26. Ciaran McCreesh and Patrick Prosser. 2015. A Parallel, Backjumping Subgraph Isomorphism Algorithm Using Supplemental Graphs. In Principles and Practice of Constraint Programming, Gilles Pesant (Ed.). 295–312.
  27. The Glasgow Subgraph Solver: Using Constraint Programming to Tackle Hard Subgraph Isomorphism Problem Variants. In Graph Transformation, Fabio Gadducci and Timo Kehrer (Eds.). 316–324.
  28. Amine Mhedhbi and Semih Salihoglu. 2019. Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins. Proceedings of the VLDB Endowment 12, 11 (2019), 1692–1704.
  29. Krzysztof Michalak and Jerzy Korczak. 2011. Graph mining approach to suspicious transaction detection. In 2011 Federated Conference on Computer Science and Information Systems (FedCSIS). 69–75.
  30. Patrick Prosser. 1993. Hybrid algorithms for the constraint satisfaction problem. Computational intelligence 9, 3 (1993), 268–299.
  31. Real-Time Constrained Cycle Detection in Large Dynamic Graphs. Proc. VLDB Endow. 11, 12 (2018), 1876–1888.
  32. Handbook of Constraint Programming.
  33. Taming Verification Hardness: An Efficient Algorithm for Testing Subgraph Isomorphism. Proceedings of the VLDB Endowment 1, 1 (2008), 364–375.
  34. Richard M. Stallman and Gerald J. Sussman. 1977. Forward reasoning and dependency-directed backtracking in a system for computer-aided circuit analysis. Artificial Intelligence 9, 2 (1977), 135–196.
  35. Shixuan Sun and Qiong Luo. 2020. In-Memory Subgraph Matching: An In-depth Study. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 1083–1098.
  36. Shixuan Sun and Qiong Luo. 2022. Subgraph Matching with Effective Matching Order and Indexing. IEEE Transactions on Knowledge and Data Engineering 34, 1 (2022), 491–505.
  37. RapidMatch: A Holistic Approach to Subgraph Query Processing. Proceedings of the VLDB Endowment 14, 2 (2020), 176–188.
  38. Julian R. Ullmann. 1976. An Algorithm for Subgraph Isomorphism. J. ACM 23, 1 (1976), 31–42.
  39. TripleBit: A Fast and Compact System for Large Scale RDF Data. Proceedings of the VLDB Endowment 6, 7 (2013), 517–528.
  40. Peixiang Zhao and Jiawei Han. 2010. On Graph Query Optimization in Large Networks. Proceedings of the VLDB Endowment 3, 1-2 (2010), 340–351.
Citations (11)

Summary

We haven't generated a summary for this paper yet.