Graph Kernels: A Survey (1904.12218v2)

Published 27 Apr 2019 in stat.ML and cs.LG

Abstract: Graph kernels have attracted a lot of attention during the last decade, and have evolved into a rapidly developing branch of learning on structured data. During the past 20 years, the considerable research activity that occurred in the field resulted in the development of dozens of graph kernels, each focusing on specific structural properties of graphs. Graph kernels have proven successful in a wide range of domains, ranging from social networks to bioinformatics. The goal of this survey is to provide a unifying view of the literature on graph kernels. In particular, we present a comprehensive overview of a wide range of graph kernels. Furthermore, we perform an experimental evaluation of several of those kernels on publicly available datasets, and provide a comparative study. Finally, we discuss key applications of graph kernels, and outline some challenges that remain to be addressed.

Citations (105)

View on Semantic Scholar

Summary

The paper provides a comprehensive review and categorization of graph kernels based on structural features and empirical performance.
It examines the trade-off between computational efficiency and expressiveness, highlighting the strengths of methods like the Weisfeiler-Lehman kernel.
The survey outlines practical applications in domains such as bioinformatics, cybersecurity, and image classification, while suggesting future research directions.

An Expert Overview of "Graph Kernels: A Survey"

The paper "Graph Kernels: A Survey" provides a comprehensive review of the development and application of graph kernels, a technique for machine learning on graph-structured data. Over the past two decades, the field has seen the creation of numerous graph kernels, each addressing specific structural nuances of graphs. This survey categorizes these kernels, evaluates their performance across several datasets, and discusses their applicability in various domains, offering valuable insights for both practitioners and researchers.

Categorization and Evaluation of Graph Kernels

The survey categorizes graph kernels based on the structural components they focus on, such as random walks, subtrees, cycles, paths, and small subgraphs. This categorization aids in understanding the diverse methodologies and the contexts in which each kernel is most effective. For instance, random walk kernels assess similarity by counting common walks, while subtree kernels focus on matching subtree patterns.

An essential aspect of the paper is its experimental evaluation of various graph kernels. By testing these kernels on benchmark datasets, the authors provide empirical insights into their performance and computational efficiency. The findings reveal that kernels employing neighborhood aggregation mechanisms, like the Weisfeiler-Lehman subtree kernel, generally deliver robust performance across different tasks. However, they also highlight the computational trade-offs, as some kernels remain inefficient for large datasets.

Insights into Computational Complexity and Expressiveness

The survey explores the computational complexity and expressiveness of graph kernels. It acknowledges the trade-off between the expressiveness of a kernel and its computational feasibility. While complete graph kernels could theoretically distinguish all graph isomorphisms, such computations are intractable in practice. The paper discusses advances in kernel design that strive for a balance, offering both efficiency and meaningful graph similarity measures. The Weisfeiler-Lehman framework, for example, optimizes this balance effectively for many real-world applications.

Application Domains and Future Research Directions

Graph kernels have been successfully applied in fields such as bioinformatics, chemoinformatics, cybersecurity, and computer vision. They have been used for tasks like molecular property prediction, protein function prediction, malware detection, and image classification, demonstrating their versatility and effectiveness in deriving insights from graph-structured data.

Despite these successes, the paper identifies several future research directions. Notably, there is a pressing need for developing scalable graph kernels that can handle graphs with continuous attributes efficiently. Moreover, the paper suggests further exploration of assignment kernels, which provide promising complexity advantages over $R$ -convolution kernels.

Conclusion

In conclusion, this survey offers a detailed exploration of graph kernels, highlighting their development, theoretical foundations, and practical applications. It serves as a crucial resource for understanding the current landscape of graph kernels, providing both a performance-oriented evaluation and guidance on their application in real-world scenarios. As the demand for robust graph analysis tools continues to grow, this paper underscores the need for ongoing innovation in the field of graph kernels.

PDF Markdown

Related Papers

YouTube

Show All Videos