- The paper introduces TransG, a generative model that uses Bayesian non-parametrics to capture multiple latent semantics in knowledge graph relations.
- It employs a Chinese Restaurant Process to dynamically infer the number of semantic components per relation, yielding reported gains such as a 26.0% HITS@10 improvement on FB15K for link prediction.
- TransG outperforms traditional models such as TransE, TransH, and TransR, offering promising implications for semantic web and information retrieval applications.
TransG: A Generative Model for Knowledge Graph Embedding
The paper "TransG: A Generative Model for Knowledge Graph Embedding" proposes a novel approach for embedding knowledge graphs that addresses the inherent complexity of relations possessing multiple semantic meanings. Authored by Han Xiao, Minlie Huang, and Xiaoyan Zhu, this work introduces TransG, a generative model that leverages Bayesian non-parametric methods to discern latent semantics within knowledge graph triples. This discussion not only presents a new perspective on embedding relations with multiple meanings but also offers substantial improvements over existing methods.
The core motivation for TransG stems from a limitation shared by existing translation-based models such as TransE, TransH, and TransR. These models assign a single, fixed translation vector to each relation, neglecting the possibility that a relation may express multiple latent semantics depending on the entities it connects. Visualizations in the paper's experiments show that, for a single relation, the entity-pair differences form several distinct clusters, evidence of latent semantics that a single vector cannot capture and that calls for a more expressive model.
Methodological Approach
TransG distinguishes itself by applying a Bayesian non-parametric infinite mixture model to knowledge graph embedding. Under this model, each relation is represented by multiple components, each a distinct translation vector capturing one of the relation's semantics. Crucially, TransG uses a Chinese Restaurant Process (CRP) to automatically infer the number of semantic components for each relation, adapting the representation to the complexity of the data.
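To make the CRP step concrete, below is a minimal Python sketch of component assignment under a standard CRP. The function name `crp_assign` and the concentration parameter `alpha` are illustrative choices, not the paper's notation; TransG's actual assignment rule additionally conditions on how well the existing components explain each triple.

```python
import numpy as np

def crp_assign(counts, alpha, rng):
    """Sample a semantic-component index under a Chinese Restaurant Process.

    counts: triples currently assigned to each existing component of a relation.
    alpha:  concentration parameter; larger values open new components
            more readily (illustrative name, not from the paper).
    """
    total = sum(counts) + alpha
    # Existing component m is picked with probability counts[m] / total;
    # a brand-new component with probability alpha / total.
    probs = np.array(counts + [alpha], dtype=float) / total
    return rng.choice(len(probs), p=probs)

rng = np.random.default_rng(0)
counts = [12, 3]                  # a relation with two discovered semantics
m = crp_assign(counts, alpha=1.0, rng=rng)
if m == len(counts):
    counts.append(1)              # open a new semantic component
else:
    counts[m] += 1
```

Because the probability of a new component never reaches zero, the number of semantics per relation grows with the data rather than being fixed in advance.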
The generative process places a normal prior on each entity embedding vector. To form a triple, a semantic component of the relation is drawn via the CRP, and head and tail entity vectors are then drawn from their priors such that the chosen component's translation vector approximately maps the head embedding onto the tail embedding.
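The following is a hedged sketch, in the spirit of the paper, of the mixture-style plausibility score this process induces: each semantic component contributes a Gaussian-shaped term that peaks when the head embedding plus that component's translation lands near the tail embedding. The names `transg_score`, `pis`, and `sigma2` are assumptions for illustration, not the paper's notation.

```python
import numpy as np

def transg_score(h, t, components, pis, sigma2):
    """Mixture-style plausibility of a triple (h, r, t).

    h, t:       entity embeddings, shape (d,)
    components: shape (M, d); one translation vector per semantic component
    pis:        mixing weights of the M components (sum to 1)
    sigma2:     combined variance of the head and tail entity priors
    """
    residuals = h + components - t             # shape (M, d)
    energies = np.sum(residuals ** 2, axis=1)  # squared L2 norm per component
    # A component scores highly when h + u_{r,m} is close to t,
    # i.e. when that particular semantic of the relation "fits" the triple.
    return float(np.dot(pis, np.exp(-energies / sigma2)))

rng = np.random.default_rng(0)
h, t = rng.normal(size=5), rng.normal(size=5)
components = rng.normal(size=(3, 5))           # M = 3 semantic components
pis = np.array([0.5, 0.3, 0.2])
print(transg_score(h, t, components, pis, sigma2=1.0))
```

In effect, the best-fitting semantic component dominates the score for a given triple, which is what allows one relation to behave differently in different entity contexts.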
Results and Implications
The experimental results on the standard link prediction benchmarks WN18 and FB15K demonstrate the effectiveness of TransG. The model achieves notable improvements in HITS@10 accuracy over state-of-the-art baselines: the paper reports a 2.9% improvement over TransR on WN18 and a substantial 26.0% gain on FB15K. These results underscore TransG's advantage in capturing the intricate semantic nuances of relations.
Additionally, the model's performance on the triple classification task demonstrates its discriminative capability, especially for relations with multiple semantic interpretations, further validating the approach.
Conclusion and Future Directions
TransG offers a comprehensive solution to an understudied aspect of knowledge graph embedding—multiple relation semantics. By integrating a mixture model with the flexibility of Bayesian non-parametrics, it advances the ability to generate precise embeddings that dynamically accommodate the relational complexity inherent in knowledge bases.
The paper opens avenues for future research, particularly in exploring more sophisticated generative models that can further refine semantic clustering. The implications for deploying such models in AI tasks, including information retrieval and semantic web applications, are substantial. As the understanding of relation complexity deepens, TransG serves as a foundational model inspiring subsequent developments in the field. Researchers may consider extending this work by integrating TransG with neural architectures, potentially enhancing its scalability and applicability in wider AI contexts.