
Multilingual Knowledge Graph Embeddings for Cross-lingual Knowledge Alignment (1611.03954v3)

Published 12 Nov 2016 in cs.AI and cs.CL

Abstract: Many recent works have demonstrated the benefits of knowledge graph embeddings in completing monolingual knowledge graphs. Inasmuch as related knowledge bases are built in several different languages, achieving cross-lingual knowledge alignment will help people in constructing a coherent knowledge base, and assist machines in dealing with different expressions of entity relationships across diverse human languages. Unfortunately, achieving this highly desirable cross-lingual alignment by human labor is very costly and error-prone. Thus, we propose MTransE, a translation-based model for multilingual knowledge graph embeddings, to provide a simple and automated solution. By encoding entities and relations of each language in a separated embedding space, MTransE provides transitions for each embedding vector to its cross-lingual counterparts in other spaces, while preserving the functionalities of monolingual embeddings. We deploy three different techniques to represent cross-lingual transitions, namely axis calibration, translation vectors, and linear transformations, and derive five variants for MTransE using different loss functions. Our models can be trained on partially aligned graphs, where just a small portion of triples are aligned with their cross-lingual counterparts. The experiments on cross-lingual entity matching and triple-wise alignment verification show promising results, with some variants consistently outperforming others on different tasks. We also explore how MTransE preserves the key properties of its monolingual counterpart TransE.

Authors (4)
  1. Muhao Chen (159 papers)
  2. Yingtao Tian (32 papers)
  3. Mohan Yang (9 papers)
  4. Carlo Zaniolo (20 papers)
Citations (495)

Summary

Multilingual Knowledge Graph Embeddings for Cross-lingual Knowledge Alignment

In the domain of knowledge graphs, the challenge of achieving cross-lingual knowledge alignment is both critical and complex. The paper under review introduces MTransE, a model aimed at embedding multilingual knowledge graphs to automate this alignment process.

Overview and Objectives

The researchers propose MTransE, a translation-based model that encodes the entities and relations of each language in its own embedding space. The model learns transitions between these spaces while preserving the functionality of the monolingual embeddings. The ultimate aim is seamless integration of multilingual data into an enriched, coherent knowledge base.
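MTransE builds on TransE as its monolingual knowledge model. A minimal sketch of the standard TransE plausibility score, which treats a relation as a vector translation between head and tail entities (this is the well-known TransE formulation, not code from the paper):

```python
import numpy as np

def transe_score(h, r, t):
    """TransE plausibility score: lower means the triple (h, r, t)
    is more plausible, since a relation is modeled as a translation
    h + r ≈ t in the embedding space."""
    return np.linalg.norm(h + r - t, ord=2)

# Toy example: when t lies exactly at h + r, the score is zero.
h = np.array([0.1, 0.2, 0.3])
r = np.array([0.4, 0.0, -0.1])
t = h + r
```

During training, scores of observed triples are pushed down relative to corrupted (negative) triples.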

Methodology

The paper details three techniques to represent cross-lingual transitions:

  1. Axis Calibration: Adjusts the axes in embedding spaces to align equivalent entities and relations across languages.
  2. Translation Vectors: Utilizes vectors to represent translations between languages, treating them similarly to relational translations.
  3. Linear Transformations: Leverages transformation matrices to map entities and relations across languages.
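
Each technique corresponds to a different cross-lingual transition score between an embedding e in one language's space and its counterpart e' in another. A hedged sketch, assuming L2 distance; the function names are illustrative, not the authors' code:

```python
import numpy as np

def axis_calibration_score(e, e_prime):
    # Axis calibration: the spaces are calibrated so that equivalent
    # embeddings should coincide; the score is simply the distance
    # between cross-lingual counterparts.
    return np.linalg.norm(e - e_prime)

def translation_vector_score(e, v, e_prime):
    # Translation vectors: a language-pair-specific vector v carries
    # an embedding into the other space, treated analogously to a
    # relational translation in TransE.
    return np.linalg.norm(e + v - e_prime)

def linear_transformation_score(M, e, e_prime):
    # Linear transformation: a matrix M maps embeddings from one
    # language's space into the other's.
    return np.linalg.norm(M @ e - e_prime)
```

In each case, a lower score indicates a better cross-lingual match, and the same machinery applies to relation embeddings as well as entity embeddings.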

The authors implement five variants of MTransE, each with distinct loss functions, and the models are trained on partially aligned graphs. The capability of MTransE to work with sparse data sets highlights its robustness and flexibility.
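Training on partially aligned graphs can be sketched as an alignment loss that sums only over the seed-aligned pairs, combined with the monolingual knowledge loss; the function names and the weighting hyperparameter below are illustrative assumptions, not taken from the paper:

```python
def alignment_loss(aligned_pairs, score_fn):
    # Only entities/triples with a known cross-lingual counterpart
    # contribute; the rest of each graph is trained purely by its
    # monolingual knowledge model, so sparse seed alignments suffice.
    return sum(score_fn(e, e_prime) for e, e_prime in aligned_pairs)

def total_loss(knowledge_loss, aligned_pairs, score_fn, alpha=1.0):
    # Illustrative composition: monolingual knowledge loss plus an
    # alpha-weighted alignment term (alpha is a hypothetical
    # hyperparameter, not from the paper).
    return knowledge_loss + alpha * alignment_loss(aligned_pairs, score_fn)
```

This separation is what lets the model exploit a small aligned seed set while still learning well-formed embeddings for the unaligned majority of each graph.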

Results

The experiments are conducted on tasks such as cross-lingual entity matching and triple-wise alignment verification, utilizing partially aligned trilingual graphs from Wikipedia. Notably, the models exhibit varied performance, with linear transformation-based variants generally outperforming others. These techniques demonstrated superior accuracy in aligning triples and entities across languages, indicating their effectiveness in preserving structural consistency while mapping cross-lingual relations.

Implications

MTransE represents a significant advance in the automated alignment of multilingual knowledge graphs, with implications for applications such as question answering, semantic search, and the construction of more comprehensive global knowledge bases. By reducing reliance on manually crafted features and extensive human labor, the model promises greater efficiency and accuracy.

Future Directions

The paper suggests avenues for further research, such as integrating more sophisticated monolingual models with MTransE, enhancing cross-lingual completion tasks, and employing additional data sources like multilingual text corpora. Such developments could further extend the applicability and accuracy of multilingual embeddings.

In conclusion, the MTransE model provides a structured and effective approach to cross-lingual knowledge alignment, setting a foundation for future exploration and optimization within this domain.