MST5 -- Multilingual Question Answering over Knowledge Graphs (2407.06041v1)

Published 8 Jul 2024 in cs.CL

Abstract: Knowledge Graph Question Answering (KGQA) simplifies querying vast amounts of knowledge stored in a graph-based model using natural language. However, the research has largely concentrated on English, putting non-English speakers at a disadvantage. Meanwhile, existing multilingual KGQA systems face challenges in achieving performance comparable to English systems, highlighting the difficulty of generating SPARQL queries from diverse languages. In this research, we propose a simplified approach to enhance multilingual KGQA systems by incorporating linguistic context and entity information directly into the processing pipeline of a LLM. Unlike existing methods that rely on separate encoders for integrating auxiliary information, our strategy leverages a single, pretrained multilingual transformer-based LLM to manage both the primary input and the auxiliary data. Our methodology significantly improves the LLM's ability to accurately convert a natural language query into a relevant SPARQL query. It demonstrates promising results on the most recent QALD datasets, namely QALD-9-Plus and QALD-10. Furthermore, we introduce and evaluate our approach on Chinese and Japanese, thereby expanding the language diversity of the existing datasets.

Authors (6)

Nikit Srivastava (2 papers)
Mengshi Ma (1 paper)
Daniel Vollmers (2 papers)
Hamada Zahera (2 papers)
Diego Moussallem (23 papers)
Axel-Cyrille Ngonga Ngomo (63 papers)

Summary

We haven't generated a summary for this paper yet.

Summarize Now

MST5 -- Multilingual Question Answering over Knowledge Graphs (2407.06041v1)

Summary

Related Papers