- The paper introduces a novel Knowledge-Aware Retrieval (KAR) framework that leverages KG relations to enhance query expansion.
- It employs document-based relation filtering to align LLM-generated query expansions with both textual content and relational structure.
- Experimental results on AMAZON, MAG, and PRIME datasets demonstrate significant improvements in hit rates and retrieval accuracy.
Knowledge-Aware Query Expansion with LLMs for Textual and Relational Retrieval
The paper addresses the limitations of current query expansion techniques in information retrieval, especially their focus on enhancing textual similarity while neglecting document relational structures. The authors propose a novel framework named Knowledge-Aware Retrieval (KAR), which utilizes LLMs augmented with knowledge graph (KG) relations to improve performance for queries with both textual and relational elements.
Main Contributions
- Knowledge-Aware Query Expansion Framework: The framework incorporates structured document relationships derived from KGs, addressing the gap in handling semi-structured queries with both textual and relational aspects.
- Document-Based Relation Filtering: By using document texts as KG node representations, the method filters relations based on textual similarity, allowing for more targeted query expansion and retrieval.
- Extensive Evaluation: Experiments on three diverse datasets (AMAZON, MAG, and PRIME) showcase the superiority of KAR over state-of-the-art baselines in terms of textual and relational retrieval.
Methodology
The paper introduces an innovative approach to query expansion that begins with parsing entities from the query using an LLM and retrieving related document texts and KG nodes. It employs document-based relation filtering to refine KG relations relevant to the query, enhancing the precision of the retrieval process. By leveraging these filtered document triples as inputs, LLM-generated query expansions are more effectively aligned with user intent and document corpus structure.
Results and Implications
The experimental results demonstrate that KAR shows significant improvements in hit rates and mean reciprocal rank across all tested datasets, with particularly strong performance in domains with denser relational structures such as MAG and PRIME.
- AMAZON: Rich in textual information, leading to competitive performance among methods, yet KAR still achieves notable results.
- MAG and PRIME: KAR utilizes the dense relational data effectively, outperforming others by maintaining structural accuracy in query expansion.
The paper highlights the potential of KG-augmented query expansion frameworks in improving retrieval systems, particularly in scenarios demanding nuanced understanding of both textual content and relational context.
Future Directions
The authors suggest further exploration into optimizing KG-enhanced LLMs for different query types and corpora, aiming for broader applicability across diverse domains. Additionally, investigating efficient integration methods to reduce retrieval latency remains an area for potential improvement.
Conclusion
This paper provides a compelling approach to overcoming existing challenges in retrieval systems by integrating relational structures into LLM-based query expansions. The KAR framework not only demonstrates its efficacy in controlled settings but also suggests broader applications for multi-faceted search tasks, making a noteworthy contribution to the field of information retrieval.