GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment via Neighborhood Partitioning and Generative Subgraph Encoding (2402.06764v3)

Published 9 Feb 2024 in cs.AI

Abstract: Integrating LLMs with knowledge graphs derived from domain-specific data represents an important advancement towards more powerful and factual reasoning. As these models grow more capable, it is crucial to enable them to perform multi-step inferences over real-world knowledge graphs while minimizing hallucination. While LLMs excel at conversation and text generation, their ability to reason over domain-specialized graphs of interconnected entities remains limited. For example, can we query an LLM to identify the optimal contact in a professional network for a specific goal, based on relationships and attributes in a private database? The answer is no: such capabilities lie beyond current methods. However, this question underscores a critical technical gap that must be addressed. Many high-value applications in areas such as science, security, and e-commerce rely on proprietary knowledge graphs encoding unique structures, relationships, and logical constraints. We introduce a fine-tuning framework for developing Graph-aligned LLMs (GLaM) that transforms a knowledge graph into an alternate text representation with labeled question-answer pairs. We demonstrate that grounding the models in specific graph-based knowledge expands the models' capacity for structure-based reasoning. Our methodology leverages the LLM's generative capabilities to create the dataset and offers an efficient alternative to retrieval-augmented generation style methods.
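To make the abstract's pipeline concrete, here is a minimal sketch of the general idea: partition a knowledge graph into entity-centered neighborhoods, serialize each subgraph as text, and emit labeled question-answer pairs suitable for supervised fine-tuning. The triple format, question templates, and output field names are illustrative assumptions, not the paper's actual encoding or dataset schema.

```python
# Hypothetical sketch of neighborhood partitioning + subgraph-to-text encoding
# for building fine-tuning data; formats below are assumptions, not GLaM's exact method.

import json
from collections import defaultdict

# Toy knowledge graph as (head, relation, tail) triples.
triples = [
    ("Alice", "works_at", "LabX"),
    ("Bob", "works_at", "LabX"),
    ("Alice", "coauthored_with", "Bob"),
    ("Bob", "expert_in", "graph neural networks"),
]

# Neighborhood partitioning: group triples by the entity they touch (1-hop).
neighborhoods = defaultdict(list)
for h, r, t in triples:
    neighborhoods[h].append((h, r, t))
    neighborhoods[t].append((h, r, t))

def encode_subgraph(entity, edges):
    """Serialize an entity's neighborhood subgraph as plain text."""
    facts = [f"{h} {r.replace('_', ' ')} {t}." for h, r, t in edges]
    return f"Facts about {entity}: " + " ".join(facts)

# Generate labeled QA pairs grounded in each neighborhood (fine-tuning examples).
examples = []
for entity, edges in neighborhoods.items():
    context = encode_subgraph(entity, edges)
    for h, r, t in edges:
        examples.append({
            "instruction": f"{context}\nQuestion: What is the '{r}' of {h}?",
            "response": t,
        })

# Write instruction-tuning data in JSON-lines form for a standard SFT trainer.
with open("glam_sft_data.jsonl", "w") as f:
    for ex in examples:
        f.write(json.dumps(ex) + "\n")

print(f"Wrote {len(examples)} fine-tuning examples.")
```

In the paper's framework, the question-answer pairs are generated with an LLM rather than fixed templates, which is what lets the resulting dataset cover multi-step, structure-based queries over the graph.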

Authors (5)
  1. Stefan Dernbach (5 papers)
  2. Khushbu Agarwal (13 papers)
  3. Alejandro Zuniga (1 paper)
  4. Michael Henry (1 paper)
  5. Sutanay Choudhury (36 papers)
Citations (4)