Papers
Topics
Authors
Recent
Search
2000 character limit reached

Leveraging LLMs and Heterogeneous Knowledge Graphs for Persona-Driven Session-Based Recommendation

Published 8 Apr 2026 in cs.IR | (2604.06928v2)

Abstract: Session-based recommendation systems (SBRS) aim to capture user's short-term intent from interaction sequences. However, the common assumption of anonymous sessions limits personalization, particularly under sparse or cold-start conditions. Recent advances in LLM augmented recommendation have shown that LLMs can generate rich item representations, but modeling user personas with LLMs remains challenging due to anonymous sessions. In this work, we propose a persona driven SBRS framework that explicitly models latent user personas inferred from a heterogeneous knowledge graph (KG) and integrates them into a data-driven SBRS. Our framework adopts a two-stage architecture consisting of personalized information extraction and personalized information utilization. In the personalized information extraction stage, we construct a heterogeneous KG that integrates time-independent user-item interactions, item-item relations, item-feature associations, and external metadata from DBpedia. We then learn latent user personas in an unsupervised manner using a Heterogeneous Deep Graph Infomax (HDGI) objective over a KG initialized with LLM-derived item embeddings. In the personalized information utilization stage, the learned persona representations together with LLM-derived item embeddings are incorporated into a modified architecture of data-driven SBRS to generate a candidate set of relevant items, followed by reranking using the base sequential model to emphasize short-term session intent. Unlike prior approaches that rely solely on sequence modeling or text-based user representations, our method grounds user persona modeling in structured relational signals derived from a heterogeneous KG. Experiments on Amazon Books and Amazon Movies & TV demonstrate that our approach consistently improves over sequential models with user embeddings derived using session history.

Summary

  • The paper introduces a novel framework that integrates LLM-derived item embeddings and heterogeneous knowledge graphs to learn stable user personas for improved session recommendations.
  • The paper shows that KG-grounded persona embeddings significantly boost candidate recall by leveraging structured multi-relational data in sparse and cold-start scenarios.
  • The paper validates the approach on Amazon datasets, demonstrating enhanced hit rates and ranking quality when combining long-term persona signals with session-specific models.

Persona-Driven Session-Based Recommendation via LLMs and Heterogeneous Knowledge Graphs

Motivation and Contributions

Session-based recommender systems (SBRS) are optimized for capturing transient user intent from short sequences of item interactions. However, these systems frequently assume session anonymity, which severely constrains their ability to deliver robust personalization—particularly in sparse and cold-start regimes where long-term behavioral data is unavailable or unreliable. Existing SBRS approaches largely rely on sequential modeling or input-side augmentation, inadequately separating enduring user characteristics from ephemeral preferences.

This paper presents a framework that addresses this gap by integrating heterogeneous knowledge graphs (KGs) and LLMs for user modeling. The central claim is that stable user personas—representing domain-specific affinities, stylistic or topical interests—can be inferred from the multi-relational structure of a KG which aggregates item interactions, item-attribute associations, and external metadata (e.g., DBpedia). The KG is initialized with LLM-derived item embeddings to encode fine-grained semantic signals, and user personas are learned via Heterogeneous Deep Graph Infomax (HDGI), enabling unsupervised capture of high-order relational semantics.

A two-stage architecture is proposed: (i) Personalized Information Extraction constructs the KG and learns persona embeddings; (ii) Personalized Information Utilization incorporates these persona representations, together with semantic item embeddings, into a retrieval architecture, followed by reranking with a base sequential model to emphasize short-term session-specific intent. Figure 1

Figure 1: Overview of the persona-driven session-based recommendation framework leveraging the heterogeneous knowledge graph as user persona backbone.

Framework Overview

The pipeline begins with KG construction, integrating user-item interactions, item-to-item structural relations, item-feature links, and semantic metadata. Item and attribute nodes are encoded using LLMs (Qwen-family), while user embeddings are initialized randomly and refined through HDGI contrastive learning. The heterogeneous KG thus serves as the substrate for unsupervised persona inference, grounded in multi-hop relational context and textual semantics.

Personalized Information Extraction operates as follows:

  • KG Construction: The Amazon-KG dataset is used for graph instantiation, enriching the item interaction data with DBpedia-derived attributes. Entity types and relation types are dataset-specific, supporting complex semantic and structural connectivity. Figure 2

    Figure 2: Construction of the heterogeneous knowledge graph, aggregating multi-typed entities and interrelations for structured representation learning.

  • LLM Embedding Initialization: Item nodes and attribute nodes are encoded with an LLM to inject semantic information, aligning disjoint node and edge types into a unified embedding space.
  • HDGI Pretraining: User personas are learned by maximizing mutual information between user node representations and the global graph summary. Meta-path-based aggregation and attention fusion capture heterogeneous semantics, while the infomax objective enforces alignment of local and global feature distributions.

Personalized Information Utilization integrates user persona embeddings with session context into a data-driven retriever, followed by reranking with SASRec to model session-specific intent. This hybrid architecture ensures that candidate retrieval is guided by stable, KG-grounded preferences while final item ranking emphasizes recent behavioral relevance.

Experimental Results

Experiments were conducted on Amazon Books and Amazon Movies datasets. The system consistently outperformed sequential baselines—even in highly sparse settings—when KG-derived persona embeddings were injected into the retrieval module. The observed improvement in Hit Rate (HR@100) demonstrates that the KG architecture enables retrieval of more relevant items, especially long-tail candidates connected via structured relations.

Notably, KG-based user preference embeddings drove substantial gains in candidate recall, although the impact on top-rank precision (HR@10) was attenuated in the high-sparsity Books dataset. This differentiation highlights how KG-grounded representations improve overall recall capability by leveraging structured knowledge, even if short-term session signals are weak.

After candidate generation, reranking with SASRec improved ranking quality (MRR, NDCG) by accurately modeling session-level dependencies, confirming the thesis that integration of stable persona signals with sequential modeling yields optimal performance across multiple recommendation metrics.

Theoretical and Practical Implications

The proposed KG-enhanced persona modeling addresses several limitations of prior SBRS systems:

  • Interpretability: KG-grounded personas are structurally interpretable and transferable, contrasting with opaque embeddings from purely sequential models or LLM-only representations.
  • Scalability: The method sidesteps the computational bottleneck of generating user-side representations exclusively with LLMs, leveraging unsupervised HDGI for efficient persona induction.
  • Relational Grounding: Structured multi-relational links allow the model to generalize across cold-start scenarios and sparsity regimes, supporting robust retrieval of long-tail items.

The approach signals a shift from sequence-only SBRS toward unified architectures that intertwine structured knowledge and semantic representation learning, fundamentally increasing the robustness and personalization of modern recommender systems.

Future Directions

Potential extensions include:

  • Dynamic Persona Evolution: Modeling temporal dynamics in KG-derived personas to reflect preference drift.
  • Contextual KG Integration: Adapting KG construction to incorporate context-dependent attributes or external event signals.
  • Multi-domain Generalization: Leveraging shared entity spaces and relations for cross-domain recommendation transfer.
  • KG-Augmented CoT Reasoning: Using KG-augmented prompts to support chain-of-thought LLM reasoning for enhanced transparency and natural language explanations.

Conclusion

The paper introduces a hybrid approach for session-based recommendation, integrating heterogeneous knowledge graphs and LLM-derived item semantics to infer and utilize explicit user personas. KG-grounded personas consistently yield improved candidate retrieval and recommendation quality, particularly in sparse or anonymous session contexts. By bridging structured relational knowledge with sequence modeling, the framework advances the state-of-the-art for interpretable, scalable, and effective personalization (2604.06928).

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.