PRISM: Purified Representation and Integrated Semantic Modeling for Generative Sequential Recommendation

Published 23 Jan 2026 in cs.IR, cs.AI, and cs.LG | (2601.16556v1)

Abstract: Generative Sequential Recommendation (GSR) has emerged as a promising paradigm, reframing recommendation as an autoregressive sequence generation task over discrete Semantic IDs (SIDs), typically derived via codebook-based quantization. Despite its great potential in unifying retrieval and ranking, existing GSR frameworks still face two critical limitations: (1) impure and unstable semantic tokenization, where quantization methods struggle with interaction noise and codebook collapse, resulting in SIDs with ambiguous discrimination; and (2) lossy and weakly structured generation, where reliance solely on coarse-grained discrete tokens inevitably introduces information loss and neglects items' hierarchical logic. To address these issues, we propose a novel generative recommendation framework, PRISM, with Purified Representation and Integrated Semantic Modeling. Specifically, to ensure high-quality tokenization, we design a Purified Semantic Quantizer that constructs a robust codebook via adaptive collaborative denoising and hierarchical semantic anchoring mechanisms. To compensate for information loss during quantization, we further propose an Integrated Semantic Recommender, which incorporates a dynamic semantic integration mechanism to integrate fine-grained semantics and enforces logical validity through a semantic structure alignment objective. PRISM consistently outperforms state-of-the-art baselines across four real-world datasets, demonstrating substantial performance gains, particularly in high-sparsity scenarios.

Abstract PDF Upgrade to Chat

Summary

The paper introduces PRISM, which integrates a Purified Semantic Quantizer and an Integrated Semantic Recommender to address codebook collapse and lossy representations.
It employs adaptive collaborative denoising and hierarchical semantic anchoring to build robust, noise-tolerant codebooks while preserving detailed semantic nuances.
Empirical results on Amazon datasets demonstrate superior Recall@K, NDCG@K, and efficiency compared to existing generative recommendation models.

Purified Representation and Integrated Semantic Modeling for GSR

Introduction and Motivation

Recent advances in Generative Sequential Recommendation (GSR) have shifted the classical "retrieve-and-rank" framework to one that leverages autoregressive generation over discrete item representations ("semantic IDs" or SIDs), drawing inspiration from progress in LLMs. This formulation allows for robust knowledge sharing and improved semantic reasoning by capturing item relations with hierarchical discrete codes. However, significant obstacles remain for lightweight generative frameworks:

Impure and unstable semantic tokenization introduces codebook collapse, where most items are bucketed into a small range of codes, causing ambiguous representations and poor discrimination.
Lossy and weakly structured generation arises from exclusive reliance on quantized SIDs, severely limiting fine-grained nuance and disrupting the logical consistency of generated recommendations.
Figure 1: Two fundamental challenges in existing GSR: (a) codebook collapse, resulting in indistinguishable item codes; (b) information loss due to coarse discrete SIDs.

PRISM Framework Overview

To address these challenges, PRISM (Purified Representation and Integrated Semantic Modeling) establishes a two-stage generative recommendation architecture:

The Purified Semantic Quantizer (PSQ) constructs robust, noise-tolerant codebooks via adaptive denoising and hierarchical semantic anchoring.
The Integrated Semantic Recommender (ISR) dynamically fuses continuous fine-grained semantic features with SIDs during generation, while enforcing logical structural alignment.
Figure 2: The PRISM framework, combining robust codebook construction with enhanced generative modeling through semantic fusion and structure alignment.

Technical Contributions

Purified Semantic Quantizer

The PSQ innovatively tackles item representation degradation by combining:

Adaptive Collaborative Denoising (ACD): A gating mechanism selectively retains reliable collaborative signals (e.g., high-frequency interactions) and suppresses noise, supervised by interaction frequency statistics for optimization stability.
Hierarchical Semantic Anchoring (HSA): Category tags guide residual quantization with multi-level semantic priors, ensuring each token level in the codebook aligns with a specific semantic granularity and enforcing hierarchical structure.
Dual-Head Reconstruction (DHR): Parallel reconstruction objectives for content and collaborative modalities counteract gradient imbalance, ensuring both semantic facets are preserved in the quantized space.

Codebook collisions are globally deduplicated post hoc via a discrete optimal transport formulation using Sinkhorn-Knopp, thus ensuring a bijective item-to-SID mapping without semantic degradation.

Integrated Semantic Recommender

The ISR module is responsible for restoring fine-grained details lost by quantization and ensuring semantic-valid generation:

Dynamic Semantic Integration (DSI): Implements a Mixture-of-Experts (MoE) router to flexibly combine context-enriched SID embedding with content/collaborative features and depth-specific projections for each position in the SID hierarchy.
Semantic Structure Alignment (SSA): Imparts a structural regularizer by obligating the decoder's hidden states to regress not only the discrete codebook latent ( $\mathbf{q}_l^{(tgt)}$ ) but also hierarchical item tags, ensuring both category and fine structure are retained in autoregressive predictions.
Adaptive Temperature Scaling (ATS): Generation temperature is modulated according to Trie-branching density at each position, thus balancing exploration and confidence depending on the semantic granularity and reducing prediction ambiguity in dense branches.

Empirical Results

Overall Effectiveness

Across four Amazon datasets (Beauty, Sports, Toys, CDs), PRISM sets new state-of-the-art results in both Recall@K and NDCG@K, outperforming both traditional discriminative and other generative models (notably TIGER, EAGER, LETTER, ActionPiece), particularly with pronounced gains in highly sparse regimes and on challenging datasets like CDs.

Robustness to Sparsity

When segmenting by item popularity, PRISM demonstrates superior robustness on "long-tail" (low-frequency) items. Competing methods such as TIGER cannot maintain unique SID assignments in these sparse regions due to codebook collapse, while ActionPiece's advantage is primarily attributable to model width rather than improved semantic discrimination. PRISM achieves a strict Pareto improvement, preserving high accuracy for both popular and long-tail items.

Figure 3: PRISM achieves substantial gains across item popularity groups, especially under sparsity.

SID Discretization Quality

Ablations highlight the importance of each component in SID quality:

Full PRISM achieves a codebook perplexity of 248.5/256 and minimizes collision rate to 1.79%, far superior to TIGER (PPL 84.2, CR 31.57%) and other baselines.
Removing HSA, ACD, or DHR increases code collisions and decreases uniform codebook usage, confirming the necessity of coordinated denoising, semantic anchoring, and balanced reconstruction.

Latent Space Visualization

t-SNE projections of the codebook and item embeddings confirm PRISM's ability to establish well-separated, hierarchically ordered codebook spaces and produce item embeddings clustered according to semantic categories. This contrasts sharply with the collapsed, entangled structures exhibited by baseline models.

Figure 4: t-SNE visualization—PRISM's codebooks exhibit concentric hierarchical structure, unlike TIGER's collapsed embeddings.

Figure 5: t-SNE visualization—item embeddings from PRISM align with distinct semantic categories, demonstrating strong discriminative capability.

Efficiency Analysis

Efficiency benchmarks indicate PRISM maintains minimal activated parameter count (~5.5M) and low inference latency (<30ms on largest datasets), in contrast to competitors requiring 3–4 $\times$ larger backbones to approach similar accuracy, particularly in high-sparsity settings.

Figure 6: PRISM achieves the best efficiency-performance trade-off with low inference latency and compact model size.

Theoretical and Practical Implications

PRISM demonstrates that robust GSR does not require brute-force scaling but arises from:

Fine-grained purification of heterogeneous signals through adaptive noise filtering.
Tight semantic guidance via hierarchical anchoring in codebook learning.
Dynamic feature integration, supplementing lossy codes with context-aware semantics during generation.
Precise structural alignment to guarantee logical consistency in autoregressive recommendation.

These results suggest a path forward for scalable, efficient, and interpretable GSR even under extreme catalog sparsity, and provide a template for further integration of symbolic priors and structure-aware objectives in generative recommendation paradigms.

Conclusion

PRISM (Purified Representation and Integrated Semantic Modeling) offers a comprehensive solution to long-standing limitations in lightweight generative sequential recommendation. By enforcing semantic purity and structural robustness at every layer of the architecture, it establishes new benchmarks in both recommendation quality and efficiency, with compelling evidence of scalability and discriminative capacity even in data-scarce environments. Future developments may explore further generalization of purified codebook learning, reinforcement-based structure alignment, and extension to multi-modal or interactive recommendation settings.

Markdown

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Paper Prompts

Top Community Prompts

Explain it Like I'm 14

off on

Knowledge Gaps

off on

Glossary

off on

Practical Applications

off on

Conceptual Simplification

off on

Open Problems

We found no open problems mentioned in this paper.

PRISM: Purified Representation and Integrated Semantic Modeling for Generative Sequential Recommendation

Summary

Purified Representation and Integrated Semantic Modeling for GSR

Introduction and Motivation

PRISM Framework Overview

Technical Contributions

Purified Semantic Quantizer

Integrated Semantic Recommender

Empirical Results

Overall Effectiveness

Robustness to Sparsity

SID Discretization Quality

Latent Space Visualization

Efficiency Analysis

Theoretical and Practical Implications

Conclusion

Paper to Video (Beta)

Whiteboard

Paper Prompts

Top Community Prompts

Open Problems

Continue Learning

Authors (5)

Collections

Tweets

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research

PRISM: Purified Representation and Integrated Semantic Modeling for Generative Sequential Recommendation

Summary

Purified Representation and Integrated Semantic Modeling for GSR

Introduction and Motivation

PRISM Framework Overview

Technical Contributions

Purified Semantic Quantizer

Integrated Semantic Recommender

Empirical Results

Overall Effectiveness

Robustness to Sparsity

SID Discretization Quality

Latent Space Visualization

Efficiency Analysis

Theoretical and Practical Implications

Conclusion

Paper to Video (Beta)

Whiteboard

Paper Prompts

Top Community Prompts

Open Problems

Continue Learning

Related Papers

Authors (5)

Collections

Tweets

Don't miss out on important new AI/ML research

Sign up for free to explore the frontiers of research