Papers
Topics
Authors
Recent
Search
2000 character limit reached

ICDM: Debiasing ID-based CF Models

Updated 18 April 2026
  • ICDM is a module that disentangles true collaborative signals from popularity and conformity biases in ID-based CF models.
  • It employs similarity and orthogonality losses to separate debiased representations from biased factors for improved alignment.
  • Integrated in the DAS framework, ICDM leverages multi-view contrastive learning to robustly align collaborative and semantic signals.

The ID-based Collaborative Filtering Debias Module (ICDM) is a key component in modern large-scale recommender systems that addresses entrenched biases in ID-based collaborative filtering (CF) representations. Implemented within the single-stage Dual-Aligned Semantic IDs (DAS) framework, ICDM is designed to disentangle signal and bias in user and item representations, producing debiased embeddings for contrastive alignment with semantic code-based identifiers, thereby improving downstream recommendation performance and preserving semantic integrity (Ye et al., 14 Aug 2025).

1. Purpose and Placement of ICDM

ICDM targets two pervasive forms of bias in CF models: item popularity bias (a small subset of items dominating clicks) and user conformity bias (users disproportionately clicking on popular items). These biases distort learned embeddings and, without correction, can pollute the alignment with semantic identifiers derived from multi-modal LLM (MLLM) features. ICDM operates between the User & Item Semantic Model (which outputs quantized, codebook-based semantic IDs zu,ziz_u, z_i) and the downstream Multi-view Dual-Aligned Mechanism (MDAM), which aligns collaborative and semantic signals.

The module splits each user or item ID-based representation into a “true” (signal) factor and a “bias” factor, enforces orthogonality and similarity constraints for disentanglement, and utilizes only the debiased factors in the downstream alignment. This ensures that the semantic code space benefits from collaborative cues without inheriting distortion from CF biases.

2. Formal Objectives and Loss Functions

Let U,IRdCFU, I \in \mathbb{R}^{d_{CF}} denote raw CF embeddings for users and items. Four multi-layer perceptron (MLP) encoders decompose these:

  • User-Interest (debiased): cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h
  • User-Conformity (bias): cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h
  • Item-Content (debiased): cipro=fpro(I)Rhc_i^{pro} = f_{pro}(I) \in \mathbb{R}^h
  • Item-Popularity (bias): cipop=fpop(I)Rhc_i^{pop} = f_{pop}(I) \in \mathbb{R}^h

Biased representations are constructed as cu=MLP([  cuint,cucon  ])c_u = \text{MLP}([\;c_u^{int},\,c_u^{con}\;]) and ci=MLP([  cipro,cipop  ])c_i = \text{MLP}([\;c_i^{pro},\,c_i^{pop}\;]).

Disentangling Losses:

Two constraints are enforced:

  1. Similarity Penalty

Lsim=b=1B[(1cos(cubcon,cubint))+(1cos(cibpop,cibpro))]\mathcal{L}_{sim} = \sum_{b=1}^B \left[ (1 - \cos(c_{u_b}^{con}, c_{u_b}^{int})) + (1 - \cos(c_{i_b}^{pop}, c_{i_b}^{pro})) \right]

  1. Orthogonality Penalty

Lorth=b=1B[cos2(cubcon,cubint)+cos2(cibpop,cibpro)]\mathcal{L}_{orth} = \sum_{b=1}^B \left[ \cos^2(c_{u_b}^{con}, c_{u_b}^{int}) + \cos^2(c_{i_b}^{pop}, c_{i_b}^{pro}) \right]

The combined disentangling loss:

U,IRdCFU, I \in \mathbb{R}^{d_{CF}}0

Supervised CF Objectives:

  • Biased CF Loss:

U,IRdCFU, I \in \mathbb{R}^{d_{CF}}1

  • Unbiased CF Loss:

U,IRdCFU, I \in \mathbb{R}^{d_{CF}}2

The total ICDM loss:

U,IRdCFU, I \in \mathbb{R}^{d_{CF}}3

with U,IRdCFU, I \in \mathbb{R}^{d_{CF}}4 controlling the tradeoff.

3. Multi-view Contrastive Alignment and Mutual Information Framework

ICDM supplies debiased CF signals that are contrastively aligned with Semantic IDs across six cross-views using InfoNCE objectives, which lower-bound mutual information between respective pairs:

  1. Dual User-to-Item:
    • U,IRdCFU, I \in \mathbb{R}^{d_{CF}}5: U,IRdCFU, I \in \mathbb{R}^{d_{CF}}6
    • U,IRdCFU, I \in \mathbb{R}^{d_{CF}}7: U,IRdCFU, I \in \mathbb{R}^{d_{CF}}8
  2. Dual User-to-User / Item-to-Item:
    • U,IRdCFU, I \in \mathbb{R}^{d_{CF}}9: cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h0
    • cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h1: cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h2
  3. Dual Co-occurrence:
    • User co-click cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h3: cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h4
    • Item co-click cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h5: cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h6

Total alignment loss:

cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h7

Each term encourages mutual information maximization between semantic and collaborative representations, facilitating more robust multimodal recommendation.

4. Joint Training Objective and Unified Optimization

ICDM operates within a single-stage end-to-end system. The complete loss integrates semantic quantization, ICDM, and multi-view alignment:

cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h8

with hyperparameters cuint=fint(U)Rhc_u^{int} = f_{int}(U) \in \mathbb{R}^h9 tuning the relative importance of each sub-module.

All modules—including RQ-VAE codebooks for semantic vector quantization, four MLPs for factor disentanglement, and the contrastive aligner—are optimized jointly via AdamW. The one-stage nature of DAS avoids information loss and inflexibility associated with prior two-stage approaches.

5. Implementation Workflow

A high-level pseudocode outline for a training iteration is as follows:

cipro=fpro(I)Rhc_i^{pro} = f_{pro}(I) \in \mathbb{R}^h4

6. Hyperparameter Scheme, Ablations, and Empirical Findings

Key Hyperparameters

Module Value(s) Notes
Semantic quantization cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h0, cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h1 3 codebooks, 512 codes/level, dim=32
ICDM: cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h2, cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h3 cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h4, cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h5 Disentangle term, UISM-ICDM loss balance
Alignment: cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h6, cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h7 cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h8, cucon=fcon(U)Rhc_u^{con} = f_{con}(U) \in \mathbb{R}^h9 InfoNCE temperature, co-occ. bank cipro=fpro(I)Rhc_i^{pro} = f_{pro}(I) \in \mathbb{R}^h0
Optimization AdamW, lr cipro=fpro(I)Rhc_i^{pro} = f_{pro}(I) \in \mathbb{R}^h1 batch cipro=fpro(I)Rhc_i^{pro} = f_{pro}(I) \in \mathbb{R}^h2, 16cipro=fpro(I)Rhc_i^{pro} = f_{pro}(I) \in \mathbb{R}^h3A100 GPUs

Offline Ablations

Incremental ablations highlight that inclusion of ICDM improves AUC from 0.8048 to 0.8050 and GAUC from 0.7433 to 0.7434. Removal of any dual-u2i, dual-i2i/u2u, or co-occurrence terms each reduces AUC by 0.4–0.6 ‰, indicating the necessity of multi-view contrastive mechanisms for optimal information integration.

Online Performance

Deployment results show a +3.48% overall increase in eCPM, with +8.98% in cold-start user segments. This suggests effective transfer of collaborative signals to semantic ID space and improved generalizability in data-sparse regimes.

7. Significance and Role within Industrial Recommender Systems

ICDM functions as a debiasing intermediary that ensures collaborative signals can be contrastively integrated with semantic codes free from the confounding effects of popularity and conformity. In one-stage frameworks such as DAS, this enables simultaneous optimization of semantic quantization, collaborative filtering, and contrastive alignment. The design permits maximization of mutual information, minimizes information loss, and allows fine-grained control over the transfer of collaborative cues (Ye et al., 14 Aug 2025). The result is more robust, accurate, and generalizable recommender systems, as validated in extensive industrial deployments.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to ID-based CF Debias Model (ICDM).