Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Mitigating Spurious Correlations for Self-supervised Recommendation (2212.04282v2)

Published 8 Dec 2022 in cs.IR

Abstract: Recent years have witnessed the great success of self-supervised learning (SSL) in recommendation systems. However, SSL recommender models are likely to suffer from spurious correlations, leading to poor generalization. To mitigate spurious correlations, existing work usually pursues ID-based SSL recommendation or utilizes feature engineering to identify spurious features. Nevertheless, ID-based SSL approaches sacrifice the positive impact of invariant features, while feature engineering methods require high-cost human labeling. To address the problems, we aim to automatically mitigate the effect of spurious correlations. This objective requires to 1) automatically mask spurious features without supervision, and 2) block the negative effect transmission from spurious features to other features during SSL. To handle the two challenges, we propose an invariant feature learning framework, which first divides user-item interactions into multiple environments with distribution shifts and then learns a feature mask mechanism to capture invariant features across environments. Based on the mask mechanism, we can remove the spurious features for robust predictions and block the negative effect transmission via mask-guided feature augmentation. Extensive experiments on two datasets demonstrate the effectiveness of the proposed framework in mitigating spurious correlations and improving the generalization abilities of SSL models. The code is available at https://github.com/Linxyhaha/IFL.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (46)
  1. \bibcommenthead
  2. Self-supervised graph learning for recommendation. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM; 2021. p. 726–735.
  3. Self-supervised learning for large-scale item recommendations. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management. ACM; 2021. p. 4321–4330.
  4. S3-rec: Self-supervised learning for sequential recommendation with mutual information maximization. In: Proceedings of the 29th ACM International Conference on Information & Knowledge Management. ACM; 2020. p. 1893–1902.
  5. Contrastive learning for cold-start recommendation. In: Proceedings of the 29th ACM International Conference on Multimedia. ACM; 2021. p. 5382–5390.
  6. Intent Disentanglement and Feature Self-supervision for Novel Recommendation. TKDE. 2022;.
  7. Self-supervised hypergraph convolutional networks for session-based recommendation. In: Proceedings of the AAAI conference on artificial intelligence. 5. AAAI Press; 2021. p. 4503–4511.
  8. Pearl J. Causality. Cambridge University Press; 2009.
  9. Towards Accountable AI: Hybrid Human-Machine Analyses for Characterizing System Failure. In: Chen Y, Kazai G, editors. Proceedings of the AAAI Conference on Human Computation and Crowdsourcing. AAAI Press; 2018. p. 126–135.
  10. Slice Finder: Automated Data Slicing for Model Validation. In: 2019 IEEE 35th International Conference on Data Engineering. IEEE; 2019. p. 1550–1553.
  11. Adaptive factorization network: Learning adaptive-order feature interactions. In: Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press; 2020. p. 3609–3616.
  12. Autofis: Automatic feature interaction selection in factorization models for click-through rate prediction. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM; 2020. p. 2636–2645.
  13. Data Augmentation and Deep Neuro-Fuzzy Network for Student Performance Prediction with MapReduce Framework. MIR. 2021;18(6):981–992.
  14. Toward Causal Representation Learning. Proceedings of the IEEE. 2021;109:612–634.
  15. BPR: Bayesian Personalized Ranking from Implicit Feedback. In: Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence; 2009. p. 452–461.
  16. Feature selection using stochastic gates. In: Proceedings of the 37th International Conference on Machine Learning. PMLR; 2020. p. 10648–10659.
  17. Heterogeneous risk minimization. In: Proceedings of the 38th International Conference on Machine Learning. PMLR; 2021. p. 6804–6814.
  18. Koyama M, Yamaguchi S. Out-of-distribution generalization with maximal invariant predictor; 2020. Available from: https://arxiv.org/abs/2008.01883.
  19. Invariant risk minimization. arXiv; 2019. Available from: https://arxiv.org/abs/1907.02893.
  20. Neural Graph Collaborative Filtering. In: Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM; 2019. p. 165–174.
  21. Rendle S. Factorization Machines. In: 2010 f International Conference on Data Mining. IEEE; 2010. p. 995–1000.
  22. He X, Chua TS. Neural Factorization Machines for Sparse Predictive Analytics. In: Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM; 2017. p. 355–364.
  23. DeepFM: A Factorization-Machine based Neural Network for CTR Prediction. In: Proceedings of the 26th International Joint Conference on Artificial Intelligence. AAAI Press; 2017. p. 1725–1731.
  24. Interest-Aware Message-Passing GCN for Recommendation. In: Proceedings of the Web Conference 2021. ACM; 2021. p. 1296–1305.
  25. A Survey on Accuracy-oriented Neural Recommendation: From Collaborative Filtering to Information-rich Recommendation. TKDE. 2022;.
  26. Application of Machine Learning for Online Reputation Systems. MIR. 2021;18(3):492–502.
  27. Privacy-Preserving Synthetic Data Generation for Recommendation Systems. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM; 2022. p. 1379–1389.
  28. Filter-enhanced MLP is All You Need for Sequential Recommendation. In: Proceedings of the ACM Web Conference 2022. ACM; 2022. p. 2388–2399.
  29. Neural Collaborative Filtering. In: Proceedings of the 26th International Conference on World Wide Web. IW3C2; 2017. p. 173–182.
  30. Tang J, Wang K. Personalized Top-N Sequential Recommendation via Convolutional Sequence Embedding. In: Proceedings of the 17th ACM International Conference on Web Search and Data Mining. ACM; 2018. p. 565–573.
  31. BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer. In: Proceedings of the 28th ACM International Conference on Information and Knowledge Management. ACM; 2019. p. 1441–1450.
  32. LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM; 2020. p. 639–648.
  33. Learning Intents behind Interactions with Knowledge Graph for Recommendation. In: Proceedings of the ACM Web Conference 2021. ACM; 2021. p. 878–887.
  34. Fully Hyperbolic Graph Convolution Network for Recommendation. In: Proceedings of the 30th ACM International Conference on Information & Knowledge Management. ACM; 2021. p. 3483–3487.
  35. Learning deep representations by mutual information estimation and maximization. In: International Conference on Learning Representations; 2019. .
  36. Contrastive self-supervised representation learning using synthetic data. MIR. 2021;18(4):556–567.
  37. A Simple Framework for Contrastive Learning of Visual Representations. In: Proceedings of the 37th International Conference on Machine Learning. JMLR.org; 2020. p. 1597–1607.
  38. Self-Supervised Learning for Recommender Systems: A Survey. arXiv; 2022. Available from: https://arxiv.org/abs/2203.15876.
  39. Causal Intervention for Leveraging Popularity Bias in Recommendation. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM; 2021. p. 11–20.
  40. Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback. In: Proceedings of the 13th International Conference on Web Search and Data Mining. ACM; 2020. p. 501–509.
  41. Deconfounded recommendation for alleviating bias amplification. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. ACM; 2021. p. 1717–1725.
  42. FairGAN: GANs-based Fairness-aware Learning for Recommendations with Implicit Feedback. In: Proceedings of the ACM Web Conference 2022. ACM; 2022. p. 297–307.
  43. Causal Representation Learning for Out-of-Distribution Recommendation. In: Proceedings of the ACM Web Conference 2022. ACM; 2022. p. 3562–3571.
  44. Rubin DB. Causal Inference Using Potential Outcomes: Design, Modeling, Decisions. JASA. 2005;100(469):322–331.
  45. Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random. In: Proceedings of the 36th International Conference on Machine Learning. PMLR; 2019. p. 6638–6647.
  46. Clicks can be Cheating: Counterfactual Recommendation for Mitigating Clickbait Issue. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM; 2021. p. 1288–1297.
Citations (6)

Summary

We haven't generated a summary for this paper yet.

Github Logo Streamline Icon: https://streamlinehq.com