Causal Inference in Recommender Systems: A Survey and Future Directions (2208.12397v2)
Abstract: Recommender systems have become crucial in information filtering nowadays. Existing recommender systems extract user preferences based on the correlation in data, such as behavioral correlation in collaborative filtering, feature-feature, or feature-behavior correlation in click-through rate prediction. However, unfortunately, the real world is driven by causality, not just correlation, and correlation does not imply causation. For instance, recommender systems might recommend a battery charger to a user after buying a phone, where the latter can serve as the cause of the former; such a causal relation cannot be reversed. Recently, to address this, researchers in recommender systems have begun utilizing causal inference to extract causality, thereby enhancing the recommender system. In this survey, we offer a comprehensive review of the literature on causal inference-based recommendation. Initially, we introduce the fundamental concepts of both recommender system and causal inference as the foundation for subsequent content. We then highlight the typical issues faced by non-causality recommender system. Following that, we thoroughly review the existing work on causal inference-based recommender systems, based on a taxonomy of three-aspect challenges that causal inference can address. Finally, we discuss the open problems in this critical research area and suggest important potential future works.
- Junzhe Zhang and Elias Bareinboim . 2018. Fairness in Decision-Making – The Causal Explanation Formula. In AAAI 2018.
- Effective evaluation using logged bandit feedback from multiple loggers. In Proceedings of the 23rd ACM SIGKDD international conference on knowledge discovery and data mining. 687–696.
- I like it… i like it not: Evaluating user ratings noise in recommender systems. In International Conference on User Modeling, Adaptation, and Personalization. Springer, 247–258.
- Bandits with unobserved confounders: A causal approach. Advances in Neural Information Processing Systems 28 (2015), 1342–1350.
- Counterfactual Reasoning and Learning Systems: The Example of Computational Advertising. Journal of Machine Learning Research 14, 11 (2013).
- Does Offline Political Segregation Affect the Filter Bubble? An Empirical Analysis of Information Diversity for Dutch and Turkish Twitter Users. Computers in Human Behavior 41, C (2014), 405–415.
- Robin Burke. 2017. Multisided fairness for recommendation. arXiv preprint arXiv:1707.00093 (2017).
- Sequential Recommendation with Graph Neural Networks. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 378–387.
- AutoDebias: Learning to Debias for Recommendation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 21–30.
- Bias and debias in recommender system: A survey and future directions. ACM Transactions on Information Systems 41, 3 (2023), 1–39.
- Try This Instead: Personalized and Interpretable Substitute Recommendation. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (2020).
- Intent Contrastive Learning for Sequential Recommendation. In Proceedings of the ACM Web Conference 2022. 2172–2182.
- David Maxwell Chickering. 2002. Optimal structure identification with greedy search. Journal of machine learning research 3, Nov (2002), 507–554.
- Is seeing believing? How recommender system interfaces affect users’ opinions. In Proceedings of the SIGCHI conference on Human factors in computing systems. 585–592.
- Deep neural networks for youtube recommendations. In RecSys. 191–198.
- Addressing unmeasured confounder for recommendation with sensitivity analysis. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 305–315.
- Graph neural networks for social recommendation. In The world wide web conference. 417–426.
- Graph neural networks for social recommendation. In The World Wide Web Conference. 417–426.
- Counterfactual vision-and-language navigation via adversarial path sampler. In European Conference on Computer Vision. Springer, 71–86.
- Cross-domain recommendation without sharing user-relevant data. In The world wide web conference. 491–502.
- Neural multi-task recommendation from multi-behavior data. In 2019 IEEE 35th international conference on data engineering (ICDE). IEEE, 1554–1557.
- CIRS: Bursting filter bubbles by counterfactual interactive recommender system. ACM Transactions on Information Systems 42, 1 (2023), 1–27.
- Offline Evaluation to Make Decisions About PlaylistRecommendation Algorithms. Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining (2019).
- DeepFM: a factorization-machine based neural network for CTR prediction. In Proceedings of the 26th International Joint Conference on Artificial Intelligence. 1725–1731.
- A survey on knowledge graph-based recommender systems. IEEE Transactions on Knowledge and Data Engineering (2020).
- A survey of learning causality with data: Problems and methods. ACM Computing Surveys (CSUR) 53, 4 (2020), 1–37.
- Enhanced Doubly Robust Learning for Debiasing Post-Click Conversion Rate Estimation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 275–284.
- Equality of Opportunity in Supervised Learning. In NIPS.
- Lightgcn: Simplifying and powering graph convolution network for recommendation. In Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval. 639–648.
- LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (2020).
- Neural collaborative filtering. In WWW. 173–182.
- Addressing confounding feature issue for causal recommendation. ACM Transactions on Information Systems 41, 3 (2023), 1–23.
- Causpref: Causal preference learning for out-of-distribution recommendation. In Proceedings of the ACM Web Conference 2022. 410–421.
- Learning Bayesian networks: The combination of knowledge and statistical data. Machine learning 20, 3 (1995), 197–243.
- Efficient estimation of average treatment effects using the estimated propensity score. Econometrica 71, 4 (2003), 1161–1189.
- Jonathan Ho and Stefano Ermon. 2016. Generative adversarial imitation learning. Advances in neural information processing systems 29 (2016).
- Conet: Collaborative cross networks for cross-domain recommendation. In Proceedings of the 27th ACM international conference on information and knowledge management. 667–676.
- Collaborative filtering for implicit feedback datasets. In ICDM. 263–272.
- Recsim: A configurable simulation platform for recommender systems. arXiv preprint arXiv:1909.04847 (2019).
- Dominik Janzing. 2019. Causal Regularization. Advances in Neural Information Processing Systems 32 (2019), 12704–12714.
- Multi-behavior recommendation with graph convolutional networks. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 659–668.
- Learning representations for counterfactual inference. In International conference on machine learning. PMLR, 3020–3029.
- Comparisons instead of ratings: Towards more stable preferences. In 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, Vol. 1. IEEE, 451–456.
- Junzhe Zhang and Elias Bareinboim. 2018. Equality of Opportunity in Classification: A Causal Approach. In NeurIPS.
- Fairness in Algorithmic Decision Making: An Excursion Through the Lens of Causality. The World Wide Web Conference (2019).
- Doubly robust off-policy evaluation for ranking policies under the cascade behavior model. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. 487–497.
- Matrix factorization techniques for recommender systems. Computer 42, 8 (2009).
- Counterfactual Fairness. In NIPS.
- Causal bandits: learning good interventions via causal inference. In Proceedings of the 30th International Conference on Neural Information Processing Systems. 1189–1197.
- Offline reinforcement learning: Tutorial, review, and perspectives on open problems. arXiv preprint arXiv:2005.01643 (2020).
- Collaborative filtering with noisy ratings. In Proceedings of the 2019 SIAM International Conference on Data Mining. SIAM, 747–755.
- Be causal: De-biasing social network confounding in recommendation. ACM Transactions on Knowledge Discovery from Data 17, 1 (2023), 1–23.
- Offline evaluation of ranking policies with click models. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1685–1694.
- Towards Personalized Fairness Based on Causal Notion. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1054–1063.
- Mitigating sentiment bias for recommender systems. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 31–40.
- Roderick JA Little and Donald B Rubin. 2019. Statistical analysis with missing data. Vol. 793. John Wiley & Sons.
- Learning causal semantic representation for out-of-distribution prediction. arXiv preprint arXiv:2011.01681 (2020).
- Practical counterfactual policy learning for Top-K recommendations. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 1141–1151.
- Causal effect inference with deep latent-variable models. In Proceedings of the 31st International Conference on Neural Information Processing Systems. 6449–6459.
- Between clicks and satisfaction: Study on multi-phase user preferences and satisfaction for online news reading. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 435–444.
- A metric for Filter Bubble measurement in recommender algorithms considering the news domain. Applied Soft Computing 97, Part A (2020).
- Counterfactual evaluation of slate recommendations with sequential reward interactions. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 1779–1788.
- Causal interpretability for machine learning-problems, methods and evaluation. ACM SIGKDD Explorations Newsletter 22, 1 (2020), 18–33.
- Alleviating Spurious Correlations in Knowledge-aware Recommendations through Counterfactual Generator. In SIGIR.
- Detecting Noise in Recommender System Databases. In Proceedings of the 11th International Conference on Intelligent User Interfaces (IUI ’06). Association for Computing Machinery, New York, NY, USA, 109–115.
- Eli Pariser. 2011a. The filter bubble: What the Internet is hiding from you. penguin UK.
- E. Pariser. 2011b. The Filter Bubble: What the Internet Is Hiding from You. The Filter Bubble: What the Internet Is Hiding from You.
- Homophily, echo chambers, & selective exposure in social networks: What should civic educators do? Journal of Social Studies Research (2017).
- Judea Pearl. 1995. Causal diagrams for empirical research. Biometrika 82, 4 (1995), 669–688.
- Judea Pearl. 2009a. Causal inference in statistics: An overview. Statistics surveys 3 (2009), 96–146.
- Judea Pearl. 2009b. Causality. Cambridge university press.
- Judea Pearl and Dana Mackenzie. 2018. The Book of Why: The New Science of Cause and Effect (1st ed.). Basic Books, Inc.
- A million variables and more: the Fast Greedy Equivalence Search algorithm for learning high-dimensional graphical causal models, with an application to functional magnetic resonance images. International journal of data science and analytics 3, 2 (2017), 121–129.
- Steffen Rendle. 2010. Factorization machines. In 2010 IEEE International conference on data mining. IEEE, 995–1000.
- BPR: Bayesian personalized ranking from implicit feedback. In UAI. 452–461.
- Donald B Rubin. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology 66, 5 (1974), 688.
- Off-policy bandits with deficient support. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 965–975.
- Yuta Saito. 2020a. Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 309–318.
- Yuta Saito. 2020b. Doubly robust estimator for ranking metrics with post-click conversions. In Fourteenth ACM Conference on Recommender Systems. 92–100.
- Yuta Saito and Thorsten Joachims. 2021. Counterfactual learning and evaluation for recommender systems: Foundations, implementations, and recent advances. In Proceedings of the 15th ACM Conference on Recommender Systems. 828–830.
- Unbiased recommender learning from missing-not-at-random implicit feedback. In Proceedings of the 13th International Conference on Web Search and Data Mining. 501–509.
- Masahiro Sato. 2021. Online Evaluation Methods for the Causal Effect of Recommendations. In Fifteenth ACM Conference on Recommender Systems. 96–101.
- Uplift-based evaluation and optimization of recommenders. In Proceedings of the 13th ACM Conference on Recommender Systems. 296–304.
- Unbiased learning for the causal effect of recommendation. In Fourteenth ACM Conference on Recommender Systems. 378–387.
- Recommendations as treatments: Debiasing learning and evaluation. In international conference on machine learning. PMLR, 1670–1679.
- Recommendations as Treatments: Debiasing Learning and Evaluation. ArXiv abs/1602.05352 (2016).
- Gideon Schwarz. 1978. Estimating the dimension of a model. The annals of statistics (1978), 461–464.
- Estimating individual treatment effect: generalization bounds and algorithms. In International Conference on Machine Learning. PMLR, 3076–3085.
- Virtual-taobao: Virtualizing real-world online retail environment for reinforcement learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 4902–4909.
- A Model-Agnostic Causal Learning Framework for Recommendation using Search Data. In Proceedings of the ACM Web Conference 2022. 224–233.
- Autoint: Automatic feature interaction learning via self-attentive neural networks. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 1161–1170.
- Peter Spirtes. 2001. An anytime algorithm for causal inference. In International Workshop on Artificial Intelligence and Statistics. PMLR, 278–285.
- Causation, prediction, and search. MIT press.
- Harald Steck. 2013. Evaluation of recommendations: rating-prediction and ranking. In Proceedings of the 7th ACM conference on Recommender systems. 213–220.
- Off-policy evaluation for slate recommendation. Advances in Neural Information Processing Systems 30 (2017).
- Counterfactual Explainable Recommendation. Proceedings of the 30th ACM International Conference on Information & Knowledge Management (2021).
- Counterfactual explainable recommendation. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 1784–1793.
- Philip Thomas and Emma Brunskill. 2016. Data-efficient off-policy policy evaluation for reinforcement learning. In International Conference on Machine Learning. PMLR, 2139–2148.
- Recommending the Most Effective Intervention to Improve Employment for Job Seekers with Disability. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 3616–3626.
- Explainable Recommendation via Multi-Task Learning in Opinionated Text Data. The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval (2018).
- Denoising implicit feedback for recommendation. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 373–381.
- Deconfounded Recommendation for Alleviating Bias Amplification. In KDD ’21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Virtual Event, Singapore, August 14-18, 2021. 1717–1725.
- Clicks can be Cheating: Counterfactual Recommendation for Mitigating Clickbait Issue. In SIGIR ’21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual Event, Canada, July 11-15, 2021. 1288–1297.
- User-controllable Recommendation Against Filter Bubbles. In SIGIR.
- Causal Representation Learning for Out-of-Distribution Recommendation. In Proceedings of the ACM Web Conference 2022.
- Neural graph collaborative filtering. In Proceedings of the 42nd international ACM SIGIR conference on Research and development in Information Retrieval. 165–174.
- Doubly robust joint learning for recommendation on data missing not at random. In International Conference on Machine Learning. PMLR, 6638–6647.
- Combating Selection Biases in Recommender Systems with a Few Unbiased Ratings. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining. 427–435.
- Unbiased Sequential Recommendation with Latent Confounders. In Proceedings of the ACM Web Conference 2022. 2195–2204.
- Counterfactual data-augmented sequential recommendation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 347–356.
- Model-agnostic counterfactual reasoning for eliminating popularity bias in recommender system. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 1791–1800.
- Leveraging post-click feedback for content recommendations. In Proceedings of the 13th ACM Conference on Recommender Systems. 278–286.
- A Survey on Accuracy-oriented Neural Recommendation: From Collaborative Filtering to Information-rich Recommendation. IEEE Transactions on Knowledge and Data Engineering (2022), 1–1.
- A neural influence diffusion model for social recommendation. In Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval. 235–244.
- On the Opportunity of Causal Learning in Recommendation Systems: Foundation, Estimation, Prediction and Challenges. IJCAI.
- Graph meta network for multi-behavior recommendation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 757–766.
- Reinforcement knowledge graph reasoning for explainable recommendation. In Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval. 285–294.
- Attentional factorization machines: learning the weight of feature interactions via attention networks. In Proceedings of the 26th International Joint Conference on Artificial Intelligence. 3119–3125.
- Teng Xiao and Suhang Wang. 2022. Towards Unbiased and Robust Causal Ranking for Recommender Systems. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. 1158–1167.
- CausCF: Causal Collaborative Filtering for Recommendation Effect Estimation. 4253–4263.
- Counterfactual Review-based Recommendation. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 2231–2240.
- Adversarial counterfactual learning and evaluation for recommender system. arXiv preprint arXiv:2012.02295 (2020).
- Dynamic causal collaborative filtering. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 2301–2310.
- Deconfounded Causal Collaborative Filtering. arXiv preprint arXiv:2110.07122 (2021).
- Causal bandits with propagating inference. In International Conference on Machine Learning. PMLR, 5512–5520.
- Top-N Recommendation with Counterfactual User Preference Simulation. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 2342–2351.
- A survey on causal inference. arXiv preprint arXiv:2002.02770 (2020).
- A survey on causal inference. ACM Transactions on Knowledge Discovery from Data (TKDD) 15, 5 (2021), 1–46.
- A Feedback Shift Correction in Predicting Conversion Rates under Delayed Feedback. In Proceedings of The Web Conference 2020. 2740–2746.
- Graph Convolutional Neural Networks for Web-Scale Recommender Systems. arXiv preprint arXiv:1806.01973 (2018).
- Dynamic graph neural networks for sequential recommendation. IEEE Transactions on Knowledge and Data Engineering (2022).
- Causerec: Counterfactual user sequence synthesis for sequential recommendation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 367–377.
- Deep learning based recommender system: A survey and new perspectives. ACM Computing Surveys (CSUR) 52, 1 (2019), 1–38.
- Large-scale causal approaches to debiasing post-click conversion rate estimation with multi-task learning. In Proceedings of The Web Conference 2020. 2775–2781.
- Multiplex Graph Neural Networks for Multi-behavior Recommendation. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 2313–2316.
- Counterfactual Reward Modification for Streaming Recommendation with Delayed Feedback. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 41–50.
- Causal Intervention for Leveraging Popularity Bias in Recommendation. In SIGIR ’21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Virtual Event, Canada, July 11-15, 2021. 11–20.
- Explicit factor models for explainable recommendation based on phrase-level sentiment analysis. Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval (2014).
- Disentangling Long and Short-Term Interests for Recommendation. In Proceedings of the ACM Web Conference 2022. 2256–2267.
- Disentangling User Interest and Conformity for Recommendation with Causal Embedding. In WWW. ACM, 2980–2991.
- Causal discovery with reinforcement learning. arXiv preprint arXiv:1906.04477 (2019).
- Graph-based Embedding Smoothing for Sequential Recommendation. IEEE Transactions on Knowledge and Data Engineering (2021).
- Mitigating Hidden Confounding Effects for Causal Recommendation. arXiv preprint arXiv:2205.07499 (2022).
- Counterfactual data augmentation for mitigating gender stereotypes in languages with rich morphology. arXiv preprint arXiv:1906.04571 (2019).
- Chen Gao (136 papers)
- Yu Zheng (196 papers)
- Wenjie Wang (150 papers)
- Fuli Feng (143 papers)
- Xiangnan He (200 papers)
- Yong Li (628 papers)