Understanding and Counteracting Feature-Level Bias in Click-Through Rate Prediction (2402.03600v1)
Abstract: Common click-through rate (CTR) prediction recommender models tend to exhibit feature-level bias, which leads to unfair recommendations among item groups and inaccurate recommendations for users. While existing methods address this issue by adjusting the learning of CTR models, such as through additional optimization objectives, they fail to consider how the bias is caused within these models. To address this research gap, our study performs a top-down analysis on representative CTR models. Through blocking different components of a trained CTR model one by one, we identify the key contribution of the linear component to feature-level bias. We conduct a theoretical analysis of the learning process for the weights in the linear component, revealing how group-wise properties of training data influence them. Our experimental and statistical analyses demonstrate a strong correlation between imbalanced positive sample ratios across item groups and feature-level bias. Based on this understanding, we propose a minimally invasive yet effective strategy to counteract feature-level bias in CTR models by removing the biased linear weights from trained models. Additionally, we present a linear weight adjusting strategy that requires fewer random exposure records than relevant debiasing methods. The superiority of our proposed strategies are validated through extensive experiments on three real-world datasets.
- Controlling Popularity Bias in Learning-to-Rank Recommendation. In Proceedings of the 11th ACM Conference on Recommender Systems. ACM, 42–46.
- Pearson Correlation Coefficient. In Noise reduction in speech processing. Springer, 1–4.
- Fairness in Recommendation Ranking through Pairwise Comparisons. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining. ACM, 2212–2220.
- Higher-Order Factorization Machines. Advances in Neural Information Processing Systems 29 (2016).
- Simon Caton and Christian Haas. 2020. Fairness in Machine Learning: A Survey. arXiv preprint arXiv:2010.04053 (2020).
- AutoDebias: Learning to Debias for Recommendation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 21–30.
- Bias and Debias in Recommender System: A Survey and Future Directions. ACM Transactions on Information Systems 41, 3 (2023), 1–39.
- Improving Recommendation Fairness via Data Augmentation. In Proceedings of the ACM Web Conference 2023. ACM, 1012–1020.
- Fairly adaptive negative sampling for recommendations. In Proceedings of the ACM Web Conference 2023. ACM, 3723–3733.
- Wide & Deep Learning for Recommender Systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. 7–10.
- Interpolative Distillation for Unifying Biased and Debiased Recommendation. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 40–49.
- External Evaluation of Ranking Models under Extreme Position-Bias. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. ACM, 252–261.
- A Comprehensive Survey on Trustworthy Recommender Systems. (2022).
- KuaiRand: An Unbiased Sequential Recommendation Dataset with Randomly Exposed Videos. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management. ACM, 3953–3957.
- Towards Long-Term Fairness in Recommendation. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining. ACM, 445–453.
- Fairness-Aware Ranking in Search & Recommendation Systems with Application to LinkedIn Talent Search. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 2221–2231.
- DeepFM: A Factorization-Machine Based Neural Network for CTR Prediction. In Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence. 1725–1731.
- Equality of Opportunity in Supervised Learning. Advances in Neural Information Processing Systems 29 (2016).
- F Maxwell Harper and Joseph A Konstan. 2015. The Movielens Datasets: History and Context. Acm Transactions on Interactive Intelligent Systems (TIIS) 5, 4 (2015), 1–19.
- Xiangnan He and Tat-Seng Chua. 2017. Neural Factorization Machines for Sparse Predictive Analytics. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 355–364.
- Neural Collaborative Filtering. In Proceedings of the 26th International Conference on World Wide Web. ACM, 173–182.
- Debiasing Career Recommendations with Neural Fair Collaborative Filtering. In Proceedings of the Web Conference 2021. ACM / IW3C2, 3779–3790.
- Kalervo Järvelin and Jaana Kekäläinen. 2002. Cumulated Gain-Based Evaluation of IR Techniques. ACM Transactions on Information Systems (TOIS) 20, 4 (2002), 422–446.
- Field-Aware Factorization Machines for CTR Prediction. In Proceedings of the 10th ACM Conference on Recommender Systems. ACM, 43–50.
- Diederik P Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Optimization. (2014).
- Yehuda Koren. 2008. Factorization Meets the Neighborhood: A Multifaceted Collaborative Filtering Model. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 426–434.
- An Adversarial Approach to Improve Long-Tail Performance in Neural Collaborative Filtering. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management. ACM, 1491–1494.
- FairGAN: GANs-based Fairness-aware Learning for Recommendations with Implicit Feedback. In Proceedings of the ACM Web Conference 2022. ACM, 297–307.
- User-Oriented Fairness in Recommendation. In Proceedings of the Web Conference 2021. ACM / IW3C2, 624–632.
- Fairness in Recommendation: A Survey. (2022).
- Towards Personalized Fairness Based on Causal Notion. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1054–1063.
- Interpretable Click-Through Rate Prediction through Hierarchical Attention. In Proceedings of the 13th International Conference on Web Search and Data Mining. ACM, 313–321.
- xDeepFM: Combining Explicit and Implicit Feature Interactions for Recommender Systems. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 1754–1763.
- A General Knowledge Distillation Framework for Counterfactual Recommendation via Uniform Data. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 831–840.
- Rating Distribution Calibration for Selection Bias Mitigation in Recommendations. In Proceedings of the ACM Web Conference 2022. ACM, 2048–2057.
- Mitigating Popularity Bias for Users and Items with Fairness-Centric Adaptive Recommendation. ACM Transactions on Information Systems 41, 3 (2023), 1–27.
- Feedback Loop and Bias Amplification in Recommender Systems. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. ACM, 2145–2148.
- Regulating Group Exposure for Item Providers in Recommendation. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1839–1843.
- Alistair Moffat and Justin Zobel. 2008. Rank-Biased Precision for Measurement of Retrieval Effectiveness. ACM Transactions on Information Systems (TOIS) 27, 1 (2008), 1–27.
- CPFair: Personalized Consumer and Producer Fairness Re-ranking for Recommender Systems. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 770–779.
- Vilfredo Pareto et al. 1971. Manual of Political Economy. (1971).
- Fairrec: Two-sided fairness for personalized recommendations in two-sided platforms. In Proceedings of the web conference 2020. ACM, 1194–1204.
- Judea Pearl. 2009. Causality. Cambridge university press.
- Experiments on Generalizability of User-Oriented Fairness in Recommender Systems. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 2755–2764.
- Steffen Rendle. 2010. Factorization Machines. In 2010 IEEE International Conference on Data Mining. IEEE Computer Society, 995–1000.
- Countering Popularity Bias by Regularizing Score Differences. In Proceedings of the 16th ACM Conference on Recommender Systems. ACM, 145–155.
- Predicting Clicks: Estimating the Click-Through Rate for New Ads. In Proceedings of the 16th International Conference on World Wide Web. ACM, 521–530.
- Donald B Rubin. 2005. Causal Inference Using Potential Outcomes: Design, Modeling, Decisions. J. Amer. Statist. Assoc. 100, 469 (2005), 322–331.
- Unbiased Recommender Learning from Missing-Not-at-Random Implicit Feedback. In Proceedings of the 13th International Conference on Web Search and Data Mining. ACM, 501–509.
- Recommendations as Treatments: Debiasing Learning and Evaluation. In Proceedings of the 33nd International Conference on Machine Learning. PMLR, 1670–1679.
- Scoping Fairness Objectives and Identifying Fairness Metrics for Recommender Systems: The Practitioners’ Perspective. In Proceedings of the ACM Web Conference 2023. ACM, 3648–3659.
- Autoint: Automatic Feature Interaction Learning via Self-Attentive Neural Networks. In Proceedings of the 28th ACM International Conference on Information & Knowledge Management. ACM, 1161–1170.
- Lequn Wang and Thorsten Joachims. 2023. Uncertainty Quantification for Fairness in Two-Stage Recommender Systems. In Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining. ACM, 940–948.
- Deconfounded Recommendation for Alleviating Bias Amplification. In The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. ACM, 1717–1725.
- Doubly Robust Joint Learning for Recommendation on Data Missing Not at Random. In International Conference on Machine Learning. PMLR, 6638–6647.
- Model-Agnostic Counterfactual Reasoning for Eliminating Popularity Bias in Recommender System. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. ACM, 1791–1800.
- A Multi-Objective Optimization Framework for Multi-Stakeholder Fairness-Aware Recommendation. ACM Transactions on Information Systems 41, 2 (2022), 1–29.
- Joint Multisided Exposure Fairness for Recommendation. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 703–714.
- Learning Fair Representations for Recommendation: A Graph-Based Perspective. In Proceedings of the Web Conference 2021. ACM / IW3C2, 2198–2208.
- FASTER: A Dynamic Fairness-assurance Strategy for Session-based Recommender Systems. ACM Transactions on Information Systems (2023).
- TFROM: A Two-Sided Fairness-Aware Recommendation Model for Both Customers and Providers. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1013–1022.
- Attentional Factorization Machines: Learning the Weight of Feature Interactions via Attention Networks. In Proceedings of the 26th International Joint Conference on Artificial Intelligence. AAAI Press, 3119–3125.
- P-MMF: Provider Max-min Fairness Re-ranking in Recommender System. In Proceedings of the ACM Web Conference 2023. ACM, 3701–3711.
- Neutralizing Popularity Bias in Recommendation Models. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 2623–2628.
- Jerrold H Zar. 1972. Significance Testing of the Spearman Rank Correlation Coefficient. J. Amer. Statist. Assoc. 67, 339 (1972), 578–580.
- Causal Intervention for Leveraging Popularity Bias in Recommendation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 11–20.
- Towards Fair Classifiers Without Sensitive Attributes: Exploring Biases in Related Features. In Proceedings of the 15th International Conference on Web Search and Data Mining. ACM, 1433–1442.
- Disentangling User Interest and Conformity for Recommendation with Causal Embedding. In Proceedings of the Web Conference 2021. ACM / IW3C2, 2980–2991.
- Deep Interest Network for Click-Through Rate Prediction. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1059–1068.
- BARS: Towards Open Benchmarking for Recommender Systems. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 2912–2923.
- Open Benchmarking for Click-Through Rate Prediction. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. ACM, 2759–2769.
- Popularity-Opportunity Bias in Collaborative Filtering. In Proceedings of the 14th ACM International Conference on Web Search and Data Mining. ACM, 85–93.
- Measuring and Mitigating Item Under-Recommendation Bias in Personalized Ranking Systems. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 449–458.