Enhancing Recommender Systems: A Strategy to Mitigate False Negative Impact (2211.13912v2)
Abstract: In implicit collaborative filtering (CF) task of recommender systems, recent works mainly focus on model structure design with promising techniques like graph neural networks (GNNs). Effective and efficient negative sampling methods that suit these models, however, remain underdeveloped. One challenge is that existing hard negative samplers tend to suffer from severer over-fitting in model training. In this work, we first study the reason behind the over-fitting, and illustrate it with the incorrect selection of false negative instances with the support of experiments. In addition, we empirically observe a counter-intuitive phenomenon, that is, polluting hard negative samples' embeddings with a quite large proportional of positive samples' embeddings will lead to remarkable performance gains for prediction accuracy. On top of this finding, we present a novel negative sampling strategy, i.e., positive-dominated negative synthesizing (PDNS). Moreover, we provide theoretical analysis and derive a simple equivalent algorithm of PDNS, where only a soft factor is added in the loss function. Comprehensive experiments on three real-world datasets demonstrate the superiority of our proposed method in terms of both effectiveness and robustness.
- Hard Negatives or False Negatives: Correcting Pooling Bias in Training Neural Ranking Models. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 118–127.
- Learning Recommenders for Implicit Feedback with Importance Resampling. In Proceedings of the ACM Web Conference 2022. 1997–2005.
- Top-k off-policy correction for a REINFORCE recommender system. In Proceedings of the Twelfth ACM International Conference on Web Search and Data Mining. 456–464.
- Reinforced Negative Sampling for Recommendation with Exposure Data. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10-16, 2019. 2230–2236.
- Simplify and Robustify Negative Sampling for Implicit Collaborative Filtering. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual.
- Gintare Karolina Dziugaite and Daniel M. Roy. 2015. Neural Network Matrix Factorization. CoRR (2015).
- LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation. In Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, SIGIR 2020, Virtual Event, China, July 25-30, 2020. 639–648.
- Collaborative Filtering for Implicit Feedback Datasets. 2008 Eighth IEEE International Conference on Data Mining (2008), 263–272.
- Embedding-based Retrieval in Facebook Search. In KDD ’20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Virtual Event, CA, USA, August 23-27, 2020. 2553–2561.
- MixGCF: An Improved Training Method for Graph Neural Network-based Recommender Systems. In KDD ’21: The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Virtual Event, Singapore, August 14-18, 2021, Feida Zhu, Beng Chin Ooi, and Chunyan Miao (Eds.). ACM, 665–674. https://doi.org/10.1145/3447548.3467408
- Hard Negative Mixing for Contrastive Learning. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual.
- Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
- Yehuda Koren. 2008. Factorization meets the neighborhood: a multifaceted collaborative filtering model. In Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, Nevada, USA, August 24-27, 2008. 426–434.
- Matrix Factorization Techniques for Recommender Systems. Computer 42, 8 (2009), 30–37.
- Daniel Lee and H Sebastian Seung. 2000. Algorithms for non-negative matrix factorization. Advances in neural information processing systems 13 (2000).
- Collaborative filtering with ordinal scale-based implicit ratings for mobile music recommendations. Information Sciences 180, 11 (2010), 2142–2155.
- Personalized ranking with importance sampling. In Proceedings of The Web Conference 2020. 1093–1103.
- Modeling user exposure in recommendation. In Proceedings of the 25th international conference on World Wide Web. 951–961.
- Variational autoencoders for collaborative filtering. In Proceedings of the 2018 world wide web conference. 689–698.
- Andriy Mnih and Russ R Salakhutdinov. 2007. Probabilistic matrix factorization. Advances in neural information processing systems 20 (2007).
- An emotional recommender system for music. IEEE Intelligent Systems 36, 5 (2020), 57–68.
- Maryam Khanian Najafabadi and Mohd Naz’ri Mahrin. 2016. A systematic literature review on the state of research and practice of collaborative filtering technique and implicit feedback. Artificial Intelligence Review 45 (2016), 167–201.
- Learning social representations with deep autoencoder for recommender system. World Wide Web 23 (2020), 2259–2279.
- Dae Hoon Park and Yi Chang. 2019. Adversarial Sampling and Training for Semi-Supervised Information Retrieval. In The World Wide Web Conference, WWW 2019, San Francisco, CA, USA, May 13-17, 2019. 1443–1453.
- Steffen Rendle and Christoph Freudenthaler. 2014. Improving pairwise learning for item recommendation from implicit feedback. In Seventh ACM International Conference on Web Search and Data Mining, WSDM 2014, New York, NY, USA, February 24-28, 2014. 273–282.
- BPR: Bayesian Personalized Ranking from Implicit Feedback. In UAI 2009, Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, Montreal, QC, Canada, June 18-21, 2009. 452–461.
- Ruslan Salakhutdinov and Andriy Mnih. 2007. Probabilistic Matrix Factorization. In Advances in Neural Information Processing Systems 20, Proceedings of the Twenty-First Annual Conference on Neural Information Processing Systems, Vancouver, British Columbia, Canada, December 3-6, 2007. 1257–1264.
- On the Theories Behind Hard Negative Sampling for Recommendation. In Proceedings of the ACM Web Conference 2023. 812–822.
- Yueming Sun and Yi Zhang. 2018. Conversational recommender system. In The 41st international acm sigir conference on research & development in information retrieval. 235–244.
- Adversarial Training Towards Robust Multimedia Recommender System. IEEE Transactions on Knowledge and Data Engineering 32 (2018), 855–867.
- Graph Convolutional Matrix Completion. CoRR abs/1706.02263 (2017).
- IRGAN: A Minimax Game for Unifying Generative and Discriminative Information Retrieval Models. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, August 7-11, 2017. 515–524.
- Incorporating gan for negative sampling in knowledge representation learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32.
- KGAT: Knowledge Graph Attention Network for Recommendation. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD 2019, Anchorage, AK, USA, August 4-8, 2019. 950–958.
- Neural Graph Collaborative Filtering. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2019, Paris, France, July 21-25, 2019. 165–174.
- mixup: Beyond Empirical Risk Minimization. In 6th International Conference on Learning Representations, ICLR 2018, Vancouver, BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings.
- Deep Learning Based Recommender System. ACM Computing Surveys (CSUR) 52 (2017), 1 – 38.
- Optimizing top-n collaborative filtering via dynamic negative item sampling. In Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval. 785–788.
- Neural autoregressive collaborative filtering for implicit feedback. In Proceedings of the 1st workshop on deep learning for recommender systems. 2–6.
- A Gain-Tuning Dynamic Negative Sampler for Recommendation. In Proceedings of the ACM Web Conference 2022. 277–285.
- Kexin Shi (9 papers)
- Yun Zhang (103 papers)
- Bingyi Jing (15 papers)
- Wenjia Wang (68 papers)