2000 character limit reached
Confidence Calibration for Recommender Systems and Its Applications (2402.16325v1)
Published 26 Feb 2024 in cs.IR
Abstract: Despite the importance of having a measure of confidence in recommendation results, it has been surprisingly overlooked in the literature compared to the accuracy of the recommendation. In this dissertation, I propose a model calibration framework for recommender systems for estimating accurate confidence in recommendation results based on the learned ranking scores. Moreover, I subsequently introduce two real-world applications of confidence on recommendations: (1) Training a small student model by treating the confidence of a big teacher model as additional learning guidance, (2) Adjusting the number of presented items based on the expected user utility estimated with calibrated probability.
- Obtaining calibrated probabilities with personalized ranking models. In AAAI, 2022.
- Bpr: Bayesian personalized ranking from implicit feedback. In UAI, 2009.
- Predicting accurate probabilities with a ranking loss. In ICML, 2012.
- Where to stop reading a ranked list? threshold optimization using truncated score distributions. In SIGIR, 2009.
- On calibration of modern neural networks. In ICML, 2017.
- Beyond temperature scaling: Obtaining well-calibrated multi-class probabilities with dirichlet calibration. In NeurIPS, 2019.
- Intra order-preserving functions for calibration of multi-class neural networks. In NeurIPS, 2020.
- The isotonic regression problem and its dual. Journal of the American Statistical Association, 67(337):140–147, 1972.
- Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in large margin classifiers, 10(3):61–74, 1999.
- Recommendations as treatments: Debiasing learning and evaluation. In ICML, 2016.
- Yuta Saito. Unbiased pairwise learning from implicit feedback. In NeurIPS 2019 Workshop on Causal Machine Learning, 2019.
- Estimation of regression coefficients when some regressors are not always observed. Journal of the American statistical Association, 89(427):846–866, 1994.
- Cofirank-maximum margin matrix factorization for collaborative ranking. In NeurIPS, 2007.
- Beta calibration: a well-founded and easily implemented improvement on logistic calibration for binary classifiers. In AISTATS, 2017.
- Obtaining well calibrated probabilities using bayesian binning. In AAAI, 2015.
- Obtaining calibrated probability estimates from decision trees and naive bayesian classifiers. In ICML, 2001.
- Calibrating deep neural networks using focal loss. In NeurIPS, 2020.
- Christoph Baumgarten. A probabilistic solution to the selection and fusion problem in distributed information retrieval. In SIGIR, 1999.
- John A Swets. Effectiveness of information retrieval methods. American Documentation, 20(1):72–89, 1969.
- Modeling score distributions for combining the outputs of search engines. In SIGIR, 2001.
- Score distribution models: assumptions, intuition, and robustness to score manipulation. In SIGIR, 2010.
- Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830, 2011.
- Doubly robust joint learning for recommendation on data missing not at random. In ICML, 2019.
- Unbiased learning-to-rank with biased feedback. In WSDM, 2017.
- Paul R Rosenbaum. Overt bias in observational studies. In Observational studies, pages 71–104. Springer, 2002.
- Cab: Continuous adaptive blending for policy evaluation and learning. In ICML, 2019.
- Neural collaborative filtering. In WWW, 2017.
- Collaborative metric learning. In WWW, 2017.
- Lightgcn: Simplifying and powering graph convolution network for recommendation. In SIGIR, 2020.
- Being accurate is not enough: how accuracy metrics have hurt recommender systems. In CHI’06 extended abstracts on Human factors in computing systems, pages 1097–1101, 2006.
- Deep rating elicitation for new users in collaborative filtering. In WWW, 2020.
- Bidirectional distillation for top-k recommender system. In WWW, 2021.
- Jiaxi Tang and Ke Wang. Ranking distillation: Learning compact ranking models with high performance for recommender system. In KDD, 2018.
- Collaborative distillation for top-n recommendation. In ICDM, 2019.
- De-rrd: A knowledge distillation framework for recommender system. In CIKM, 2020.
- Collaborative topic regression with social regularization for tag recommendation. In IJCAI, 2013.
- Modeling user activity preference by leveraging user spatial temporal characteristics in lbsns. IEEE Transactions on Systems, Man, and Cybernetics: Systems, 45(1):129–142, 2014.
- Deep residual learning for image recognition. In CVPR, 2016.
- Learning multiple layers of features from tiny images. 2009.
- Shallow-deep networks: Understanding and mitigating network overthinking. In ICML, 2019.
- Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531, 2015.
- Fast matrix factorization for online recommendation with implicit feedback. In SIGIR, 2016.
- On sampled metrics for item recommendation. In KDD, 2020.
- A relaxed ranking-based factor model for recommender system from implicit feedback. In IJCAI, 2016.
- Cumulated gain-based evaluation of ir techniques. ACM Transactions on Information Systems (TOIS), pages 422–446, 2002.
- Collaborative denoising auto-encoders for top-n recommender systems. In WSDM, 2016.
- Extracting and composing robust features with denoising autoencoders. In ICML, 2008.
- Matrix factorization techniques for recommender systems. Computer, 42(8):30–37, 2009.
- Pytorch: An imperative style, high-performance deep learning library. In NeurIPS, 2019.
- Adam: A method for stochastic optimization. In ICLR, 2015.
- Born again neural networks. In ICML, 2018.
- Be your own teacher: Improve the performance of convolutional neural networks via self distillation. In CVPR, 2019.
- Discrete content-aware matrix factorization. In KDD, 2017.
- Discrete collaborative filtering. In SIGIR, 2016.
- Discrete personalized ranking for fast collaborative filtering from implicit feedback. In AAAI, 2017.
- Ke Zhou and Hongyuan Zha. Learning binary codes for collaborative filtering. In KDD, 2012.
- Candidate generation with binary codes for large-scale top-n recommendation. In CIKM, 2019.
- Speeding up the xbox recommender system using a euclidean transformation for inner-product spaces. In RecSys, 2014.
- Lemp: Fast retrieval of large entries in a matrix product. In SIGMOD, 2015.
- Fexipro: fast and exact inner product retrieval in recommender systems. In SIGMOD, 2017.
- Deep mutual learning. In CVPR, 2018.
- Collaborative learning for deep neural networks. In NeurIPS, 2018.
- Dual learning for machine translation. In NeurIPS, 2016.
- Performance of recommender algorithms on top-n recommendation tasks. In RecSys, 2010.
- Self-supervised graph learning for recommendation. In SIGIR, 2021.
- Learning binarized graph representations with multi-faceted quantization reinforcement for top-k recommendation. In KDD, 2022.
- Self-supervised hypergraph transformer for recommender systems. In KDD, 2022.
- Stephen E Robertson. The probability ranking principle in ir. Journal of documentation, 33(4):209–304, 1977.
- To swing or not to swing: learning when (not) to advertise. In CIKM, 2008.
- Measuring the business value of recommender systems. ACM Transactions on Management Information Systems (TMIS), 10(4):1–23, 2019.
- Cross-domain collaboration recommendation. In KDD, 2012.
- On the effectiveness of video prefetching relying on recommender systems for mobile devices. In 2016 13th IEEE Annual Consumer Communications & Networking Conference (CCNC), pages 429–434. IEEE, 2016.
- Fairness of exposure in rankings. In KDD, 2018.
- Fair ranking as fair division: Impact-based individual fairness in ranking. In KDD, 2022.
- Variational autoencoders for collaborative filtering. In WWW, 2018.
- Choppy: Cut transformer for ranked list truncation. In SIGIR, 2020.
- Overview of the trec 2007 legal track. In TREC, 2007.
- Incorporating retrieval information into the truncation of ranking lists for better legal search. In SIGIR, 2022.
- Learning to truncate ranked lists for information retrieval. In AAAI, 2021.
- Mtcut: A multi-task framework for ranked list truncation. In WSDM, 2022.
- An assumption-free approach to the dynamic truncation of ranked lists. In SIGIR, 2019.
- Long short-term memory. Neural computation, 9(8):1735–1780, 1997.
- Attention is all you need. In NeurIPS, 2017.
- Letor: A benchmark collection for research on learning to rank for information retrieval. Information Retrieval, 13(4):346–374, 2010.
- Ms marco: A human generated machine reading comprehension dataset. In CoCo@NeurIPS, 2016.
- Juan Ramos et al. Using tf-idf to determine word relevance in document queries. In Proceedings of the first instructional conference on machine learning, volume 242, pages 29–48. Citeseer, 2003.
- Distributed representations of sentences and documents. In ICML, 2014.
- Aspect-aware latent factor model: Rating prediction with ratings and reviews. In WWW, 2018.
- Collaborative filtering for implicit feedback datasets. In ICDM, 2008.
- Unbiased recommender learning from missing-not-at-random implicit feedback. In WSDM, 2020.
- Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748, 2018.
- Incorporating bias-aware margins into contrastive loss for collaborative filtering. In NeurIPS, 2022.
- An experimental comparison of click position-bias models. In WWW, 2008.
- David MW Powers. Evaluation: from precision, recall and f-measure to roc, informedness, markedness and correlation. Journal of Machine Learning Technologies, 2:37–63, 2011.
- Ranking interruptus: When truncated rankings are better and how to measure that. In SIGIR, 2022.
- Cross-domain recommendation: An embedding and mapping approach. In IJCAI, 2017.
- A dynamic model of sponsored search advertising. Marketing Science, 30(3):447–468, 2011.
- The knapsack problem: a survey. Naval Research Logistics Quarterly, 22(1):127–144, 1975.
- Patrick Billingsley. Probability and measure. John Wiley & Sons, 2008.
- Lucien Le Cam. An approximation theorem for the poisson binomial distribution. Pacific Journal of Mathematics, 10(4):1181–1197, 1960.
- Revisiting the calibration of modern neural networks. In NeurIPS, 2021.
- Local temperature scaling for probability calibration. In ICCV, 2021.
- Calibration of pre-trained transformers. In EMNLP, 2020.
- Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In EMNLP-IJCNLP, 2019.
- Collaborative deep learning for recommender systems. In KDD, 2015.
- Collaborative variational autoencoder for recommender systems. In KDD, 2017.
- Modeling task relationships in multi-task learning with multi-gate mixture-of-experts. In KDD, 2018.
- Semi-supervised classification with graph convolutional networks. In ICLR, 2017.
- Trends, problems and solutions of recommender system. In International conference on computing, communication & automation, pages 955–958. IEEE, 2015.