Understanding and Scaling Collaborative Filtering Optimization from the Perspective of Matrix Rank
Abstract: Collaborative Filtering (CF) methods dominate real-world recommender systems given their ability to learn high-quality, sparse ID-embedding tables that effectively capture user preferences. These tables scale linearly with the number of users and items, and are trained to ensure high similarity between embeddings of interacted user-item pairs, while maintaining low similarity for non-interacted pairs. Despite their high performance, encouraging dispersion for non-interacted pairs necessitates expensive regularization (e.g., negative sampling), hurting runtime and scalability. Existing research tends to address these challenges by simplifying the learning process, either by reducing model complexity or sampling data, trading performance for runtime. In this work, we move beyond model-level modifications and study the properties of the embedding tables under different learning strategies. Through theoretical analysis, we find that the singular values of the embedding tables are intrinsically linked to different CF loss functions. These findings are empirically validated on real-world datasets, demonstrating the practical benefits of higher stable rank, a continuous version of matrix rank which encodes the distribution of singular values. Based on these insights, we propose an efficient warm-start strategy that regularizes the stable rank of the user and item embeddings. We show that stable rank regularization during early training phases can promote higher-quality embeddings, resulting in training speed improvements of up to 66%. Additionally, stable rank regularization can act as a proxy for negative sampling, allowing for performance gains of up to 21% over loss functions with small negative sampling ratios. Overall, our analysis unifies current CF methods under a new perspective, their optimization of stable rank, motivating a flexible regularization method.
- Matrix Factorization Model in Collaborative Filtering Algorithms: A Survey. Procedia Computer Science 49 (2015), 136–146. https://doi.org/10.1016/j.procs.2015.04.237 Proceedings of 4th International Conference on Advances in Computing, Communication and Control (ICAC3’15).
- Towards Mitigating Dimensional Collapse of Representations in Collaborative Filtering. In Proceedings of the 17th ACM International Conference on Web Search and Data Mining (Merida, Mexico) (WSDM ’24). Association for Computing Machinery, New York, NY, USA, 106–115.
- A Survey of Collaborative Filtering-Based Recommender Systems: From Traditional Methods to Hybrid Methods Based on Social Networks. IEEE Access 6 (2018), 64301–64320. https://doi.org/10.1109/ACCESS.2018.2877208
- Wide and Deep Learning for Recommender Systems. arXiv:1606.07792Â [cs.LG] https://arxiv.org/abs/1606.07792
- Friendship and mobility: user movement in location-based social networks. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (San Diego, California, USA) (KDD ’11). Association for Computing Machinery, New York, NY, USA, 1082–1090. https://doi.org/10.1145/2020408.2020579
- Deep Neural Networks for YouTube Recommendations. In Proceedings of the 10th ACM Conference on Recommender Systems (Boston, Massachusetts, USA) (RecSys ’16). Association for Computing Machinery, New York, NY, USA, 191–198. https://doi.org/10.1145/2959100.2959190
- Clustering Techniques to Improve Scalability and Accuracy of Recommender Systems. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 29, 04 (2021), 621–651. https://doi.org/10.1142/S0218488521500276
- An Improved Sampler for Bayesian Personalized Ranking by Leveraging View Data. In Companion Proceedings of the The Web Conference 2018 (Lyon, France) (WWW ’18). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, 13–14. https://doi.org/10.1145/3184558.3186905
- A Survey of Collaborative Filtering Algorithms for Social Recommender Systems. In 2016 12th International Conference on Semantics, Knowledge and Grids (SKG). 40–46. https://doi.org/10.1109/SKG.2016.014
- T-RECSYS: A Novel Music Recommendation System Using Deep Learning. In 2019 IEEE International Conference on Consumer Electronics (ICCE). 1–6. https://doi.org/10.1109/ICCE.2019.8662028
- A Survey of Graph Neural Networks for Recommender Systems: Challenges, Methods, and Directions. ACM Trans. Recomm. Syst. 1, 1, Article 3 (March 2023), 51Â pages. https://doi.org/10.1145/3568022
- Learning to collide: Recommendation system model compression with learned hash functions. arXiv preprint arXiv:2203.15837 (2022).
- Carlos A. Gomez-Uribe and Neil Hunt. 2016. The Netflix Recommender System: Algorithms, Business Value, and Innovation. ACM Trans. Manage. Inf. Syst. 6, 4, Article 13 (Dec. 2016), 19 pages. https://doi.org/10.1145/2843948
- F. Maxwell Harper and Joseph A. Konstan. 2015. The MovieLens Datasets: History and Context. ACM Trans. Interact. Intell. Syst. 5, 4, Article 19 (Dec. 2015), 19 pages. https://doi.org/10.1145/2827872
- Lightgcn: Simplifying and powering graph convolution network for recommendation. In Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval. 639–648.
- Neural Collaborative Filtering. In Proceedings of the 26th International Conference on World Wide Web (Perth, Australia) (WWW ’17). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, 173–182. https://doi.org/10.1145/3038912.3052569
- A Locality Sensitive Hashing Based Approach for Federated Recommender System. In 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID). 836–842. https://doi.org/10.1109/CCGrid49817.2020.000-1
- Ilse CF Ipsen and Arvind K Saibaba. 2024. Stable Rank and Intrinsic Dimension of Real and Complex Matrices. arXiv preprint arXiv:2407.21594 (2024).
- Dietmar Jannach and Gediminas Adomavicius. 2016. Recommendation: From Algorithms to User Experience. In Proceedings of the 10th ACM Conference on Recommender Systems (RecSys).
- Amazon-m2: A multilingual multi-locale shopping session dataset for recommendation and text generation. Advances in Neural Information Processing Systems 36 (2024).
- Understanding Dimensional Collapse in Contrastive Self-supervised Learning. arXiv:2110.09348Â [cs.CV] https://arxiv.org/abs/2110.09348
- How Does Message Passing Improve Collaborative Filtering? arXiv preprint arXiv:2404.08660 (2024).
- Matrix Factorization Techniques for Recommender Systems. Computer 42, 8 (2009), 30–37. https://doi.org/10.1109/MC.2009.263
- Benjamin Lacker and Samuel F. Way. 2024. Socially-Motivated Music Recommendation. In Proceedings of the International AAAI Conference on Web and Social Media.
- Revisiting Recommendation Loss Functions through Contrastive Learning (Technical Report). arXiv preprint arXiv:2312.08520 (2023).
- Deep generative ranking for personalized recommendation. In Proceedings of the 13th ACM Conference on Recommender Systems (Copenhagen, Denmark) (RecSys ’19). Association for Computing Machinery, New York, NY, USA, 34–42. https://doi.org/10.1145/3298689.3347012
- Julian McAuley and Alex Yang. 2016. Addressing Complex and Subjective Product-Related Queries with Customer Reviews. In Proceedings of the 25th International Conference on World Wide Web (Montreal, Quebec, Canada) (WWW ’16). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, 625–635. https://doi.org/10.1145/2872427.2883044
- Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 (2018).
- Toward a Better Understanding of Loss Functions for Collaborative Filtering. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management. 2034–2043.
- BPR: Bayesian personalized ranking from implicit feedback. In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence (Montreal, Quebec, Canada) (UAI ’09). AUAI Press, Arlington, Virginia, USA, 452–461.
- Mark Rudelson and Roman Vershynin. 2007. Sampling from large matrices: An approach through geometric functional analysis. Journal of the ACM (JACM) 54, 4 (2007), 21–es.
- Graph Neural Networks for Friend Ranking in Large-scale Social Platforms. In Proceedings of the Web Conference 2021 (Ljubljana, Slovenia) (WWW ’21). Association for Computing Machinery, New York, NY, USA, 2535–2546. https://doi.org/10.1145/3442381.3450120
- Xiaoyuan Su and Taghi M. Khoshgoftaar. 2009. A survey of collaborative filtering techniques. Adv. in Artif. Intell. 2009, Article 4 (Jan. 2009), 1 pages. https://doi.org/10.1155/2009/421425
- Anchen Sun and Yuanzhe Peng. 2022. A survey on modern recommendation system based on big data. arXiv e-prints (2022), arXiv–2206.
- Towards Representation Alignment and Uniformity in Collaborative Filtering. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 1816–1825.
- DCN V2: Improved Deep and Cross Network and Practical Lessons for Web-scale Learning to Rank Systems. In Proceedings of the Web Conference 2021 (WWW ’21). ACM. https://doi.org/10.1145/3442381.3450078
- Neural graph collaborative filtering. In Proceedings of the 42nd international ACM SIGIR conference on Research and development in Information Retrieval. 165–174.
- A Message Passing Perspective on Learning Dynamics of Contrastive Learning. In International Conference on Learning Representations.
- On the Effectiveness of Sampled Softmax Loss for Item Recommendation. ACM Trans. Inf. Syst. 42, 4, Article 98 (March 2024), 26Â pages.
- Going Beyond Local: Global Graph-Enhanced Personalized News Recommendations. In Proceedings of the 17th ACM Conference on Recommender Systems (Singapore, Singapore) (RecSys ’23). Association for Computing Machinery, New York, NY, USA, 24–34. https://doi.org/10.1145/3604915.3608801
- Yelp. 2023. Yelp Dataset. https://www.yelp.com/dataset
- Barlow Twins: Self-Supervised Learning via Redundancy Reduction. arXiv:2103.03230Â [cs.CV] https://arxiv.org/abs/2103.03230
- Deep Learning Based Recommender System: A Survey and New Perspectives. ACM Comput. Surv. 52, 1, Article 5 (Feb. 2019), 38Â pages. https://doi.org/10.1145/3285029
- Fairness among New Items in Cold Start Recommender Systems. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (Virtual Event, Canada) (SIGIR ’21). Association for Computing Machinery, New York, NY, USA, 767–776. https://doi.org/10.1145/3404835.3462948
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.