Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Data Scarcity in Recommendation Systems: A Survey (2312.10073v1)

Published 8 Dec 2023 in cs.IR and cs.AI

Abstract: The prevalence of online content has led to the widespread adoption of recommendation systems (RSs), which serve diverse purposes such as news, advertisements, and e-commerce recommendations. Despite their significance, data scarcity issues have significantly impaired the effectiveness of existing RS models and hindered their progress. To address this challenge, the concept of knowledge transfer, particularly from external sources like pre-trained LLMs, emerges as a potential solution to alleviate data scarcity and enhance RS development. However, the practice of knowledge transfer in RSs is intricate. Transferring knowledge between domains introduces data disparities, and the application of knowledge transfer in complex RS scenarios can yield negative consequences if not carefully designed. Therefore, this article contributes to this discourse by addressing the implications of data scarcity on RSs and introducing various strategies, such as data augmentation, self-supervised learning, transfer learning, broad learning, and knowledge graph utilization, to mitigate this challenge. Furthermore, it delves into the challenges and future direction within the RS domain, offering insights that are poised to facilitate the development and implementation of robust RSs, particularly when confronted with data scarcity. We aim to provide valuable guidance and inspiration for researchers and practitioners, ultimately driving advancements in the field of RS.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (146)
  1. Gediminas Adomavicius and Alexander Tuzhilin. 2010. Context-aware recommender systems. In Recommender Systems Handbook. Springer, 217–253.
  2. Charu C Aggarwal. 2016. Recommender Systems - The Textbook. Springer.
  3. Khadija A Almohsen and Huda Al-Jobori. 2015. Recommender systems in light of big data. International Journal of Electrical and Computer Engineering 5, 6 (2015), 1553–1563.
  4. Trust-based recommendation systems: an axiomatic approach. In The 17th International Conference on World Wide Web. ACM, 199–208.
  5. Fariba Aznoli and Nima Jafari Navimipour. 2017. Cloud services recommendation: Reviewing the recent advances and suggesting the future research directions. Journal of Network and Computer Applications 77 (2017), 73–86.
  6. Rohit Babbar and Bernhard Schölkopf. 2019. Data scarcity, robustness and extreme multi-label classification. Machine Learning 108, 8-9 (2019), 1329–1351.
  7. A noise correction-based approach to support a recommender system in a highly sparse rating environment. Decision Support Systems 118 (2019), 46–57.
  8. A systematic review on data scarcity problem in deep learning: Solution and applications. Comput. Surveys 54, 10s (2022), 1–29.
  9. David Bawden and Lyn Robinson. 2009. The dark side of information: Overload, anxiety and other paradoxes and pathologies. Journal of Information Science 35, 2 (2009), 180–191.
  10. Personalized digital marketing recommender engine. Journal of Retailing and Consumer Services 53 (2020), 101799.
  11. Robin Burke. 2002. Hybrid recommender systems: Survey and experiments. User Modeling and User-Adapted Interaction 12 (2002), 331–370.
  12. Feature selection in machine learning: A new perspective. Neurocomputing 300 (2018), 70–79.
  13. Erion Çano and Maurizio Morisio. 2017. Hybrid recommender systems: A systematic literature review. Intelligent Data Analysis 21, 6 (2017), 1487–1524.
  14. Multi-task item-attribute graph pre-training for strict cold-start item recommendation. In The ACM Conference on Recommender Systems. ACM, 1–18.
  15. StreamRec: a real-time recommender system. In ACM SIGMOD International Conference on Management of Data. ACM, 1243–1246.
  16. CL Philip Chen and Zhulin Liu. 2017. Broad learning system: An effective and efficient incremental learning system without the need for deep architecture. IEEE Transactions on Neural Networks and Learning Systems 29, 1 (2017), 10–24.
  17. TLRec: transfer learning for cross-domain recommendation. In IEEE International Conference on Big Knowledge. IEEE, 167–172.
  18. Open Metaverse: Issues, Evolution, and Future. (2023). arXiv:2304.13931
  19. Metaverse security and privacy: An overview. In IEEE International Conference on Big Data. IEEE, 2950–2959.
  20. Jinyu Cheng and Hong Wang. 2021. Adaptive algorithm recommendation and application of learning resources in English fragmented reading. Complexity 2021 (2021), 1–11.
  21. A review of medical image data augmentation techniques for deep learning applications. Journal of Medical Imaging and Radiation Oncology 65, 5 (2021), 545–563.
  22. AutoAugment: Learning augmentation policies from data. (2018). arXiv:1805.09501
  23. A recommendation system for meta-modeling: A meta-learning based approach. Expert Systems with Applications 46 (2016), 33–44.
  24. A survey on recommendation system. International Journal of Computer Applications 160, 7 (2017), 6–10.
  25. Hierarchical Bi-directional self-attention networks for paper review rating recommendation. In Proceedings of the 28th International Conference on Computational Linguistics. 6302–6314.
  26. BERT: Pre-training of deep bidirectional transformers for language understanding. In Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, 4171–4186.
  27. Terrance DeVries and Graham W Taylor. 2017. Dataset augmentation in feature space. In The 5th International Conference on Learning Representations. OpenReview, 1–12.
  28. Mutual Wasserstein Discrepancy Minimization for Sequential Recommendation. In The ACM Web Conference. ACM, 1375–1385.
  29. Sequential recommendation with auxiliary item relationships via multi-relational transformer. In IEEE International Conference on Big Data. IEEE, 525–534.
  30. Sequential recommendation via stochastic self-attention. In Proceedings of the ACM Web Conference. 2036–2047.
  31. Graph collaborative signals denoising and augmentation for recommendation. In The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2037–2041.
  32. Recommendation systems: Algorithms, challenges, metrics, and business opportunities. Applied Sciences 10, 21 (2020), 7748.
  33. Shuang Feng and CL Philip Chen. 2018. Fuzzy broad learning system: A novel neuro-fuzzy model for regression and classification. IEEE Transactions on Cybernetics 50, 2 (2018), 414–424.
  34. Pattern mining: Current challenges and opportunities. In International Conference on Database Systems for Advanced Applications. Springer, 34–49.
  35. Data mining in distributed environment: a survey. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery 7, 6 (2017), e1216.
  36. A survey of utility-oriented pattern mining. IEEE Transactions on Knowledge and Data Engineering 33, 4 (2021), 1306–1327.
  37. Utility mining across multi-dimensional sequences. ACM Transactions on Knowledge Discovery from Data 15, 5 (2021), 1–24.
  38. Web 3.0: The Future of Internet. In Companion Proceedings of the ACM Web Conference 2023. 1266–1275.
  39. Enabling data diversity: efficient automatic augmentation via regularized adversarial training. In International Conference on Information Processing in Medical Imaging. Springer, 85–97.
  40. Research review for broad learning system: Algorithms, theory, and applications. IEEE Transactions on Cybernetics 52, 9 (2021), 8922–8950.
  41. A survey on knowledge graph-based recommender systems. IEEE Transactions on Knowledge and Data Engineering 34, 8 (2020), 3549–3568.
  42. Collaborative filtering based recommendation system: A survey. International Journal on Computer Science and Engineering 4, 5 (2012), 859.
  43. Probabilistic logic graph attention networks for reasoning. In Companion Proceedings of the Web Conference. ACM / IW3C2, 669–673.
  44. Brian W Head. 2007. Community engagement: participation on whose terms? Australian Journal of Political Science 42, 3 (2007), 441–454.
  45. Using self-supervised learning can improve model robustness and uncertainty. In Conference on Neural Information Processing System. NeurIPS, 32.
  46. Population based augmentation: Efficient learning of augmentation policy schedules. In International conference on machine learning. PMLR, 2731–2741.
  47. HuBERT: Self-supervised speech representation learning by masked prediction of hidden units. IEEE/ACM Transactions on Audio, Speech, and Language Processing 29 (2021), 3451–3460.
  48. Self-supervised learning for recommender system. In The 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 3440–3443.
  49. Motor learning and generalization using broad learning adaptive neural control. IEEE Transactions on Industrial Electronics 67, 10 (2019), 8608–8617.
  50. Accurate and efficient large-scale multi-label learning with reduced feature broad learning system using label correlation, Early Access. IEEE Transactions on Neural Networks and Learning Systems (2022), 1–14.
  51. Nouhaila Idrissi and Ahmed Zellou. 2020. A systematic literature review of sparsity issues in recommender systems. Social Network Analysis and Mining 10 (2020), 1–23.
  52. Recommendation systems: Principles, methods and evaluation. Egyptian Informatics Journal 16, 3 (2015), 261–273.
  53. A survey on contrastive self-supervised learning. Technologies 9, 1 (2020), 2.
  54. Recommender systems: Past, present, future. AI Magazine 42, 3 (2021), 3–6.
  55. Knowledge graph completion with adaptive sparse transfer matrix. In the AAAI Conference on Artificial Intelligence. AAAI Press, 985–991.
  56. Jun-Wei Jin and CL Philip Chen. 2018. Regularized robust broad learning system for uncertain data modeling. Neurocomputing 322 (2018), 58–69.
  57. News recommender systems–Survey and roads ahead. Information Processing & Management 54, 6 (2018), 1203–1227.
  58. A social-relationships-based service recommendation system for SIoT devices. IEEE Internet of Things Journal 8, 3 (2020), 1859–1870.
  59. A survey of recommendation systems: recommendation models, techniques, and application fields. Electronics 11, 1 (2022), 141.
  60. Walid Krichene and Steffen Rendle. 2020. On sampled metrics for item recommendation. In The 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 1748–1757.
  61. Transfer learning via contextual invariants for one-to-many cross-domain recommendation. In The 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, 1081–1090.
  62. User-generated content. IEEE Pervasive Computing 7, 4 (2008), 10–11.
  63. Canonical tensor decomposition for knowledge base completion. In International Conference on Machine Learning. PMLR, 2863–2872.
  64. Addressing cold-start problem in recommendation systems. In The 2nd International Conference on Ubiquitous Information Management and Communication. ACM, 208–211.
  65. A preliminary study on data augmentation of deep learning for image classification. In The 11th Asia-Pacific Symposium on Internetware. ACM, 1–6.
  66. Smart augmentation learning an optimal data augmentation strategy. IEEE Access 5 (2017), 5858–5869.
  67. Data shadows: Knowledge, openness, and absence. (2017), 191–202 pages.
  68. Pan Li and Alexander Tuzhilin. 2020. DDTCDR: Deep dual transfer cross domain recommendation. In The 13th International Conference on Web Search and Data Mining. ACM, 331–339.
  69. ATLRec: An attentional adversarial transfer learning network for cross-domain recommendation. Journal of Computer Science and Technology 35 (2020), 794–808.
  70. Fast autoaugment. In Advances in Neural Information Processing Systems. 6662–6672.
  71. Learning entity and relation embeddings for knowledge graph completion. In the AAAI conference on Artificial Intelligence. AAAI Press, 2181–2187.
  72. Joint representation learning for multi-modal transportation recommendation. In Proceedings of the AAAI Conference on Artificial Intelligence. AAAI Press, 1036–1043.
  73. K-BERT: Enabling language representation with knowledge graph. In the AAAI Conference on Artificial Intelligence. AAAI Press, 2901–2908.
  74. Exploiting unlabeled data in cnns by self-supervised learning to rank. IEEE Transactions on Pattern Analysis and Machine Intelligence 41, 8 (2019), 1862–1878.
  75. Self-supervised learning: Generative or contrastive. IEEE Transactions on Knowledge and Data Engineering 35, 1 (2021), 857–876.
  76. Research on the Matthews correlation coefficients metrics of personalized recommendation algorithm evaluation. International Journal of Hybrid Information Technology 8, 1 (2015), 163–172.
  77. Stacked broad learning system: From incremental flatted structure to deep model. IEEE Transactions on Systems, Man, and Cybernetics: Systems 51, 1 (2020), 209–222.
  78. Federated social recommendation with graph neural network. ACM Transactions on Intelligent Systems and Technology 13, 4 (2022), 1–24.
  79. Content-based recommender systems: State of the art and trends. In Recommender Systems Handbook. Springer, 73–105.
  80. Recommender systems. Physics Peports 519, 1 (2012), 1–49.
  81. Selective transfer learning for cross domain recommendation. In SIAM International Conference on Data Mining. SIAM, 641–649.
  82. Dynamic anticipation and completion for multi-hop reasoning over sparse knowledge graph. (2020). arXiv:2010.01899
  83. Transfer learning for cross-company software defect prediction. Information and Software Technology 54, 3 (2012), 248–256.
  84. Pattie Maes. 1995. Agents that reduce work and information overload. In Readings in Human-Computer Interaction. Elsevier, 811–821.
  85. Commonsense knowledge base completion with structural and semantic context. In the AAAI conference on Artificial Intelligence. AAAI Press, 2925–2933.
  86. Vusumuzi Maphosa and Mfowabo Maphosa. 2023. Fifteen years of recommender systems research in higher education: Current trends and future direction. Applied Artificial Intelligence 37, 1 (2023), 2175106.
  87. Pedro Marcelino. 2018. Transfer learning from pre-trained models. Towards Data Science 10 (2018), 23.
  88. TALMUD: transfer learning for multiple domains. In The 21st ACM International Conference on Information and Knowledge Management. ACM, 425–434.
  89. Lakshmi Narke and Azra Nasreen. 2020. A comprehensive review of approaches and challenges of a recommendation system. International Journal of Research in Engineering, Science and Management 3, 4 (2020), 381–384.
  90. Resolving data sparsity and cold start problem in collaborative filtering recommender system using linked open data. Expert Systems with Applications 149 (2020), 113248.
  91. Deep learning recommendation model for personalization and recommendation systems. (2019). arXiv:1906.00091
  92. Weike Pan. 2016. A survey of transfer learning for collaborative recommendation with auxiliary data. Neurocomputing 177 (2016), 447–453.
  93. Michael J Pazzani and Daniel Billsus. 2007. Content-based recommendation systems. In The Adaptive Web: Methods and Strategies of Web Personalization. Springer, 325–341.
  94. Yuanzhe Peng. 2022. A survey on modern recommendation system based on big data. (2022). arXiv:2206.02631
  95. Phongsavanh Phorasim and Lasheng Yu. 2017. Movies recommendation system using collaborative filtering and K-means. International Journal of Advanced Computer Research 7, 29 (2017), 52.
  96. Ricardo Ribani and Mauricio Marengoni. 2019. A survey of transfer learning for convolutional neural networks. In The 32nd SIBGRAPI Conference on Graphics, Patterns and Images Tutorials. IEEE, 47–57.
  97. Recommender systems: Introduction and challenges. In Recommender Systems Handbook. Springer, 1–34.
  98. Deepjyoti Roy and Mala Dutta. 2022. A systematic review and research perspective on recommender systems. Journal of Big Data 9, 1 (2022), 59.
  99. Laila Safoury and Akram Salah. 2013. Exploiting user demographic attributes for solving cold-start problem in recommender system. Lecture Notes on Software Engineering 1, 3 (2013), 303–307.
  100. Masked language model scoring. In the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, 2699–2712.
  101. Recommender systems in e-commerce. In The 1st ACM conference on Electronic commerce. ACM, 158–166.
  102. E-commerce recommendation applications. Data Mining and Knowledge Discovery 5 (2001), 115–153.
  103. Natural language processing for recommender systems. In Recommender Systems Handbook. Springer, 447–483.
  104. Guy Shani and Asela Gunawardana. 2011. Evaluating recommendation systems. In Recommender Systems Handbook. Springer, 257–297.
  105. A survey of research hotspots and frontier trends of recommendation systems from the perspective of knowledge graph. Expert Systems with Applications 165 (2021), 113764.
  106. Monika Singh. 2020. Scalability and sparsity issues in recommender datasets: a survey. Knowledge and Information Systems 62 (2020), 1–43.
  107. Classifications of recommender systems: A review. Journal of Engineering Science & Technology Review 10, 4 (2017), 132–153.
  108. The multisided complexity of fairness in recommender systems. AI magazine 43, 2 (2022), 164–176.
  109. Joo-yeong Song and Bongwon Suh. 2022. Data Augmentation Strategies for Improving Sequential Recommender Systems. (2022). arXiv:2203.14037
  110. A survey of music recommendation systems and future perspectives. In The 9th International Symposium on Computer Music Modeling and Retrieval, Vol. 4. Citeseer, 395–410.
  111. G Suganeshwari and SP Syed Ibrahim. 2016. A survey on collaborative filtering based recommendation system. In The 3rd International Symposium on Big Data and Cloud Computing Challenges. Springer, 503–518.
  112. Metaverse: Survey, applications, security, and opportunities. (2022). arXiv:2210.07990
  113. Internet of Behaviors: A Survey. IEEE Internet of Things Journal 10, 13 (2023), 11117–11134.
  114. Big data meets metaverse: A survey. (2022). arXiv:2210.16282
  115. Mohammed Temraz and Mark T Keane. 2022. Solving the class imbalance problem using a counterfactual method for data augmentation. Machine Learning with Applications 9 (2022), 100375.
  116. Complex embeddings for simple link prediction. In International Conference on Machine Learning. PMLR, 2071–2080.
  117. Recommendation systems for education: Systematic review. Electronics 10, 14 (2021), 1611.
  118. David A Van Dyk and Xiao-Li Meng. 2001. The art of data augmentation. Journal of Computational and Graphical Statistics 10, 1 (2001), 1–50.
  119. Ricardo Vilalta and Youssef Drissi. 2002. A perspective view and survey of meta-learning. Artificial Intelligence Review 18 (2002), 77–95.
  120. Addressing interpretability and cold-start in matrix factorization for recommender systems. IEEE Transactions on Knowledge and Data Engineering 31, 7 (2018), 1253–1266.
  121. A model of a trust-based recommendation system on a social network. Autonomous Agents and Multi-Agent Systems 16 (2008), 57–74.
  122. Web3: The Next Internet Revolution. arXiv preprint arXiv:2304.06111 (2023).
  123. RecSys-DAN: Discriminative adversarial networks for cross-domain recommender systems. IEEE Transactions on Neural Networks and Learning Systems 31, 8 (2019), 2731–2740.
  124. Whose AI Dream? In search of the aspiration in data annotation.. In the Conference on Human Factors in Computing Systems. 1–16.
  125. A survey on session-based recommender systems. Comput. Surveys 54, 7 (2021), 1–38.
  126. A survey on the fairness of recommender systems. ACM Transactions on Information Systems 41, 3 (2023), 1–43.
  127. MMGCN: Multi-modal graph convolution network for personalized recommendation of micro-video. In The 27th ACM International Conference on Multimedia. 1437–1445.
  128. Tracklet self-supervised learning for unsupervised person re-identification. The AAAI Conference on Artificial Intelligence 34, 07 (2020), 12362–12369.
  129. The Human-Centric Metaverse: A Survey. In Companion Proceedings of the ACM Web Conference 2023. 1296–1306.
  130. Image data augmentation for deep learning: A survey. (2022). arXiv:2204.08610
  131. Overcoming data sparsity in group recommendation. IEEE Transactions on Knowledge and Data Engineering 34, 7 (2020), 3447–3460.
  132. Constraint-based Sequential Rule Mining. In IEEE 9th International Conference on Data Science and Advanced Analytics. IEEE, 1–10.
  133. Self-supervised learning for recommender systems: A survey. IEEE Transactions on Knowledge and Data Engineering, Early Access (2023), 1–20.
  134. S4l: Self-supervised semi-supervised learning. In International Conference on Computer Vision. 1476–1485.
  135. Jiawei Zhang and Philip S Yu. 2019. Broad Learning Through Fusions. Springer.
  136. Deep learning based recommender system: A survey and new perspectives. Comput. Surveys 52, 1 (2019), 1–38.
  137. Iteratively learning embeddings and rules for knowledge graph reasoning. In The World Wide Web conference. 2366–2377.
  138. Adversarial autoaugment. The 8th International Conference on Learning Representations, 1–13.
  139. Efficient probabilistic logic reasoning with graph neural networks. (2020). arXiv:2001.11850
  140. Active transfer learning for cross-system recommendation. In The AAAI Conference on Artificial Intelligence. AIII Press, 1205–1211.
  141. A unified framework of active transfer learning for cross-system recommendation. Artificial Intelligence 245 (2017), 38–55.
  142. A preference-based method of updating the surrogate model by broad learning and its application. In IEEE Congress on Evolutionary Computation. IEEE, 1702–1709.
  143. On completing sparse knowledge base with transitive relation embedding. In the AAAI Conference on Artificial Intelligence. AIII Press, 3125–3132.
  144. Mobile app recommendations with security and privacy awareness. In The 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 951–960.
  145. Broad learning based multi-source collaborative recommendation. In The ACM on Conference on Information and Knowledge Management. 1409–1418.
  146. A comprehensive survey on transfer learning. Proc. IEEE 109, 1 (2020), 43–76.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Zefeng Chen (19 papers)
  2. Wensheng Gan (80 papers)
  3. Jiayang Wu (64 papers)
  4. Kaixia Hu (1 paper)
  5. Hong Lin (14 papers)
Citations (9)

Summary

We haven't generated a summary for this paper yet.