Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Dataset and Models for Item Recommendation Using Multi-Modal User Interactions (2405.04246v1)

Published 7 May 2024 in cs.IR

Abstract: While recommender systems with multi-modal item representations (image, audio, and text), have been widely explored, learning recommendations from multi-modal user interactions (e.g., clicks and speech) remains an open problem. We study the case of multi-modal user interactions in a setting where users engage with a service provider through multiple channels (website and call center). In such cases, incomplete modalities naturally occur, since not all users interact through all the available channels. To address these challenges, we publish a real-world dataset that allows progress in this under-researched area. We further present and benchmark various methods for leveraging multi-modal user interactions for item recommendations, and propose a novel approach that specifically deals with missing modalities by mapping user interactions to a common feature space. Our analysis reveals important interactions between the different modalities and that a frequently occurring modality can enhance learning from a less frequent one.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (31)
  1. Charu C. Aggarwal. 2016. Context-Sensitive Recommender Systems. Springer International Publishing, Cham, 255–281. https://doi.org/10.1007/978-3-319-29659-3_8
  2. Jie Bao and Yu Zheng. 2017. Location-Based Recommendation Systems. Springer International Publishing, Cham, 1145–1153. https://doi.org/10.1007/978-3-319-17885-1_1580
  3. The Million Song Dataset. In Proceedings of the 12th International Conference on Music Information Retrieval (ISMIR 2011).
  4. DCDIR: A Deep Cross-Domain Recommendation System for Cold Start Users in Insurance Domain. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 1661–1664. https://doi.org/10.1145/3397271.3401193
  5. Recommending Target Actions Outside Sessions in the Data-Poor Insurance Domain. ACM Trans. Recomm. Syst. (jun 2023). https://doi.org/10.1145/3606950 Just Accepted.
  6. Learning Recommendations from User Actions in the Item-Poor Insurance Domain. In Proceedings of the 16th ACM Conference on Recommender Systems (Seattle, WA, USA) (RecSys ’22). Association for Computing Machinery, New York, NY, USA, 113–123. https://doi.org/10.1145/3523227.3546775
  7. Deep Adversarial Learning for Multi-Modality Missing Data Completion. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD ’18). Association for Computing Machinery, 1158–1166. https://doi.org/10.1145/3219819.3219963
  8. Thomas G. Dietterich. 1998. Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms. Neural Computation 10, 7 (oct 1998), 1895–1923. https://doi.org/10.1162/089976698300017197
  9. Personalized recommendation system based on social tags in the era of Internet of Things. Journal of Intelligent Systems 31, 1 (2022), 681–689. https://doi.org/doi:10.1515/jisys-2022-0053
  10. Extracting Semantic User Networks from Informal Communication Exchanges. In The Semantic Web – ISWC 2011. Springer Berlin Heidelberg, 209–224. https://doi.org/10.1007/978-3-642-25073-6_14
  11. You Get What You Chat: Using Conversations to Personalize Search-based Recommendations. , 207–223 pages.
  12. Ruining He and Julian McAuley. 2016. VBPR: Visual Bayesian Personalized Ranking from Implicit Feedback. In Proceedings of the 30th AAAI Conference on Artificial Intelligence (Phoenix, Arizona) (AAAI’16). AAAI Press, 144–150.
  13. A Survey on Conversational Recommender Systems. Comput. Surveys 54, 5, Article 105 (may 2021), 36 pages. https://doi.org/10.1145/3453154
  14. Zühal Kurt and Kemal Özkan. 2017. An image-based recommender system based on feature extraction techniques. In 2017 International Conference on Computer Science and Engineering (UBMK). 769–774. https://doi.org/10.1109/UBMK.2017.8093527
  15. Applied Linear Statistical Models. McGraw-Hill, Irwin.
  16. A Multi-Behavior Recommendation Method for Users Based on Graph Neural Networks. Applied Sciences 13, 16 (2023). https://doi.org/10.3390/app13169315
  17. User Diverse Preference Modeling by Multimodal Attentive Metric Learning. In Proceedings of the 27th ACM International Conference on Multimedia (Nice, France) (MM ’19). Association for Computing Machinery, New York, NY, USA, 1526–1534. https://doi.org/10.1145/3343031.3350953
  18. Leveraging Hybrid Recommendation System in Insurance Domain. International Journal Of Engineering And Computer Science 3 (2014), 8988–8992.
  19. Relative representations enable zero-shot latent space communication. In The Eleventh International Conference on Learning Representations. https://openreview.net/forum?id=SrC-nwieGJ
  20. An Insurance Recommendation System Using Bayesian Networks. In Proceedings of the Eleventh ACM Conference on Recommender Systems (Como, Italy) (RecSys ’17). Association for Computing Machinery, New York, NY, USA, 274–278. https://doi.org/10.1145/3109859.3109907
  21. A Knowledge-Based Recommendation System That Includes Sentiment Analysis and Deep Learning. IEEE Transactions on Industrial Informatics 15, 4 (2019), 2124–2135. https://doi.org/10.1109/TII.2018.2867174
  22. Multi-Modal Knowledge Graphs for Recommender Systems. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management (Virtual Event, Ireland) (CIKM ’20). Association for Computing Machinery, New York, NY, USA, 1405–1414. https://doi.org/10.1145/3340531.3411947
  23. KWAI-AD-AudVis. https://doi.org/10.5281/zenodo.4023390
  24. Missing Modalities Imputation via Cascaded Residual Autoencoder. In 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 4971–4980. https://doi.org/10.1109/CVPR.2017.528
  25. LRMM: Learning to Recommend with Missing Modalities. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 3360–3370. https://doi.org/10.18653/v1/D18-1373
  26. Multimodal Learning with Incomplete Modalities by Knowledge Distillation. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD ’20). Association for Computing Machinery, 1828–1838. https://doi.org/10.1145/3394486.3403234
  27. MMGCN: Multi-Modal Graph Convolution Network for Personalized Recommendation of Micro-Video. In Proceedings of the 27th ACM International Conference on Multimedia (Nice, France) (MM ’19). Association for Computing Machinery, New York, NY, USA, 1437–1445. https://doi.org/10.1145/3343031.3351034
  28. Multi-Behavior Enhanced Recommendation with Cross-Interaction Collaborative Relation Modeling. In 2021 IEEE 37th International Conference on Data Engineering (ICDE). 1931–1936. https://doi.org/10.1109/ICDE51399.2021.00179
  29. Multi-Behavior Self-Supervised Learning for Recommendation. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (¡conf-loc¿, ¡city¿Taipei¡/city¿, ¡country¿Taiwan¡/country¿, ¡/conf-loc¿) (SIGIR ’23). Association for Computing Machinery, New York, NY, USA, 496–505. https://doi.org/10.1145/3539618.3591734
  30. A Novel Intelligence Recommendation Model for Insurance Products with Consumer Segmentation. Journal of Systems Science and Information 2 (02 2014). https://doi.org/10.1515/JSSI-2014-0016
  31. Feng Zhu. 2021. Douban dataset (ratings, item details, user profiles, tags, and reviews). https://doi.org/10.13140/RG.2.2.31724.49286
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Simone Borg Bruun (5 papers)
  2. Krisztian Balog (76 papers)
  3. Maria Maistro (24 papers)

Summary

We haven't generated a summary for this paper yet.