Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multimodal Recommender Systems: A Survey (2302.03883v2)

Published 8 Feb 2023 in cs.IR and cs.AI

Abstract: The recommender system (RS) has been an integral toolkit of online services. They are equipped with various deep learning techniques to model user preference based on identifier and attribute information. With the emergence of multimedia services, such as short videos, news and etc., understanding these contents while recommending becomes critical. Besides, multimodal features are also helpful in alleviating the problem of data sparsity in RS. Thus, Multimodal Recommender System (MRS) has attracted much attention from both academia and industry recently. In this paper, we will give a comprehensive survey of the MRS models, mainly from technical views. First, we conclude the general procedures and major challenges for MRS. Then, we introduce the existing MRS models according to four categories, i.e., Modality Encoder, Feature Interaction, Feature Enhancement and Model Optimization. Besides, to make it convenient for those who want to research this field, we also summarize the dataset and code resources. Finally, we discuss some promising future directions of MRS and conclude this paper. To access more details of the surveyed papers, such as implementation code, we open source a repository.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Qidong Liu (36 papers)
  2. Jiaxi Hu (12 papers)
  3. Yutian Xiao (2 papers)
  4. Jingtong Gao (14 papers)
  5. Xiangyu Zhao (192 papers)
  6. Wanyu Wang (26 papers)
  7. Qing Li (429 papers)
  8. Jiliang Tang (204 papers)
Citations (13)