Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

AlignRec: Aligning and Training in Multimodal Recommendations (2403.12384v4)

Published 19 Mar 2024 in cs.IR and cs.LG

Abstract: With the development of multimedia systems, multimodal recommendations are playing an essential role, as they can leverage rich contexts beyond interactions. Existing methods mainly regard multimodal information as an auxiliary, using them to help learn ID features; However, there exist semantic gaps among multimodal content features and ID-based features, for which directly using multimodal information as an auxiliary would lead to misalignment in representations of users and items. In this paper, we first systematically investigate the misalignment issue in multimodal recommendations, and propose a solution named AlignRec. In AlignRec, the recommendation objective is decomposed into three alignments, namely alignment within contents, alignment between content and categorical ID, and alignment between users and items. Each alignment is characterized by a specific objective function and is integrated into our multimodal recommendation framework. To effectively train AlignRec, we propose starting from pre-training the first alignment to obtain unified multimodal features and subsequently training the following two alignments together with these features as input. As it is essential to analyze whether each multimodal feature helps in training and accelerate the iteration cycle of recommendation models, we design three new classes of metrics to evaluate intermediate performance. Our extensive experiments on three real-world datasets consistently verify the superiority of AlignRec compared to nine baselines. We also find that the multimodal features generated by AlignRec are better than currently used ones, which are to be open-sourced in our repository https://github.com/sjtulyf123/AlignRec_CIKM24.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Yifan Liu (135 papers)
  2. Kangning Zhang (7 papers)
  3. Xiangyuan Ren (3 papers)
  4. Yanhua Huang (6 papers)
  5. Jiarui Jin (23 papers)
  6. Yingjie Qin (5 papers)
  7. Ruilong Su (4 papers)
  8. Ruiwen Xu (6 papers)
  9. Weinan Zhang (322 papers)
  10. Yong Yu (219 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.