
Multimodal Recommendation Dialog with Subjective Preference: A New Challenge and Benchmark (2305.18212v1)

Published 26 May 2023 in cs.IR, cs.AI, cs.CL, cs.CV, cs.LG, and cs.MM

Abstract: Existing multimodal task-oriented dialog data fails to demonstrate the diverse expressions of user subjective preferences and recommendation acts in real-life shopping scenarios. This paper introduces a new dataset SURE (Multimodal Recommendation Dialog with SUbjective PREference), which contains 12K shopping dialogs in complex store scenes. The data is built in two phases with human annotations to ensure quality and diversity. SURE is well-annotated with subjective preferences and recommendation acts proposed by sales experts. A comprehensive analysis is given to reveal the distinguishing features of SURE. Three benchmark tasks are then proposed on the data to evaluate the capability of multimodal recommendation agents. Based on SURE, we propose a baseline model, powered by a state-of-the-art multimodal model, for these tasks.

Authors (6)
  1. Yuxing Long (9 papers)
  2. Binyuan Hui (57 papers)
  3. Caixia Yuan (1 paper)
  4. Fei Huang (408 papers)
  5. Yongbin Li (128 papers)
  6. Xiaojie Wang (108 papers)
Citations (3)