
A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys) (2404.00579v2)

Published 31 Mar 2024 in cs.IR and cs.AI

Abstract: Traditional recommender systems (RS) typically use user-item rating histories as their main data source. However, deep generative models now have the capability to model and sample from complex data distributions, including user-item interactions, text, images, and videos, enabling novel recommendation tasks. This comprehensive, multidisciplinary survey connects key advancements in RS using Generative Models (Gen-RecSys), covering: interaction-driven generative models; the use of large language models (LLMs) and textual data for natural language recommendation; and the integration of multimodal models for generating and processing images/videos in RS. Our work highlights necessary paradigms for evaluating the impact and harm of Gen-RecSys and identifies open challenges. This survey accompanies a tutorial presented at ACM KDD'24, with supporting materials provided at: https://encr.pw/vDhLq.


A Review of Modern Recommender Systems Using Generative Models (Gen-RecSys)

This manuscript presents an in-depth survey of the integration of generative models within recommender systems, also known as Gen-RecSys. The paper highlights how generative models have moved the field beyond traditional recommendation techniques such as collaborative filtering, which focus primarily on user-item interactions, toward methods that also draw on text, images, and videos.

Overview of Gen-RecSys

The survey categorizes the advancements in generative models applied to recommender systems. It provides a foundational overview of interaction-driven generative models, covers applications of LLMs in generative recommendation, retrieval, and conversational recommendation, and discusses the integration of multimodal models that handle image and video content.

Key Contributions

Interaction-Driven Generative Models: The survey covers diverse model paradigms like auto-encoding models, auto-regressive models, Generative Adversarial Networks (GANs), and diffusion models, which are utilized for various recommendation tasks. These models facilitate learning from complex user-item interaction histories, thereby helping to improve model predictions and recommendations.
LLMs in Recommender Systems: The paper explores the role of LLMs in generative recommendation tasks, focusing on zero-shot and few-shot prompting, fine-tuning, and retrieval-augmented generation (RAG). It also examines how LLMs can enrich user and item representations through both dense retrieval and joint embedding techniques.
Multimodal Models: The survey extends beyond text to include image and video interactions. It discusses the motivations for multimodal recommendation and its challenges, such as cross-modal alignment and fusion, and provides insights into contrastive learning approaches, including models like CLIP, that address these challenges.
Evaluation Frameworks: With the emergence of Gen-RecSys, existing evaluation methods are shown to be insufficient. The survey therefore calls for comprehensive benchmarking efforts to assess the impact of these systems and the societal harm they could cause. It emphasizes the importance of novel metrics for cognitive and affective engagement in user-system interactions.
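
To make the dense retrieval idea mentioned above more concrete, here is a minimal sketch, not the survey's own method. It assumes users and items have already been embedded into a shared vector space (for example by an LLM-based encoder) and ranks items by cosine similarity to the user vector. The item names, the three-dimensional vectors, and the helper names `cosine` and `recommend` are all hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def recommend(user_vec, item_vecs, k=2):
    """Return the names of the k items whose embeddings are closest to user_vec."""
    ranked = sorted(
        item_vecs.items(),
        key=lambda pair: cosine(user_vec, pair[1]),
        reverse=True,
    )
    return [name for name, _ in ranked[:k]]


# Hypothetical toy embeddings; a real system would obtain these from a trained encoder.
ITEMS = {
    "hiking boots": [0.9, 0.1, 0.0],
    "trail map": [0.8, 0.2, 0.1],
    "espresso machine": [0.0, 0.1, 0.9],
}
USER = [1.0, 0.0, 0.1]

top = recommend(USER, ITEMS, k=2)
print(top)
```

In this toy setup the two outdoor items rank above the espresso machine because their vectors point in nearly the same direction as the user vector. Production systems typically replace the exhaustive sort with an approximate nearest-neighbor index.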

Implications for Future Developments

The transition towards Gen-RecSys represents both practical advancements and theoretical shifts in how recommendations can be generated and evaluated. The inclusion of wide-ranging data modalities unlocks new potential for personalization and user engagement, but it also poses challenges related to fairness, privacy, and the ethical use of rich data. The survey underscores the necessity for more sophisticated evaluation methodologies that can discern the fine line between enhancing user experience and mitigating risks associated with biased or ethically questionable recommendations.

Furthermore, the survey points future research toward developing multimodal generative models that effectively integrate and align different data modalities, conducting red-teaming to ensure robustness against adversarial attacks, and understanding the broader societal implications of deploying Gen-RecSys at scale.

In conclusion, this survey sets the stage for advancing Gen-RecSys, offering insights into both the available technological innovations and the caution required in deploying these models. It calls for collective efforts from academia and industry to ensure that the growing capabilities of generative models align with societal values and ethical standards.

Authors (10)

Yashar Deldjoo (46 papers)
Zhankui He (27 papers)
Julian McAuley (238 papers)
Anton Korikov (10 papers)
Scott Sanner (70 papers)
Arnau Ramisa (14 papers)
René Vidal (154 papers)
Maheswaran Sathiamoorthy (14 papers)
Atoosa Kasirzadeh (28 papers)
Silvia Milano (4 papers)
