Towards Human-like Preference Profiling in Sequential Recommendation (2506.02261v1)
Abstract: Sequential recommendation systems aspire to profile users by interpreting their interaction histories, echoing how humans make decisions by weighing experience, relative preference strength, and situational relevance. Yet existing LLM-based recommenders often fall short of mimicking the flexible, context-aware decision strategies humans exhibit, neglecting the structured and dynamic mechanisms fundamental to human behavior. To bridge this gap, we propose RecPO, a preference optimization framework that models structured feedback and contextual delay to emulate human-like prioritization in sequential recommendation. RecPO exploits adaptive reward margins based on inferred preference hierarchies and temporal signals, enabling the model to favor immediately relevant items and to distinguish between varying degrees of preference and aversion. Extensive experiments across five real-world datasets demonstrate that RecPO not only yields performance gains over state-of-the-art baselines but also mirrors key characteristics of human decision-making: favoring timely satisfaction, maintaining coherent preferences, and exercising discernment under shifting contexts.
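As a rough illustration of what an adaptive-margin preference objective could look like, the sketch below pairs a DPO-style implicit reward with a margin that grows with an inferred preference gap and decays with contextual delay. This is only one plausible reading of the abstract: the function name, its arguments (`pref_gap`, `delay`, `base_margin`, `decay`), and the exponential-decay margin form are all assumptions, since the abstract does not give RecPO's actual loss.

```python
import torch
import torch.nn.functional as F

def adaptive_margin_preference_loss(
    logp_chosen: torch.Tensor,       # log pi_theta(chosen item | history)
    logp_rejected: torch.Tensor,     # log pi_theta(rejected item | history)
    ref_logp_chosen: torch.Tensor,   # log pi_ref(chosen item | history)
    ref_logp_rejected: torch.Tensor, # log pi_ref(rejected item | history)
    pref_gap: torch.Tensor,          # inferred preference-strength gap in [0, 1] (assumed)
    delay: torch.Tensor,             # contextual delay since the interaction (assumed)
    beta: float = 0.1,               # inverse temperature on the implicit reward
    base_margin: float = 1.0,        # hypothetical margin scale
    decay: float = 0.1,              # hypothetical temporal-decay rate
) -> torch.Tensor:
    """DPO-style pairwise loss with an adaptive reward margin.

    Illustrative sketch only: the margin widens for clearer preference
    hierarchies and shrinks for stale interactions, so strongly and
    recently preferred items are pushed further above their negatives.
    """
    # Implicit rewards relative to a frozen reference policy, as in DPO.
    reward_chosen = beta * (logp_chosen - ref_logp_chosen)
    reward_rejected = beta * (logp_rejected - ref_logp_rejected)

    # Adaptive margin: scaled by preference gap, discounted by delay.
    margin = base_margin * pref_gap * torch.exp(-decay * delay)

    # Standard logistic preference loss, shifted by the margin.
    return -F.logsigmoid(reward_chosen - reward_rejected - margin).mean()
```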