- The paper demonstrates that MIRRN enhances CTR prediction by leveraging multi-granularity interest retrieval and sequence refinement, achieving up to a 1.35% AUC improvement.
- It introduces a novel multi-head Fourier Transformer to capture sequential interactions efficiently, balancing performance with computational complexity.
- Real-world tests validate MIRRN's impact, showing significant engagement gains with a 1.32% increase in song listens and a 0.55% boost in listening time.
An Analysis of Multi-granularity Interest Retrieval and Refinement Network for Long-Term User Behavior Modeling in CTR Prediction
This paper introduces a novel approach to Click-Through Rate (CTR) prediction by proposing the Multi-granularity Interest Retrieval and Refinement Network (MIRRN). The model addresses the limitations of current methodologies by focusing on long-term user behavior modeling to more effectively capture users' diverse interests.
Problem Statement
CTR prediction is a pivotal component for personalization in online platforms such as recommendation systems and online advertising. Current methods in long-term user behavior modeling mainly focus on identifying sequences related to a target item, followed by modeling relationships between the retrieved subsequences and the item. However, these methods often neglect two key aspects: the need to capture diverse user interests due to limited retrieval queries and the insufficient exploration of relational information within subsequences.
Proposed Methodology
The authors propose the MIRRN framework to mitigate these shortcomings through a multi-stage approach:
- Multi-granularity Interest Retrieval: Users' diverse interests are captured by constructing different queries utilizing behaviors at various time scales. This yields subsequences reflective of multi-granular interests, categorized into target-aware, local-aware, and global-aware interests.
- Behavior Sequence Refinement: To extract sequential and interactive information within subsequences, a novel multi-head Fourier Transformer is introduced. This approach enhances the modeling of user behaviors by efficiently capturing interaction with a time complexity comparable to self-attention mechanisms.
- Interest Activation: MIRRN employs multi-head target attention to weigh the influence of different granular interests on the target item.
Experimental Setup and Results
The presented model underwent rigorous evaluation using three large-scale datasets: Taobao, Alipay, and Tmall. MIRRN demonstrated superior performance, achieving an AUC improvement of up to 1.35% over state-of-the-art baselines. The inclusion of multiple granularities in interest retrieval was shown to significantly enhance performance. Moreover, each component of the model, specifically the behavior sequence refinement module and the multi-head target attention mechanism, proved crucial to achieving these results.
Additionally, an online A/B test conducted on a popular music streaming application revealed that MIRRN increased the average number of songs listened to by 1.32% and the average listening time by 0.55%.
Key Contributions
- Successful Multi-granular Interest Modeling: Unlike previous models that focus solely on target-aware interests, MIRRN encapsulates the user's multi-faceted behaviors across different temporal scopes, enhancing the user interest representation.
- Efficient Interaction Modeling: Through the introduction of the multi-head Fourier transformer, sequential and interactive information is mined comprehensively and efficiently. This innovation offers a balance between model complexity and inference speed.
- Practical Application and Business Value: In addition to outperforming academic benchmarks, the model demonstrates significant practical benefits in real-world applications, as illustrated by the improvements in user engagement metrics in the online test.
Implications and Future Work
The implications of this research extend to improving the efficacy and personalization of CTR models across various platforms by more accurately modeling user behavior sequences. Future research could explore further optimizations of retrieval methods and extend the multidimensional interest representations to cater to different business scenarios. Additionally, ongoing investigation into more efficient computational techniques could reduce the time complexity further, enhancing real-time application capabilities.
The MIRRN not only pushes the boundaries of CTR prediction through advanced user modeling techniques but also sets a precedent for future innovations in personalization technologies.