RAT: Retrieval-Augmented Transformer for Click-Through Rate Prediction (2404.02249v2)
Abstract: Predicting click-through rates (CTR) is a fundamental task for Web applications, where a key issue is to devise effective models for feature interactions. Current methodologies predominantly concentrate on modeling feature interactions within an individual sample, while overlooking the potential cross-sample relationships that can serve as a reference context to enhance the prediction. To make up for such deficiency, this paper develops a Retrieval-Augmented Transformer (RAT), aiming to acquire fine-grained feature interactions within and across samples. By retrieving similar samples, we construct augmented input for each target sample. We then build Transformer layers with cascaded attention to capture both intra- and cross-sample feature interactions, facilitating comprehensive reasoning for improved CTR prediction while retaining efficiency. Extensive experiments on real-world datasets substantiate the effectiveness of RAT and suggest its advantage in long-tail scenarios. The code has been open-sourced at \url{https://github.com/YushenLi807/WWW24-RAT}.
- Re-Imagen: Retrieval-Augmented Text-to-Image Generator. In ICLR.
- Wide & Deep Learning for Recommender Systems. In RecSys.
- Learning Enhanced Representations for Tabular Data via Neighborhood Propagation. In NeurIPS.
- DeepFM: A Factorization-Machine Based Neural Network for CTR Prediction. In IJCAI.
- REALM: Retrieval-Augmented Language Model Pre-Training. In ICML.
- FiBiNET: Combining Feature Importance and Bilinear Feature Interaction for Click-Through Rate Prediction. In RecSys.
- Revisiting the Tag Relevance Prediction Problem. In SIGIR.
- Architecture and Operation Adaptive Network for Online Recommendations. In KDD.
- ReFer: Retrieval-Enhanced Vertical Federated Recommendation for Full Set User Benefit. In SIGIR.
- Interpretable Click-Through Rate Prediction Through Hierarchical Attention. In WSDM.
- xDeepFM: Combining Explicit and Implicit Feature Interactions for Recommender Systems. In KDD.
- Retrieval Augmented Classification for Long-Tail Visual Recognition. In CVPR.
- FinalMLP: An Enhanced Two-Stream MLP Model for CTR Prediction. In AAAI.
- Retrieval & Interaction Machine for Tabular Data Prediction. In KDD.
- Fast Context-Aware Recommendations with Factorization Machines. In SIGIR.
- Okapi at TREC-3. NIST SP (1995).
- MISSRec: Pre-Training and Transferring Multi-Modal Interest-Aware Sequence Representation for Recommendation. In MM.
- DCN v2: Improved Deep & Cross Network and Practical Lessons for Web-Scale Learning to Rank Systems. In WWW.
- Dense Representation Learning and Retrieval for Tabular Data Prediction. In KDD.
- BARS: Towards Open Benchmarking for Recommender Systems. In SIGIR.
- Open Benchmarking for Click-Through Rate Prediction. In CIKM.