Proxy-based Item Representation for Attribute and Context-aware Recommendation (2312.06145v1)
Abstract: Neural network approaches in recommender systems have shown remarkable success by representing a large set of items as a learnable vector embedding table. However, infrequent items may suffer from inadequate training opportunities, making it difficult to learn meaningful representations. We examine that in attribute and context-aware settings, the poorly learned embeddings of infrequent items impair the recommendation accuracy. To address such an issue, we propose a proxy-based item representation that allows each item to be expressed as a weighted sum of learnable proxy embeddings. Here, the proxy weight is determined by the attributes and context of each item and may incorporate bias terms in case of frequent items to further reflect collaborative signals. The proxy-based method calculates the item representations compositionally, ensuring each representation resides inside a well-trained simplex and, thus, acquires guaranteed quality. Additionally, that the proxy embeddings are shared across all items allows the infrequent items to borrow training signals of frequent items in a unified model structure and end-to-end manner. Our proposed method is a plug-and-play model that can replace the item encoding layer of any neural network-based recommendation model, while consistently improving the recommendation performance with much smaller parameter usage. Experiments conducted on real-world recommendation benchmark datasets demonstrate that our proposed model outperforms state-of-the-art models in terms of recommendation accuracy by up to 17% while using only 10% of the parameters.
- Understanding Scaling Laws for Recommendation Models. arXiv preprint arXiv:2208.08489 (2022).
- Layer Normalization. arXiv preprint arXiv:1607.06450 (2016).
- A Review on Deep Learning for Recommender Systems: Challenges and Remedies. Artificial Intelligence Review 52 (2019), 1–37.
- Unsupervised Learning of Visual Features by Contrasting Cluster Assignments. Advances in Neural Information Processing Systems 33 (2020), 9912–9924.
- Emerging Properties in Self-supervised Vision Transformers. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 9650–9660.
- Generative Adversarial Framework for Cold-start Item Recommendation. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2565–2571.
- A Simple Framework for Contrastive Learning of Visual Representations. In International Conference on Machine Learning. 1597–1607.
- Clustered Embedding Learning for Recommender Systems. In The World Wide Web Conference.
- Intent Contrastive Learning for Sequential Recommendation. arXiv preprint arXiv:2202.02519 (2022).
- Masked Contrastive Learning for Anomaly Detection. The 30th International Joint Conference on Artificial Intelligence (2021).
- Unsupervised Proxy Selection for Session-based Recommender Systems. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 327–336.
- Deep Neural Networks for Youtube Recommendations. In Proceedings of the 10th ACM Conference on Recommender Systems. 191–198.
- Mamo: Memory-augmented Meta-optimization for Cold-start Recommendation. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 688–697.
- Continuous-time Sequential Recommendation with Temporal Graph Collaborative Transformer. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 433–442.
- DeepFM: A Factorization-machine based Neural Network for CTR Prediction. Proceedings of the 26th International Joint Conference on Artificial Intelligence (2017).
- Ruining He and Julian McAuley. 2016. VBPR: Visual Bayesian Personalized Ranking from Implicit Feedback. In Proceedings of the AAAI conference on Artificial Intelligence, Vol. 30.
- Neural Collaborative Filtering. In Proceedings of the 26th International Conference on World Wide Web. 173–182.
- Learning to Embed Categorical Features without Embedding Tables for Recommendation. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 840–850.
- Wang-Cheng Kang and Julian McAuley. 2018. Self-attentive Sequential Recommendation. In 2018 IEEE International Conference on Data Mining. 197–206.
- Proxy Anchor Koss for Deep Metric Learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3238–3247.
- Matrix Factorization Techniques for Recommender Systems. Computer 42, 8 (2009), 30–37.
- Prototypical Contrastive Learning of Unsupervised Representations. International Conference on Learning Representations (2021).
- Lightweight Self-attentive Sequential Recommendation. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 967–977.
- Learnable embedding sizes for recommender systems. 2021 International Conference on Learning Representations (2021).
- Ilya Loshchilov and Frank Hutter. 2019. Decoupled Weight Decay Regularization. 2019 International Conference on Learning Representations (2019).
- Image-based Recommendations on Styles and Substitutes. In Proceedings of the 38th International ACM SIGIR Conference on Research and Development in Information Retrieval. 43–52.
- Distributed Representations of Words and Phrases and their Compositionality. Advances in Neural Information Processing Systems 26 (2013).
- No Fuss Distance Metric Learning using Proxies. In Proceedings of the IEEE International Conference on Computer Vision. 360–368.
- Mixed Dimension Embeddings with Application to Memory-efficient Recommendation Systems. In IEEE International Symposium on Information Theory. 2786–2791.
- Xia Ning and George Karypis. 2011. SLIM: Sparse Linear Methods for Top-n Recommender Systems. In IEEE 11th International Conference on Data Mining. 497–506.
- Aleksandr Petrov and Craig Macdonald. 2022a. Effective and Efficient Training for Sequential Recommendation using Recency Sampling. In Proceedings of the 16th ACM Conference on Recommender Systems. 81–91.
- Aleksandr Petrov and Craig Macdonald. 2022b. A Systematic Review and Replicability Study of BERT4Rec for Sequential Recommendation. In Proceedings of the 16th ACM Conference on Recommender Systems. 436–447.
- Billion-scale recommendation with Heterogeneous Side Information at Taobao. In IEEE 36th International Conference on Data Engineering. 1667–1676.
- Personalizing Session-based Recommendations with Hierarchical Recurrent Neural Networks. In Proceedings of the 11th ACM Conference on Recommender Systems. 130–137.
- Context and Attribute-Aware Sequential Recommendation via Cross-Attention. In Proceedings of the 16th ACM Conference on Recommender Systems. 71–80.
- Steffen Rendle. 2010. Factorization Machines. In IEEE International Conference on Data Mining. 995–1000.
- BPR: Bayesian Personalized Ranking from Implicit Feedback. In Proceedings of the 25th Conference on Uncertainty in Artificial Intelligence. 452–461.
- Item-based Collaborative Filtering Recommendation Algorithms. In Proceedings of the 10th International Conference on World Wide Web. 285–295.
- Exploiting Session Information in BERT-based Session-aware Sequential Recommendation. In Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2639–2644.
- UMEC: Unified Model and Embedding Compression for Efficient Recommendation Systems. International Conference on Learning Representations.
- Kihyuk Sohn. 2016. Improved Deep Metric Learning with Multi-class n-pair Loss Objective. Advances in Neural Information Processing Systems 29 (2016).
- BERT4Rec: Sequential Recommendation with Bidirectional Encoder Representations from Transformer. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management. 1441–1450.
- Improving Collaborative Metric Learning with Efficient Negative Sampling. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval. 1201–1204.
- Attention is All You Need. Advances in Neural Information Processing Systems 30 (2017).
- Sequential Recommender Systems: Challenges, Progress and Prospects. Proceedings of the 28th International Joint Conference on Artificial Intelligence (2019).
- Contrastive Learning for Cold-start Recommendation. In Proceedings of the 29th ACM International Conference on Multimedia. 5382–5390.
- On the Unreasonable Effectiveness of Centroids in Image Retrieval. In Neural Information Processing: 28th International Conference, ICONIP 2021, Proceedings, Part IV 28. Springer, 212–223.
- SSE-PT: Sequential Recommendation via Personalized Transformer. In Proceedings of the 14th ACM Conference on Recommender Systems. 328–337.
- Stochastic Shared Embeddings: Data-driven Regularization of Embedding Layers. Advances in Neural Information Processing Systems 32 (2019).
- Deep Matrix Factorization Models for Recommender Systems.. In IJCAI, Vol. 17. Melbourne, Australia, 3203–3209.
- Learning Effective and Efficient Embedding via an Adaptively-masked Twins-based Layer. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management. 3568–3572.
- PCL: Proxy-based Contrastive Learning for Domain Generalization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7097–7107.
- Challenging the Long Tail Recommendation. Proceedings of the VLDB Endowment 5.9 (2012).
- Supporting Clustering with Contrastive Learning. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2021).
- Memory-efficient Embedding for Recommendations. arXiv preprint arXiv:2006.14827 (2020).
- S3-Rec: Self-supervised Learning for Sequential Recommendation with Mutual Information Maximization. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 1893–1902.
- Learning to Warm Up Cold Item Embeddings for Cold-start Recommendation with Meta Scaling and Shifting Networks. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1167–1176.