Monitoring the Evolution of Behavioural Embeddings in Social Media Recommendation (2312.15265v2)
Abstract: Emerging short-video platforms like TikTok, Instagram Reels, and ShareChat present unique challenges for recommender systems, primarily originating from a continuous stream of new content. ShareChat alone receives approximately 2 million pieces of fresh content daily, complicating efforts to assess quality, learn effective latent representations, and accurately match content with the appropriate user base, especially given limited user feedback. Embedding-based approaches are a popular choice for industrial recommender systems because they can learn low-dimensional representations of items, leading to effective recommendation that can easily scale to millions of items and users. Our work characterizes the evolution of such embeddings in short-video recommendation systems, comparing the effect of batch and real-time updates to content embeddings. We investigate how embeddings change with subsequent updates, explore the relationship between embeddings and popularity bias, and highlight their impact on user engagement metrics. Our study unveils the contrast in the number of interactions needed to achieve mature embeddings in a batch learning setup versus a real-time one, identifies the point of highest information updates, and explores the distribution of $\ell_2$-norms across the two competing learning modes. Utilizing a production system deployed on a large-scale short-video app with over 180 million users, our findings offer insights into designing effective recommendation systems and enhancing user satisfaction and engagement in short-video applications.