USE: Dynamic User Modeling with Stateful Sequence Models (2403.13344v1)
Abstract: User embeddings play a crucial role in user engagement forecasting and personalized services. Recent advances in sequence modeling have sparked interest in learning user embeddings from behavioral data. Yet behavior-based user embedding learning faces the unique challenge of dynamic user modeling. As users continuously interact with the apps, user embeddings should be periodically updated to account for users' recent and long-term behavior patterns. Existing methods highly rely on stateless sequence models that lack memory of historical behavior. They have to either discard historical data and use only the most recent data or reprocess the old and new data jointly. Both cases incur substantial computational overhead. To address this limitation, we introduce User Stateful Embedding (USE). USE generates user embeddings and reflects users' evolving behaviors without the need for exhaustive reprocessing by storing previous model states and revisiting them in the future. Furthermore, we introduce a novel training objective named future W-behavior prediction to transcend the limitations of next-token prediction by forecasting a broader horizon of upcoming user behaviors. By combining it with the Same User Prediction, a contrastive learning-based objective that predicts whether different segments of behavior sequences belong to the same user, we further improve the embeddings' distinctiveness and representativeness. We conducted experiments on 8 downstream tasks using Snapchat users' behavioral logs in both static (i.e., fixed user behavior sequences) and dynamic (i.e., periodically updated user behavior sequences) settings. We demonstrate USE's superior performance over established baselines. The results underscore USE's effectiveness and efficiency in integrating historical and recent user behavior sequences into user embeddings in dynamic user modeling.
- A Reference Data Model for Process-Related User Interaction Logs. In Claudio Di Ciccio, Remco Dijkman, Adela del Río Ortega, and Stefanie Rinderle-Ma, editors, Business Process Management, Lecture Notes in Computer Science, pages 57–74, Cham, 2022. Springer International Publishing. ISBN 978-3-031-16103-2. doi: 10.1007/978-3-031-16103-2_7.
- Learning Invariant Representations of Social Media Users. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 1684–1695, Hong Kong, China, November 2019. Association for Computational Linguistics. doi: 10.18653/v1/D19-1178. URL https://aclanthology.org/D19-1178.
- Contextual embeddings: When are they worth it? In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 2650–2663, Online, July 2020. Association for Computational Linguistics. doi: 10.18653/v1/2020.acl-main.236. URL https://aclanthology.org/2020.acl-main.236.
- Latent cross: Making use of context in recurrent recommender systems. In Proceedings of the eleventh ACM international conference on web search and data mining, pages 46–54, 2018.
- Predictive Analysis by Leveraging Temporal User Behavior and User Embeddings. In Proceedings of the 27th ACM International Conference on Information and Knowledge Management, pages 2175–2182, Torino Italy, October 2018a. ACM. ISBN 978-1-4503-6014-2. doi: 10.1145/3269206.3272032. URL https://dl.acm.org/doi/10.1145/3269206.3272032.
- Forum User Profiling by Incorporating User Behavior and Social Network Connections. In Jing Xiao, Zhi-Hong Mao, Toyotaro Suzumura, and Liang-Jie Zhang, editors, Cognitive Computing – ICCC 2018, Lecture Notes in Computer Science, pages 30–42, Cham, 2018b. Springer International Publishing. ISBN 978-3-319-94307-7. doi: 10.1007/978-3-319-94307-7_3.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597–1607. PMLR, 2020.
- SimCURL: Simple Contrastive User Representation Learning from Command Sequences. In Proceedings of 21st IEEE International Conference on Machine Learning and Applications (ICMLA), Bahamas, 2022. IEEE. doi: 10.1109/ICMLA55696.2022.00186.
- Graph neural networks for social recommendation. In The world wide web conference, pages 417–426, 2019.
- Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016a.
- Fusing similarity models with markov chains for sparse sequential recommendation. In 2016 IEEE 16th international conference on data mining (ICDM), pages 191–200. IEEE, 2016.
- Vista: A visually, socially, and temporally-aware model for artistic recommendation. In Proceedings of the 10th ACM conference on recommender systems, pages 309–316, 2016b.
- Recurrent neural networks with top-k gains for session-based recommendations. In Proceedings of the 27th ACM international conference on information and knowledge management, pages 843–852, 2018.
- Jacob Devlin Ming-Wei Chang Kenton and Lee Kristina Toutanova. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of NAACL-HLT, pages 4171–4186, 2019.
- Yehuda Koren. Collaborative filtering with temporal dynamics. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 447–456, 2009.
- Personalized news recommendation based on click behavior. In Proceedings of the 15th international conference on Intelligent user interfaces, pages 31–40, 2010.
- Context-aware sequential recommendation. In 2016 IEEE 16th International Conference on Data Mining (ICDM), pages 1053–1058. IEEE, 2016.
- Distributed Representations of Words and Phrases and their Compositionality. In Advances in Neural Information Processing Systems, volume 26. Curran Associates, Inc., 2013. URL https://papers.nips.cc/paper_files/paper/2013/hash/9aa42b31882ec039965f3c4923ce901b-Abstract.html.
- A Graph Embedding Approach to User Behavior Anomaly Detection. In 2021 IEEE International Conference on Big Data (Big Data), pages 2650–2655, December 2021. doi: 10.1109/BigData52589.2021.9671423.
- PinnerFormer: Sequence Modeling for User Representation at Pinterest, May 2022. URL http://arxiv.org/abs/2205.04507. arXiv:2205.04507 [cs].
- Practice on long sequential user behavior modeling for click-through rate prediction. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 2671–2679, 2019.
- Language models are unsupervised multitask learners. 2019.
- Lifelong sequential modeling with personalized memorization for user response prediction. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 565–574, 2019.
- Factorizing personalized markov chains for next-basket recommendation. In Proceedings of the 19th international conference on World wide web, pages 811–820, 2010.
- Statista. Snapchat daily active users 2023, aug 2023. URL https://www.statista.com/statistics/545967/snapchat-app-dau/.
- Learning interest-oriented universal user representation via self-supervision. In Proceedings of the 30th ACM International Conference on Multimedia, pages 7270–7278, 2022.
- Retentive network: A successor to transformer for large language models, 2023.
- Log2Intent: Towards Interpretable User Modeling via Recurrent Semantics Memory Unit. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD ’19, pages 1055–1063, New York, NY, USA, July 2019. Association for Computing Machinery. ISBN 978-1-4503-6201-6. doi: 10.1145/3292500.3330889. URL https://dl.acm.org/doi/10.1145/3292500.3330889.
- Attention is all you need. Advances in neural information processing systems, 30, 2017.
- Generalists and Specialists: Using Community Embeddings to Quantify Activity Diversity in Online Platforms. In The World Wide Web Conference, WWW ’19, pages 1954–1964, New York, NY, USA, May 2019. Association for Computing Machinery. ISBN 978-1-4503-6674-8. doi: 10.1145/3308558.3313729. URL https://dl.acm.org/doi/10.1145/3308558.3313729.
- PTUM: Pre-training User Model from Unlabeled User Behaviors via Self-supervision. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1939–1944, Online, November 2020. Association for Computational Linguistics. doi: 10.18653/v1/2020.findings-emnlp.174. URL https://aclanthology.org/2020.findings-emnlp.174.
- Personalizing Software and Web Services by Integrating Unstructured Application Usage Traces. In Proceedings of the 26th International Conference on World Wide Web Companion, WWW ’17 Companion, pages 485–493, Republic and Canton of Geneva, CHE, April 2017. International World Wide Web Conferences Steering Committee. ISBN 978-1-4503-4914-7. doi: 10.1145/3041021.3054183. URL https://dl.acm.org/doi/10.1145/3041021.3054183.
- General-Purpose User Embeddings based on Mobile App Usage. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD ’20, pages 2831–2840, New York, NY, USA, August 2020. Association for Computing Machinery. ISBN 978-1-4503-7998-4. doi: 10.1145/3394486.3403334. URL https://dl.acm.org/doi/10.1145/3394486.3403334.
- Joint deep modeling of users and items using reviews for recommendation. In Proceedings of the tenth ACM international conference on web search and data mining, pages 425–434, 2017.