player2vec: A Language Modeling Approach to Understand Player Behavior in Games (2404.04234v3)
Abstract: Methods for learning latent user representations from historical behavior logs have gained traction for recommendation tasks in e-commerce, content streaming, and other settings. However, this area remains relatively underexplored in video and mobile gaming contexts. In this work, we present a novel method for overcoming this limitation by extending a long-range Transformer model from the natural language processing domain to player behavior data. We discuss specifics of behavior tracking in games and propose preprocessing and tokenization approaches that treat in-game events analogously to words in sentences, thus enabling player representations to be learned in a self-supervised manner in the absence of ground-truth annotations. We experimentally demonstrate the efficacy of the proposed approach in fitting the distribution of behavior events by evaluating intrinsic language modeling metrics. Furthermore, we qualitatively analyze the emerging structure of the learned embedding space and show its value for generating insights into behavior patterns to inform downstream applications.
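To make the "events as words" analogy concrete, the following is a minimal sketch of how a player's event log might be turned into a fixed-length sequence of token ids, the form a Transformer model consumes. It is not the authors' preprocessing pipeline: the event names, the special tokens `[PAD]` and `[UNK]`, and the helpers `build_vocab` and `encode` are all hypothetical and chosen only for illustration.

```python
from collections import Counter

# Hypothetical event logs: one ordered list of event names per player.
# These names are illustrative assumptions, not events from the paper.
player_logs = {
    "player_a": ["login", "start_level", "fail_level",
                 "start_level", "complete_level", "logout"],
    "player_b": ["login", "open_shop", "purchase", "logout"],
}

# Assumed special tokens: padding and out-of-vocabulary events.
SPECIAL_TOKENS = ["[PAD]", "[UNK]"]


def build_vocab(logs, min_count=1):
    """Map each event name seen at least min_count times to an integer id."""
    counts = Counter(event for events in logs.values() for event in events)
    kept = sorted(e for e, c in counts.items() if c >= min_count)
    return {token: i for i, token in enumerate(SPECIAL_TOKENS + kept)}


def encode(events, vocab, max_len):
    """Convert an event sequence to ids, truncated or padded to max_len."""
    ids = [vocab.get(e, vocab["[UNK]"]) for e in events][:max_len]
    return ids + [vocab["[PAD]"]] * (max_len - len(ids))


vocab = build_vocab(player_logs)
encoded = {player: encode(events, vocab, max_len=8)
           for player, events in player_logs.items()}
```

In this sketch each player's history plays the role of a sentence and each event the role of a word, so a self-supervised objective from language modeling could in principle be applied to `encoded` without any labels.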