PlayMyData: a curated dataset of multi-platform video games (2401.08561v2)
Abstract: Being predominant in digital entertainment for decades, video games have been recognized as valuable software artifacts by the software engineering (SE) community just recently. Such an acknowledgment has unveiled several research opportunities, spanning from empirical studies to the application of AI techniques for classification tasks. In this respect, several curated game datasets have been disclosed for research purposes even though the collected data are insufficient to support the application of advanced models or to enable interdisciplinary studies. Moreover, the majority of those are limited to PC games, thus excluding notorious gaming platforms, e.g., PlayStation, Xbox, and Nintendo. In this paper, we propose PlayMyData, a curated dataset composed of 99,864 multi-platform games gathered by IGDB website. By exploiting a dedicated API, we collect relevant metadata for each game, e.g., description, genre, rating, gameplay video URLs, and screenshots. Furthermore, we enrich PlayMyData with the timing needed to complete each game by mining the HLTB website. To the best of our knowledge, this is the most comprehensive dataset in the domain that can be used to support different automated tasks in SE. More importantly, PlayMyData can be used to foster cross-domain investigations built on top of the provided multimedia data.
- 2023. Video Game Reviews, Articles, Trailers and more. https://www.metacritic.com/game/ [last accessed on 2023-12-06].
- Sentiment analysis on E-sports for education curriculum using naive Bayes and support vector machine. Jurnal Ilmu Komputer dan Informasi 13, 2 (2020), 109–122.
- A machine-learning item recommendation system for video games. In 2018 IEEE Conference on Computational Intelligence and Games (CIG). IEEE, 1–4.
- Games and Software Engineering: Engineering Fun, Inspiration, and Motivation. SIGSOFT Softw. Eng. Notes 48, 1 (jan 2023), 85–89. https://doi.org/10.1145/3573074.3573096
- Dalila Forni. 2020. Horizon Zero Dawn: The educational influence of video games in counteracting gender stereotypes. Transactions of the Digital Games Research Association 5, 1 (2020).
- The relationship between videogame micro-transactions and problem gaming and gambling: A systematic review. Computers in Human Behavior 131 (2022), 107219.
- The benefits of playing video games. American psychologist 69, 1 (2014), 66.
- Mark D Griffiths and Alex Meredith. 2009. Videogame addiction and its treatment. Journal of Contemporary Psychotherapy 39 (2009), 247–253.
- Using gameplay videos for detecting issues in video games. Empirical Software Engineering 28, 6 (2023), 136.
- Your Favorite Gameplay Speaks Volumes About You: Predicting User Behavior and Hexad Type. In HCI in Games (Lecture Notes in Computer Science), Xiaowen Fang (Ed.). Springer Nature Switzerland, Cham, 210–228. https://doi.org/10.1007/978-3-031-35979-8_17
- Automatic Classification of Games using Support Vector Machine. CoRR abs/2105.05674 (2021). arXiv:2105.05674 https://arxiv.org/abs/2105.05674
- igdb API. 2023. Getting Started – IGDB API docs. https://api-docs.igdb.com/#getting-started
- Yuhang Jiang and Lukun Zheng. 2020. Deep learning for video game genre classification. arXiv:2011.12143 [cs] (Nov. 2020). http://arxiv.org/abs/2011.12143 00001 arXiv: 2011.12143.
- ViGGO: A video game corpus for data-to-text generation in open-domain conversation. arXiv preprint arXiv:1910.12129 (2019).
- ImageNet Classification with Deep Convolutional Neural Networks. Commun. ACM 60, 6 (may 2017), 84–90. https://doi.org/10.1145/3065386
- Does playing violent video games cause aggression? A longitudinal intervention study. Molecular psychiatry 24, 8 (2019), 1220–1234.
- Tianrui Liu. 2022. RecommenderPlus: New Content-based User-centered Game Recommendation System. In 2022 3rd International Conference on Computer Vision, Image and Deep Learning & International Conference on Computer Engineering and Applications (CVIDL & ICCEA). 767–770. https://doi.org/10.1109/CVIDLICCEA56201.2022.9825356
- “Why are you playing games? You are a girl!”: Exploring gender biases in Esports. In Proceedings of the 2021 CHI conference on human factors in computing systems. 1–15.
- The arousal video game annotation (AGAIN) dataset. IEEE Transactions on Affective Computing 13, 4 (2022), 2171–2184.
- Michele. 2023. HowLongToBeat Python API. https://github.com/ScrappyCocco/HowLongToBeat-PythonAPI original-date: 2018-12-28T22:50:59Z.
- Video Game Bad Smells: What They Are and How Developers Perceive Them. ACM Trans. Softw. Eng. Methodol. 32, 4, Article 88 (may 2023), 35 pages. https://doi.org/10.1145/3563214
- Gonzalo Navarro. 2001. A guided tour to approximate string matching. Comput. Surveys 33, 1 (2001), 31–88. https://doi.org/10.1145/375360.375365
- How Is Video Game Development Different from Software Development in Open Source?. In 2018 IEEE/ACM 15th International Conference on Mining Software Repositories (MSR). 392–402. ISSN: 2574-3864.
- Generating and Personalizing Bundle Recommendations on Steam. In Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’17). Association for Computing Machinery, New York, NY, USA, 1073–1076. https://doi.org/10.1145/3077136.3080724
- PlayMyData. 2023. riccardoRubei/MSR2024-Data-Showcase: Repository for MSR2024 Data Showcase. https://github.com/riccardoRubei/MSR2024-Data-Showcase
- PlaystationStore. 2023. Latest | Official PlayStation™Store. https://store.playstation.com/ [last accessed on 2023-12-06].
- Dataset of video game development problems. In Proceedings of the 17th International Conference on Mining Software Repositories. 553–557.
- The impact of video games on the players behaviors: A survey. Procedia Computer Science 151 (2019), 575–582.
- Let’s play for action: Recognizing activities of daily living by learning from life simulation video games. In 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 8563–8569.
- Riccardo Rubei and Claudio Di Sipio. 2021. AURYGA: A Recommender System for Game Tagging.. In IIR.
- Karen Simonyan and Andrew Zisserman. 2015. Very Deep Convolutional Networks for Large-Scale Image Recognition. arXiv:1409.1556 [cs.CV]
- Steam. 2023. Steam Store. https://store.steampowered.com/ [last accessed on 2023-12-06].
- Sentiment analysis of player chat messaging in the video game StarCraft 2: Extending a lexicon-based model. Knowledge-Based Systems 137 (2017), 149–162.
- Richard TA Wood. 2008. Problems with the concept of video game “addiction”: Some case study examples. International journal of mental health and addiction 6 (2008), 169–178.
- A Classification of Video Games based on Game Characteristics linked to Video Coding Complexity. In 2018 16th Annual Workshop on Network and Systems Support for Games (NetGames). 1–6. https://doi.org/10.1109/NetGames.2018.8463434