
PlayMyData: a curated dataset of multi-platform video games (2401.08561v2)

Published 16 Jan 2024 in cs.SE

Abstract: Although video games have been predominant in digital entertainment for decades, the software engineering (SE) community has only recently recognized them as valuable software artifacts. This acknowledgment has opened several research opportunities, ranging from empirical studies to the application of AI techniques for classification tasks. Several curated game datasets have been released for research purposes, but the collected data are insufficient to support the application of advanced models or to enable interdisciplinary studies. Moreover, most of them are limited to PC games, thus excluding notable gaming platforms, e.g., PlayStation, Xbox, and Nintendo. In this paper, we propose PlayMyData, a curated dataset composed of 99,864 multi-platform games gathered from the IGDB website. By exploiting a dedicated API, we collect relevant metadata for each game, e.g., description, genre, rating, gameplay video URLs, and screenshots. Furthermore, we enrich PlayMyData with the time needed to complete each game by mining the HLTB website. To the best of our knowledge, this is the most comprehensive dataset in the domain that can be used to support different automated tasks in SE. More importantly, PlayMyData can be used to foster cross-domain investigations built on top of the provided multimedia data.

