
Fine-Tuning Pre-trained Language Models to Detect In-Game Trash Talks (2403.15458v1)

Published 19 Mar 2024 in cs.CL and cs.LG

Abstract: Common problems in online mobile and computer gaming relate to toxic behavior and abusive communication among players. Drawing on prior reports and studies, this work also discusses the impact of online hate speech and toxicity on players' in-game performance and overall well-being. The study investigates the capability of pre-trained LLMs to classify or detect trash talk and toxic in-game messages, employing and evaluating pre-trained BERT and GPT models for detecting toxicity within in-game chats. Using publicly available APIs, in-game chat data from DOTA 2 matches were collected, processed, reviewed, and labeled as non-toxic, mild, or toxic. Around two thousand in-game chats were gathered to train and test BERT (Base-uncased), BERT (Large-uncased), and GPT-3 models. Based on the three models' state-of-the-art performance, the study concludes that pre-trained LLMs hold promising potential for addressing online hate speech and insulting in-game trash talk.
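
The abstract names BERT (Base-uncased), BERT (Large-uncased), and GPT-3 as the fine-tuned models. As a minimal sketch of the BERT side of that setup, the snippet below fine-tunes `bert-base-uncased` for the paper's three classes (non-toxic, mild, toxic) with Hugging Face Transformers; the toy chat lines, max sequence length, and training hyperparameters are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch (not the authors' code): 3-class toxicity classification
# for short in-game chat messages, using Hugging Face Transformers.
from transformers import (AutoTokenizer, AutoModelForSequenceClassification,
                          Trainer, TrainingArguments)
from datasets import Dataset

# Hypothetical labeled chat lines; the paper's DOTA 2 dataset is not shown here.
train = Dataset.from_dict({
    "text": ["gg wp", "you are so bad", "uninstall the game idiot"],
    "label": [0, 1, 2],  # 0 = non-toxic, 1 = mild, 2 = toxic
})

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=3)

def tokenize(batch):
    # In-game chats are short; a 64-token cap is an assumption, not the paper's.
    return tokenizer(batch["text"], truncation=True,
                     padding="max_length", max_length=64)

train = train.map(tokenize, batched=True)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="toxicity-bert",
                           num_train_epochs=3,          # assumed value
                           per_device_train_batch_size=16),
    train_dataset=train,
)
trainer.train()
```

At inference time the fine-tuned model's argmax over the three logits yields the predicted toxicity class for a new chat line; the label-to-index mapping above is one reasonable encoding of the paper's three categories.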

Authors (4)
  1. Daniel Fesalbon (2 papers)
  2. Arvin De La Cruz (1 paper)
  3. Marvin Mallari (1 paper)
  4. Nelson Rodelas (4 papers)