Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Multi-faceted Semi-Synthetic Dataset for Automated Cyberbullying Detection (2402.10231v1)

Published 9 Feb 2024 in cs.SI, cs.LG, and cs.CL

Abstract: In recent years, the rising use of social media has propelled automated cyberbullying detection into a prominent research domain. However, challenges persist due to the absence of a standardized definition and universally accepted datasets. Many researchers now view cyberbullying as a facet of cyberaggression, encompassing factors like repetition, peer relationships, and harmful intent in addition to online aggression. Acquiring comprehensive data reflective of all cyberbullying components from social media networks proves to be a complex task. This paper provides a description of an extensive semi-synthetic cyberbullying dataset that incorporates all of the essential aspects of cyberbullying, including aggression, repetition, peer relationships, and intent to harm. The method of creating the dataset is succinctly outlined, and a detailed overview of the publicly accessible dataset is additionally presented. This accompanying data article provides an in-depth look at the dataset, increasing transparency and enabling replication. It also aids in a deeper understanding of the data, supporting broader research use.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (10)
  1. Session-based cyberbullying detection in social media: A survey. Online Social Networks and Media, 36:100250, 2023.
  2. N Darawsheh. The impact of cyber bullying on the psychological well-being of university students: A study in jordanian universities. Information Sciences Letters, 12(8):2757–2768, 2023.
  3. Automatic cyberbullying detection: A systematic review. Computers in Human Behavior, 93:333–345, 2019. doi:10.1016/j.chb.2018.12.021.
  4. When the timeline meets the pipeline: A survey on automated cyberbullying detection. IEEE Access, 9:103541–103563, 2021. doi:10.1109/ACCESS.2021.3098979.
  5. Cyber-aggression, cyberbullying, and cyber-grooming. ACM Computing Surveys, 54(1), 2021. doi:10.1145/3424246.
  6. Peter K. Smith. Cyberbullying and cyber aggression. In Handbook of School Violence and School Safety: International Research and Practice, volume 2, pages 93–103. 2012.
  7. Dan Olweus. What is bullying. Encyclopedia of psychology, 1:487–489, 2000.
  8. Towards comprehensive cyberbullying detection: A dataset incorporating aggressive texts, repetition, peerness, and intent to harm. Computers in Human Behavior, 153:108123, 2024. doi:10.1016/j.chb.2023.108123. URL https://doi.org/10.1016/j.chb.2023.108123.
  9. F. Elsafoury. Cyberbullying datasets. Kaggle, 2020. URL https://www.kaggle.com/datasets/saurabhshahane/cyberbullying-dataset. [Online].
  10. Aggression-annotated corpus of hindi-english code-mixed data. In N. et al. Calzolari, editor, Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), pages ISBN: 979–10–95546–00–9, Miyazaki, Japan, 2018. European Language Resources Association (ELRA).
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Naveed Ejaz (3 papers)
  2. Fakhra Kashif (1 paper)
  3. Salimur Choudhury (11 papers)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com