Words as Trigger Points in Social Media Discussions: A Large-Scale Case Study about UK Politics on Reddit (2405.10213v3)
Abstract: Political debates on social media sometimes flare up. From that moment on, users engage much more with one another; their communication is also more emotional and polarised. While it has been difficult to grasp such moments with computational methods, we suggest that trigger points are a useful concept to understand and ultimately model such behaviour. Established in qualitative focus group interviews to understand political polarisation (Mau, Lux, and Westheuser 2023), trigger points represent moments when individuals feel that their understanding of what is fair, normal, or appropriate in society is questioned. In the original studies, individuals show strong and negative emotional responses when certain triggering words or topics are mentioned. Our paper finds that these trigger points also exist in online debates. We examine online deliberations on Reddit between 2020 and 2022 and collect >100 million comments from subreddits related to a set of words identified as trigger points in UK politics. Analysing the comments, we find that trigger words increase user engagement and animosity, i.e., more negativity, hate speech, and controversial comments. Introducing trigger points to computational studies of online communication, our findings are relevant to researchers interested in affective computing, online deliberation, and how citizens debate politics and society in light of affective polarisation.
- Mostly Harmless Econometrics: An Empiricist’s Companion. Princeton university press.
- Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation. In The 7th Workshop on Online Abuse and Harms (WOAH), 231–242.
- SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research. In Bouamor, H.; Pino, J.; and Bali, K., eds., Findings of the Association for Computational Linguistics: EMNLP 2023, 12590–12607. Singapore: Association for Computational Linguistics.
- On the causes of Brexit. European Journal of Political Economy, 55: 301–323.
- Online misogyny. Journal of International Affairs, 72(2): 95–114.
- SemEval-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in Twitter. In Proceedings of the 13th International Workshop on Semantic Evaluation, 54–63. Minneapolis, Minnesota, USA: Association for Computational Linguistics.
- The pushshift reddit dataset. In Proceedings of the international AAAI conference on web and social media, volume 14, 830–839.
- Language (Technology) is Power: A Critical Survey of “Bias” in NLP. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 5454–5476. Online: Association for Computational Linguistics.
- Enriching Word Vectors with Subword Information. Transactions of the Association for Computational Linguistics, 5: 135–146.
- Dogwhistles as Inferences in Interaction. In Proceedings of the Reasoning and Interaction Conference (ReInAct 2021), 40–46. Gothenburg, Sweden: Association for Computational Linguistics.
- Brexit and bots: characterizing the behaviour of automated accounts on Twitter during the UK election. EPJ Data Science, 11(1): 17.
- TweetNLP: Cutting-Edge Natural Language Processing for Social Media. In Proceedings of the The 2022 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, 38–49.
- Minimum Wages and Employment: A Case Study of the Fast-Food Industry in New Jersey and Pennsylvania. The American Economic Review, 84(4): 772–793.
- Would your tweet invoke hate on the fly? forecasting hate intensity of reply threads on twitter. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, 2732–2742.
- Automated hate speech detection and the problem of offensive language. In Proceedings of the international AAAI conference on web and social media, volume 11, 512–515.
- I Beg to Differ: A study of constructive disagreement in online conversations. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, 2017–2027. Online: Association for Computational Linguistics.
- How to disagree well: Investigating the dispute tactics used on Wikipedia. In Goldberg, Y.; Kozareva, Z.; and Zhang, Y., eds., Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 3824–3837. Abu Dhabi, United Arab Emirates: Association for Computational Linguistics.
- Latent Hatred: A Benchmark for Understanding Implicit Hate Speech. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 345–363. Online and Punta Cana, Dominican Republic: Association for Computational Linguistics.
- Exploring misogyny across the manosphere in reddit. In Proceedings of the 10th ACM conference on web science, 87–96.
- Online misinformation and vaccine hesitancy. Translational behavioral medicine, 11(12): 2194–2199.
- Public attitudes to the NHS. London: The Health Foundation.
- Analyzing the traits and anomalies of political discussions on reddit. In Proceedings of the International AAAI Conference on Web and Social Media, volume 13, 205–213.
- ANTi-Vax: a novel Twitter dataset for COVID-19 vaccine misinformation detection. Public health, 203: 23–30.
- Höller, M. 2021. The human component in social media and fake news: the performance of UK opinion leaders on Twitter during the Brexit campaign. European Journal of English Studies, 25(1): 80–95.
- Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692.
- TimeLMs: Diachronic Language Models from Twitter. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, 251–260. Dublin, Ireland: Association for Computational Linguistics.
- Quantifying gender biases towards politicians on Reddit. PloS one, 17(10): e0274317.
- Triggerpunkte: Konsens und Konflikt in der Gegenwartsgesellschaft. (No Title).
- Public sentiment analysis and topic modeling regarding COVID-19 vaccines on the Reddit social media platform: A call to action for strengthening vaccine confidence. Journal of Infection and Public Health, 14(10): 1505–1512.
- Semeval-2018 task 1: Affect in tweets. In Proceedings of the 12th international workshop on semantic evaluation, 1–17.
- The ethics of COVID-19 vaccine distribution. Journal of Public Health Policy, 42(3): 514–517.
- An In-depth Analysis of Implicit and Subtle Hate Speech Messages. In Vlachos, A.; and Augenstein, I., eds., Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 1997–2013. Dubrovnik, Croatia: Association for Computational Linguistics.
- Social data: Biases, methodological pitfalls, and ethical boundaries. Frontiers in big data, 2: 13.
- Multilingual and Multi-Aspect Hate Speech Analysis. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 4675–4684. Hong Kong, China: Association for Computational Linguistics.
- Comparative Evaluation of Label-Agnostic Selection Bias in Multilingual Hate Speech Datasets. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2532–2542. Online: Association for Computational Linguistics.
- The chilling: A global study of online violence against women journalists.
- Where should one get news updates: Twitter or Reddit. Online Social Networks and Media, 9: 17–29.
- Assessing the extent and types of hate speech in fringe communities: A case study of alt-right communities on 8chan, 4chan, and Reddit. Social Media+ Society, 7(4): 20563051211052906.
- SemEval-2017 task 4: Sentiment analysis in Twitter. arXiv preprint arXiv:1912.00741.
- Social IQa: Commonsense Reasoning about Social Interactions. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 4463–4473. Hong Kong, China: Association for Computational Linguistics.
- Hateful Symbols or Hateful People? Predictive Features for Hate Speech Detection on Twitter. In North American Chapter of the Association for Computational Linguistics.
- The Economist. 2023. How a Rwandan gambit consumed the Conservative Party. https://www.economist.com/britain/2023/12/13/how-a-rwandan-gambit-consumed-the-conservative-party. Accessed: 2024-4-13.
- The Guardian. 2022. Decade of neglect means NHS unable to tackle care backlog, report says. https://www.theguardian.com/society/2022/dec/12/decade-of-neglect-means-nhs-unable-to-tackle-care-backlog-report-says. Accessed: 2024-04-22.
- The Independent. 2024. Jeremy Hunt Warned of £2bn Real-Term Cuts to NHS Funding. Accessed: 2024-04-22.
- Vaccine hesitancy in the era of COVID-19. Public health, 194: 245–251.
- Do Differences in Values Influence Disagreements in Online Discussions? In Bouamor, H.; Pino, J.; and Bali, K., eds., Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 15986–16008. Singapore: Association for Computational Linguistics.
- British public opinion on Brexit: Controversies and contradictions. European Political Science, 18: 134–142.
- Vaccine discourse in white nationalist online communication: A mixed-methods computational approach. Social Science & Medicine, 298: 114859.
- Detection of Abusive Language: the Problem of Biased Datasets. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 602–608. Minneapolis, Minnesota: Association for Computational Linguistics.