Voting Booklet Bias: Stance Detection in Swiss Federal Communication (2306.08999v1)

Published 15 Jun 2023 in cs.CL and cs.AI

Abstract: In this study, we use recent stance detection methods to study the stance (for, against or neutral) of statements in official information booklets for voters. Our main goal is to answer a fundamental question: are topics to be voted on presented in a neutral way? To this end, we first train and compare several models for stance detection on a large dataset about Swiss politics. We find that fine-tuning an M-BERT model leads to the best accuracy. We then use our best model to analyze the stance of utterances extracted from the Swiss federal voting booklet concerning the Swiss popular votes of September 2022. We evaluate the models in both multilingual and monolingual contexts for German, French, and Italian. Our analysis shows that some issues are heavily favored while others are more balanced, and that the results are largely consistent across languages. Our findings have implications for the editorial process of future voting booklets and the design of better automated systems for analyzing political discourse. The data and code accompanying this paper are available at https://github.com/ZurichNLP/voting-booklet-bias.
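
The distinction between "heavily favored" and "more balanced" issues can be illustrated with a small sketch. It is not taken from the paper or its repository: the `stance_balance` helper, the issue names, the score definition (share of "for" minus share of "against"), and the example rows are all hypothetical, invented only to show one way predicted stance labels might be aggregated per issue.

```python
from collections import Counter

# Hypothetical predictions: (issue, language, predicted stance label).
# These rows are invented for illustration; they are not results from the paper.
predictions = [
    ("issue_a", "de", "for"),
    ("issue_a", "de", "for"),
    ("issue_a", "fr", "for"),
    ("issue_a", "fr", "against"),
    ("issue_b", "de", "for"),
    ("issue_b", "de", "against"),
    ("issue_b", "fr", "neutral"),
    ("issue_b", "fr", "against"),
]


def stance_balance(rows):
    """Return, per issue, (count of 'for' - count of 'against') / total.

    The score lies in [-1, 1]; values near 0 suggest a balanced presentation
    under this (assumed) definition.
    """
    counts = {}
    for issue, _lang, label in rows:
        counts.setdefault(issue, Counter())[label] += 1
    return {
        issue: (c["for"] - c["against"]) / sum(c.values())
        for issue, c in counts.items()
    }


balance = stance_balance(predictions)
for issue, score in sorted(balance.items()):
    print(f"{issue}: {score:+.2f}")
```

Grouping the same rows by language instead of by issue would give a comparable per-language score, which is one possible way to check cross-language consistency.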

