
Can GPT-4 Identify Propaganda? Annotation and Detection of Propaganda Spans in News Articles (2402.17478v1)

Published 27 Feb 2024 in cs.CL

Abstract: The use of propaganda has spiked on mainstream and social media, aiming to manipulate or mislead users. While efforts to automatically detect propaganda techniques in textual, visual, or multimodal content have increased, most of them primarily focus on English content. The majority of the recent initiatives targeting medium to low-resource languages produced relatively small annotated datasets, with a skewed distribution, posing challenges for the development of sophisticated propaganda detection models. To address this challenge, we carefully develop the largest propaganda dataset to date, ArPro, comprised of 8K paragraphs from newspaper articles, labeled at the text span level following a taxonomy of 23 propagandistic techniques. Furthermore, our work offers the first attempt to understand the performance of LLMs, using GPT-4, for fine-grained propaganda detection from text. Results showed that GPT-4's performance degrades as the task moves from simply classifying a paragraph as propagandistic or not, to the fine-grained task of detecting propaganda techniques and their manifestation in text. Compared to models fine-tuned on the dataset for propaganda detection at different classification granularities, GPT-4 is still far behind. Finally, we evaluate GPT-4 on a dataset consisting of six other languages for span detection, and results suggest that the model struggles with the task across languages. Our dataset and resources will be released to the community.

Authors (3)
  1. Maram Hasanain (24 papers)
  2. Fatema Ahmed (4 papers)
  3. Firoj Alam (75 papers)
Citations (20)
