MAGPIE: Multi-Task Media-Bias Analysis Generalization for Pre-Trained Identification of Expressions (2403.07910v2)

Published 27 Feb 2024 in cs.CY and cs.CL

Abstract: Media bias detection poses a complex, multifaceted problem traditionally tackled using single-task models and small in-domain datasets, consequently lacking generalizability. To address this, we introduce MAGPIE, the first large-scale multi-task pre-training approach explicitly tailored for media bias detection. To enable pre-training at scale, we present Large Bias Mixture (LBM), a compilation of 59 bias-related tasks. MAGPIE outperforms previous approaches in media bias detection on the Bias Annotation By Experts (BABE) dataset, with a relative improvement of 3.3% F1-score. MAGPIE also performs better than previous models on 5 out of 8 tasks in the Media Bias Identification Benchmark (MBIB). Using a RoBERTa encoder, MAGPIE needs only 15% of finetuning steps compared to single-task approaches. Our evaluation shows, for instance, that tasks like sentiment and emotionality boost all learning, all tasks enhance fake news detection, and scaling tasks leads to the best results. MAGPIE confirms that MTL is a promising approach for addressing media bias detection, enhancing the accuracy and efficiency of existing models. Furthermore, LBM is the first available resource collection focused on media bias MTL.
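The abstract reports a relative improvement of 3.3% in F1-score, which is not the same as a gain of 3.3 F1 points. The sketch below illustrates the difference. The function name `relative_improvement` and both F1 values are hypothetical, chosen only to make the arithmetic visible; they are not the scores reported in the paper.

```python
def relative_improvement(baseline: float, new: float) -> float:
    """Return the change of `new` over `baseline` as a fraction of `baseline`."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (new - baseline) / baseline


# Hypothetical F1 scores, used only to illustrate the arithmetic.
# They are NOT the values reported for MAGPIE or any baseline.
baseline_f1 = 0.800
new_f1 = 0.8264

rel = relative_improvement(baseline_f1, new_f1)
abs_points = (new_f1 - baseline_f1) * 100  # difference in F1 points

print(f"relative: {rel:.1%}, absolute: {abs_points:.2f} F1 points")
```

Under these assumed numbers, a 3.3% relative improvement corresponds to a smaller absolute gain in F1 points, because the relative figure is scaled by the baseline score.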
