Analysis of Framing Bias in LLM-Generated News Headlines
The paper "Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans?" by Valeria Pastorino and Nafise Sadat Moosavi presents an insightful examination of framing biases in news content generated by LLMs compared to human-written headlines. It investigates the propensity of LLMs to produce biased framing, particularly in politically and socially sensitive contexts, and emphasizes the need for more comprehensive evaluation frameworks to mitigate such biases in AI-generated text.
Core Findings and Analysis
Human vs. LLM Framing Bias
The research identifies a consistent trend: LLMs generate news headlines that exhibit framing more often than their human-written counterparts, and the gap widens for politically and socially charged content. Analyses across multiple LLM architectures support this observation, showing that larger models trained on more extensive pretraining data exhibit more pronounced framing, surpassing human baselines. This finding highlights biases that such models can introduce on their own, even without biased prompting, which poses challenges for maintaining neutrality in news reporting.
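A comparison of this kind reduces to comparing framed-headline rates between two pools and testing whether the difference is significant. The sketch below is illustrative rather than the authors' pipeline: the binary labels are toy stand-ins for the output of a framing classifier or human annotation.

```python
import math

def framing_rate(labels):
    """Fraction of headlines labeled framed (1) rather than neutral (0)."""
    return sum(labels) / len(labels)

def two_proportion_z(labels_a, labels_b):
    """Z-statistic for the difference between two framing rates."""
    n_a, n_b = len(labels_a), len(labels_b)
    p_pool = (sum(labels_a) + sum(labels_b)) / (n_a + n_b)
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    return (framing_rate(labels_a) - framing_rate(labels_b)) / se

# Toy binary labels (1 = framed), standing in for classifier output or
# human annotation; not the paper's data.
human_labels = [0, 0, 1, 0, 1, 0, 0, 1, 0, 0]
llm_labels = [1, 0, 1, 1, 1, 0, 1, 1, 0, 1]

print(f"human framing rate: {framing_rate(human_labels):.2f}")
print(f"LLM framing rate:   {framing_rate(llm_labels):.2f}")
print(f"z = {two_proportion_z(llm_labels, human_labels):.2f}")
```

At realistic sample sizes, a z-statistic well above 2 would indicate that the LLM pool's higher framing rate is unlikely to be noise.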
Impact of Model Architecture and Fine-Tuning
The paper examines how model scale and pretraining data shape framing behavior. Smaller models with more limited pretraining, such as T5, BART, and FLAN-T5, generally produced less framed content, whereas larger models such as GPT, LLaMA, Cohere, and Claude showed higher framing rates, suggesting that larger-scale pretraining may accentuate framing bias. Fine-tuning reduced these biases in some models, pointing to targeted post-training adjustment as a promising avenue for mitigation.
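To make the mitigation idea concrete, the sketch below fine-tunes a small seq2seq model on article-to-neutral-headline pairs with Hugging Face transformers. This is a minimal sketch under stated assumptions, not the authors' procedure: the model choice, hyperparameters, output path, and toy in-memory dataset are all placeholders for a curated corpus of neutral headlines.

```python
# A minimal sketch, assuming a supervised setup: adapt a small seq2seq
# model on article -> neutral-headline pairs. All names and settings here
# are illustrative assumptions, not the paper's configuration.
from datasets import Dataset
from transformers import (AutoModelForSeq2SeqLM, AutoTokenizer,
                          DataCollatorForSeq2Seq, Seq2SeqTrainer,
                          Seq2SeqTrainingArguments)

model_name = "t5-small"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

# Toy pair; a real run would use a curated corpus of neutral headlines.
pairs = {
    "article": ["headline: The city council voted 7-2 on Tuesday to approve "
                "next year's budget after a three-hour public hearing."],
    "headline": ["City council approves budget in 7-2 vote"],
}
ds = Dataset.from_dict(pairs)

def tokenize(batch):
    inputs = tokenizer(batch["article"], truncation=True, max_length=512)
    labels = tokenizer(text_target=batch["headline"], truncation=True,
                       max_length=32)
    inputs["labels"] = labels["input_ids"]
    return inputs

ds = ds.map(tokenize, batched=True, remove_columns=ds.column_names)

args = Seq2SeqTrainingArguments(
    output_dir="neutral-headline-t5",  # hypothetical output path
    per_device_train_batch_size=8,
    num_train_epochs=1,
    learning_rate=3e-4,
)
trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=ds,
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)
trainer.train()
```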
Framing Across Different Topics
The research examines framing tendencies across topics including Business, Economics, Health, Science, Political News, Conflicts, Crime, Historical Recognition, and Sports. Political news headlines appear most susceptible to LLM-introduced framing, corroborating previous media-framing literature. More surprisingly, factual domains such as Health and Science, where neutrality would be expected, also showed notable degrees of interpretive framing when handled by LLMs.
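Per-topic susceptibility can be summarized by grouping framing labels by topic and ranking the rates. The snippet below is a toy illustration with invented records; the topic names mirror the paper's categories, but the counts are not its data.

```python
from collections import defaultdict

# Toy (topic, is_framed) records standing in for framing-classifier output
# over generated headlines; counts are invented for illustration.
records = [
    ("Political News", 1), ("Political News", 1), ("Political News", 0),
    ("Health", 1), ("Health", 0), ("Science", 0), ("Science", 1),
    ("Sports", 0), ("Sports", 0), ("Sports", 1),
]

counts = defaultdict(lambda: [0, 0])  # topic -> [framed, total]
for topic, framed in records:
    counts[topic][0] += framed
    counts[topic][1] += 1

# Rank topics by framing rate, highest first.
for topic, (framed, total) in sorted(
        counts.items(), key=lambda kv: kv[1][0] / kv[1][1], reverse=True):
    print(f"{topic:15s} {framed / total:5.0%} framed ({framed}/{total})")
```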
Text Length and Framing
The authors also observe a correlation between text length and the presence of framing: longer texts tend to include more framing. This suggests that extended context gives LLMs more room for interpretive elements, making framing more likely to surface.
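With a binary framing label, this length relationship can be quantified as a point-biserial correlation, which is simply the Pearson correlation between the label and the length. A minimal sketch with invented numbers follows; it requires Python 3.10+ for statistics.correlation.

```python
from statistics import correlation  # available in Python 3.10+

# Invented (length, framed) pairs; with one binary variable, Pearson
# correlation equals the point-biserial correlation.
lengths = [6, 7, 8, 9, 10, 11, 12, 13, 14, 15]  # headline length in tokens
framed = [0, 0, 0, 0, 1, 0, 1, 1, 1, 1]         # 1 = framed

print(f"point-biserial r = {correlation(lengths, framed):.2f}")
```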
Implications and Future Directions
The findings underscore the importance of integrating framing bias assessments into standard LLM evaluation, especially as these models are increasingly used for automated content generation. They matter for building AI systems that are not only accurate and fluent but also fair and balanced in their reporting. Future research should therefore prioritize robust methodologies for detecting and mitigating framing bias across diverse domains.
Concluding Remarks
This paper contributes substantively to the discourse on AI-generated media content by empirically demonstrating the framing bias that LLMs introduce. It is a call to action for the research community to develop more nuanced evaluation frameworks that guard against bias and reinforce neutrality in automated news generation. As LLMs evolve, so should the methods for ensuring they meet the ethical standards expected of human journalism.