Machine-Made Media: Monitoring the Mobilization of Machine-Generated Articles on Misinformation and Mainstream News Websites (2305.09820v5)

Published 16 May 2023 in cs.CY, cs.LG, and cs.SI

Abstract: As LLMs like ChatGPT have gained traction, an increasing number of news websites have begun utilizing them to generate articles. However, not only can these LLMs produce factually inaccurate articles on reputable websites but disreputable news sites can utilize LLMs to mass produce misinformation. To begin to understand this phenomenon, we present one of the first large-scale studies of the prevalence of synthetic articles within online news media. To do this, we train a DeBERTa-based synthetic news detector and classify over 15.46 million articles from 3,074 misinformation and mainstream news websites. We find that between January 1, 2022, and May 1, 2023, the relative number of synthetic news articles increased by 57.3% on mainstream websites while increasing by 474% on misinformation sites. We find that this increase is largely driven by smaller less popular websites. Analyzing the impact of the release of ChatGPT using an interrupted-time-series, we show that while its release resulted in a marked increase in synthetic articles on small sites as well as misinformation news websites, there was not a corresponding increase on large mainstream news websites.

References (52)

Citations (25)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

GitHub - hanshanley/machine-made-media: Repository for the paper Machine Made Media https://www.hanshanley.com/files/machine_made.pdf (7 stars)

Tweets

https://twitter.com/Hans_Hanley/status/1744527600532226459

https://twitter.com/unbabeled/status/1845213714607136782

https://twitter.com/WGOV/status/1770818942405120451

Machine-Made Media: Monitoring the Mobilization of Machine-Generated Articles on Misinformation and Mainstream News Websites (2305.09820v5)

Summary

Related Papers

GitHub

Tweets