
MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models (2408.11871v2)

Published 19 Aug 2024 in cs.CL and cs.AI

Abstract: The advent of LLMs has revolutionized online content creation, making it much easier to generate high-quality fake news. This misuse threatens the integrity of our digital environment and ethical standards. Therefore, understanding the motivations and mechanisms behind LLM-generated fake news is crucial. In this study, we analyze the creation of fake news from a social psychology perspective and develop a comprehensive LLM-based theoretical framework, LLM-Fake Theory. We introduce a novel pipeline that automates the generation of fake news using LLMs, thereby eliminating the need for manual annotation. Utilizing this pipeline, we create a theoretically informed Machine-generated Fake news dataset, MegaFake, derived from the GossipCop dataset. We conduct comprehensive analyses to evaluate our MegaFake dataset. We believe that our dataset and insights will provide valuable contributions to future research focused on the detection and governance of fake news in the era of LLMs.

Authors (8)
  1. Lionel Z. Wang
  2. Yiming Ma
  3. Renfei Gao
  4. Beichen Guo
  5. Han Zhu
  6. Wenqi Fan
  7. Zexin Lu
  8. Ka Chung Ng
Citations (1)