Topic-Preserving Synthetic News Generation: An Adversarial Deep Reinforcement Learning Approach (2010.16324v1)

Published 30 Oct 2020 in cs.CL and cs.LG

Abstract: Nowadays, there exist powerful LLMs such as OpenAI's GPT-2 that can generate readable text and can be fine-tuned to generate text for a specific domain. Considering GPT-2, it cannot directly generate synthetic news with respect to a given topic and the output of the LLM cannot be explicitly controlled. In this paper, we study the novel problem of topic-preserving synthetic news generation. We propose a novel deep reinforcement learning-based method to control the output of GPT-2 with respect to a given news topic. When generating text using GPT-2, by default, the most probable word is selected from the vocabulary. Instead of selecting the best word each time from GPT-2's output, an RL agent tries to select words that optimize the matching of a given topic. In addition, using a fake news detector as an adversary, we investigate generating realistic news using our proposed method. In this paper, we consider realistic news as news that cannot be easily detected by a fake news classifier. Experimental results demonstrate the effectiveness of the proposed framework on generating topic-preserving news content than state-of-the-art baselines.

PDF Abstract

Summarize Bookmark Chat (Pro)

Authors (3)

Ahmadreza Mosallanezhad (10 papers)
Kai Shu (88 papers)
Huan Liu (283 papers)

Citations (9)

View on Semantic Scholar

Topic-Preserving Synthetic News Generation: An Adversarial Deep Reinforcement Learning Approach (2010.16324v1)

Related Papers