
ARAML: A Stable Adversarial Training Framework for Text Generation (1908.07195v1)

Published 20 Aug 2019 in cs.CL and cs.LG

Abstract: Most existing generative adversarial networks (GANs) for text generation suffer from the instability of reinforcement learning training algorithms such as policy gradient, leading to unstable performance. To tackle this problem, we propose a novel framework called Adversarial Reward Augmented Maximum Likelihood (ARAML). During adversarial training, the discriminator assigns rewards to samples acquired from a stationary distribution near the data rather than from the generator's distribution. The generator is optimized with maximum likelihood estimation augmented by the discriminator's rewards instead of policy gradient. Experiments show that our model can outperform state-of-the-art text GANs with a more stable training process.

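The core training step described in the abstract, reward-weighted maximum likelihood over samples drawn near the data instead of a policy-gradient update, can be sketched in a few lines of PyTorch. The snippet below is a minimal illustration under assumptions, not the authors' implementation: `perturb`, the scalar-valued `discriminator(s)`, and `generator.neg_log_likelihood` are hypothetical interfaces, and random token replacement stands in for whatever stationary sampling distribution the paper actually uses.

```python
import torch
import torch.nn.functional as F

def perturb(reference_ids: torch.Tensor, vocab_size: int, num_edits: int = 2) -> torch.Tensor:
    """Draw a sample from a stationary distribution near the data by
    randomly replacing a few tokens of the ground-truth sequence
    (a hypothetical stand-in for the paper's sampling scheme)."""
    sample = reference_ids.clone()
    positions = torch.randint(0, sample.size(0), (num_edits,))
    sample[positions] = torch.randint(0, vocab_size, (num_edits,))
    return sample

def araml_generator_loss(generator, discriminator, reference_ids,
                         vocab_size, num_samples=4, temperature=1.0):
    """Reward-augmented MLE step: weight each sample's negative
    log-likelihood by its softmax-normalized discriminator reward,
    so no policy-gradient estimator is needed."""
    samples = [perturb(reference_ids, vocab_size) for _ in range(num_samples)]
    # Discriminator scores each sample; higher means closer to real data.
    rewards = torch.stack([discriminator(s) for s in samples])
    # Exponentiated-reward weights (RAML-style), detached so the
    # generator update does not backpropagate into the discriminator.
    weights = F.softmax(rewards / temperature, dim=0).detach()
    nll = torch.stack([generator.neg_log_likelihood(s) for s in samples])
    return (weights * nll).sum()
```

Because the samples come from a fixed distribution around the data rather than from the generator's own output, the weighted-MLE gradient avoids the high variance that destabilizes policy-gradient training in typical text GANs.
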
Authors (4)
  1. Pei Ke (37 papers)
  2. Fei Huang (408 papers)
  3. Minlie Huang (225 papers)
  4. Xiaoyan Zhu (54 papers)
Citations (22)