
Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation (2310.14892v3)

Published 23 Oct 2023 in cs.CL

Abstract: Controllable text generation (CTG) aims to generate text with desired attributes, and decoding-time-based methods have shown promising performance on this task. However, in this paper, we identify the phenomenon of Attribute Collapse for the first time. It causes the fluency of generated text to decrease rapidly when the control strength exceeds a critical value, rendering the text completely unusable. This limitation hinders the effectiveness of decoding methods in achieving high levels of controllability. To address this problem, we propose a novel lightweight decoding framework named Air-Decoding. Its main idea is to reconstruct the attribute distributions so that the weights of attribute words and non-attribute words are balanced, which yields more fluent text. Specifically, we train prefixes by prefix-tuning to obtain attribute distributions. Then we design a novel attribute distribution reconstruction method to balance the obtained distributions and use the reconstructed distributions to guide LLMs for generation, effectively avoiding the issue of Attribute Collapse. Experiments on multiple CTG tasks show that our method achieves new state-of-the-art control performance.
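
The collapse described in the abstract can be illustrated with a toy example. The sketch below is not the paper's method: it assumes a generic weighted-decoding rule, p(x) ∝ p_LM(x) · p_attr(x)^strength, and a hypothetical log-compression transform (`compress`) standing in for the reconstruction step, whose actual form the abstract does not specify. The vocabulary, probabilities, and function names are all invented for illustration.

```python
import math

def normalize(weights):
    """Scale non-negative weights so they sum to 1."""
    total = sum(weights)
    return [w / total for w in weights]

def guided_distribution(lm_probs, attr_probs, strength):
    """Generic weighted decoding (assumed rule, not taken from the paper):
    p(x) is proportional to p_LM(x) * p_attr(x) ** strength."""
    return normalize([p * (a ** strength) for p, a in zip(lm_probs, attr_probs)])

def compress(attr_probs):
    """Hypothetical stand-in for a reconstruction step: map each probability a
    to 1 / (1 - ln a), which keeps the ordering of the words but narrows the
    gap between attribute and non-attribute words, then renormalize."""
    return normalize([1.0 / (1.0 - math.log(a)) for a in attr_probs])

# Toy next-token distributions over a four-word vocabulary (made-up numbers).
vocab = ["the", "movie", "was", "great"]
lm_probs = [0.40, 0.30, 0.25, 0.05]    # fluent language model prefers function words
attr_probs = [0.05, 0.05, 0.05, 0.85]  # attribute model concentrates on "great"

strength = 4.0
raw = guided_distribution(lm_probs, attr_probs, strength)
balanced = guided_distribution(lm_probs, compress(attr_probs), strength)

for word, r, b in zip(vocab, raw, balanced):
    print(f"{word:>6}  raw={r:.4f}  balanced={b:.4f}")
```

With the raw attribute distribution, almost all probability mass lands on the single attribute word at this control strength, so the language model's preferences are effectively ignored. With the compressed distribution, the attribute word is still favored but the other words keep a non-negligible share, which is the kind of balance the abstract attributes to the reconstructed distributions.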

Authors (5)
  1. Tianqi Zhong (3 papers)
  2. Quan Wang (130 papers)
  3. Jingxuan Han (1 paper)
  4. Yongdong Zhang (119 papers)
  5. Zhendong Mao (55 papers)
Citations (8)