
Bridging the Gap Between Training and Inference of Bayesian Controllable Language Models (2206.05519v1)

Published 11 Jun 2022 in cs.CL and cs.AI

Abstract: Large-scale pre-trained LLMs have achieved great success on natural language generation tasks. However, it is difficult to control pre-trained LLMs to generate sentences with desired attributes such as topic and sentiment. Recently, Bayesian Controllable Language Models (BCLMs) have been shown to be efficient in controllable language generation. Rather than fine-tuning the parameters of pre-trained LLMs, BCLMs use external discriminators to guide the generation of pre-trained LLMs. However, the mismatch between training and inference of BCLMs limits the performance of the models. To address this problem, in this work we propose a "Gemini Discriminator" for controllable language generation, which alleviates the mismatch problem at a small computational cost. We tested our method on two controllable language generation tasks: sentiment control and topic control. On both tasks, our method achieved new state-of-the-art results in automatic and human evaluations.
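
The sketch below illustrates the general idea behind discriminator-guided (Bayesian) controllable decoding that the abstract describes: next-token scores from a frozen pre-trained LM are reweighted by an external attribute discriminator via Bayes' rule, rather than by fine-tuning the LM. It is a minimal illustration only; `lm_next_token_logits` and `attribute_log_prob` are hypothetical stand-ins, and this does not reproduce the paper's actual Gemini Discriminator.

```python
import torch

def guided_next_token(prefix_ids, lm_next_token_logits, attribute_log_prob,
                      top_k=50, alpha=1.0):
    """Pick the next token by combining LM and discriminator scores,
    following Bayes' rule: p(x_t | x_<t, a) ∝ p(x_t | x_<t) * p(a | x_<=t).

    prefix_ids: list[int] of tokens generated so far.
    lm_next_token_logits(prefix_ids) -> Tensor[vocab_size] (assumed helper).
    attribute_log_prob(token_ids) -> float, log p(attribute | prefix) (assumed helper).
    """
    lm_logits = lm_next_token_logits(prefix_ids)
    lm_logprobs = torch.log_softmax(lm_logits, dim=-1)

    # Score only the top-k LM candidates to keep the discriminator cost small.
    cand_logprobs, cand_ids = lm_logprobs.topk(top_k)
    scores = []
    for lp, tok in zip(cand_logprobs.tolist(), cand_ids.tolist()):
        # log p(a | x_<=t): discriminator scores the prefix extended by `tok`.
        attr_lp = attribute_log_prob(prefix_ids + [tok])
        scores.append(lp + alpha * attr_lp)

    best = int(torch.tensor(scores).argmax())
    return int(cand_ids[best])
```

In this framing, the training/inference mismatch the paper targets arises because the discriminator is typically trained on complete sentences but applied to partial prefixes during decoding.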

Authors (6)
  1. Han Liu (340 papers)
  2. Bingning Wang (29 papers)
  3. Ting Yao (127 papers)
  4. Haijin Liang (4 papers)
  5. Jianjin Xu (11 papers)
  6. Xiaolin Hu (97 papers)
Citations (1)