Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization (2305.12782v1)

Published 22 May 2023 in cs.CL and cs.AI

Abstract: Generating persona-consistent dialogue responses is important for developing an intelligent conversational agent. Recent works typically fine-tune large-scale pre-trained models on this task by concatenating persona texts and dialogue history into a single input sequence to generate the target response. While simple and effective, our analysis shows that this popular practice is seriously affected by order sensitivity, where different input orders of persona sentences significantly impact the quality and consistency of the generated responses, resulting in severe performance fluctuations (i.e., 29.4% on GPT2 and 83.2% on BART). To mitigate the order sensitivity problem, we propose a model-agnostic framework, ORder Insensitive Generation (ORIG), which enables dialogue models to learn robust representations under different persona orders and improves the consistency of response generation. Experiments on the Persona-Chat dataset demonstrate the effectiveness and superiority of our method with two dominant pre-trained models (GPT2 and BART).
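The core idea in the abstract, pairing standard fine-tuning with a regularizer that pushes the model's predictions to agree across persona orders, can be sketched roughly as follows. This is a minimal illustration, not the paper's implementation: the symmetric-KL consistency term, the `consistency_weight` parameter, and the helper names `build_input` and `orig_step` are all assumptions for exposition.

```python
# Hypothetical sketch of order-insensitive regularization: run the same
# example twice with the persona sentences in two different orders and
# penalize divergence between the two output distributions. The actual
# ORIG objective may differ; see the paper for details.
import random

import torch.nn.functional as F
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")


def build_input(persona, history, response):
    # The popular practice the paper analyzes: concatenate persona
    # sentences and dialogue history into one conditioning sequence.
    text = " ".join(persona) + " " + " ".join(history) + " " + response
    return tokenizer(text, return_tensors="pt")


def orig_step(persona, history, response, consistency_weight=1.0):
    shuffled = persona[:]
    random.shuffle(shuffled)

    batch_a = build_input(persona, history, response)
    batch_b = build_input(shuffled, history, response)

    # Standard LM loss for each persona ordering (as a simplification,
    # the loss here covers the whole sequence, not just the response).
    out_a = model(**batch_a, labels=batch_a["input_ids"])
    out_b = model(**batch_b, labels=batch_b["input_ids"])

    # Consistency term: symmetric KL between the next-token
    # distributions at the final position under the two orders.
    p = F.log_softmax(out_a.logits[:, -1, :], dim=-1)
    q = F.log_softmax(out_b.logits[:, -1, :], dim=-1)
    consistency = 0.5 * (
        F.kl_div(p, q, log_target=True, reduction="batchmean")
        + F.kl_div(q, p, log_target=True, reduction="batchmean")
    )

    return out_a.loss + out_b.loss + consistency_weight * consistency
```

Because the regularizer only adds a second forward pass and a loss term, a scheme like this stays model-agnostic, which matches the abstract's claim that ORIG applies to both GPT2 and BART.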

Authors (6)
  1. Liang Chen (360 papers)
  2. Hongru Wang (62 papers)
  3. Yang Deng (113 papers)
  4. Wai-Chung Kwan (8 papers)
  5. Zezhong Wang (30 papers)
  6. Kam-Fai Wong (92 papers)
Citations (13)