
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation (2305.00955v2)

Published 1 May 2023 in cs.CL, cs.AI, and cs.LG

Abstract: Many recent advances in natural language generation have been fueled by training LLMs on internet-scale data. However, this paradigm can lead to models that generate toxic, inaccurate, and unhelpful content, and automatic evaluation metrics often fail to identify these behaviors. As models become more capable, human feedback is an invaluable signal for evaluating and improving models. This survey aims to provide an overview of the recent research that has leveraged human feedback to improve natural language generation. First, we introduce an encompassing formalization of feedback, and identify and organize existing research into a taxonomy following this formalization. Next, we discuss how feedback can be described by its format and objective, and cover the two approaches proposed to use feedback (either for training or decoding): directly using the feedback or training feedback models. We also discuss existing datasets for human-feedback data collection, and concerns surrounding feedback collection. Finally, we provide an overview of the nascent field of AI feedback, which exploits LLMs to make judgments based on a set of principles and minimize the need for human intervention.

Authors (11)
  1. Patrick Fernandes (32 papers)
  2. Aman Madaan (30 papers)
  3. Emmy Liu (17 papers)
  4. António Farinhas (18 papers)
  5. Pedro Henrique Martins (11 papers)
  6. Amanda Bertsch (14 papers)
  7. José G. C. de Souza (12 papers)
  8. Shuyan Zhou (28 papers)
  9. Tongshuang Wu (53 papers)
  10. Graham Neubig (342 papers)
  11. André F. T. Martins (113 papers)
Citations (50)
