
Attributed Question Answering: Evaluation and Modeling for Attributed Large Language Models (2212.08037v2)

Published 15 Dec 2022 in cs.CL

Abstract: LLMs have shown impressive results while requiring little or no direct supervision. Further, there is mounting evidence that LLMs may have potential in information-seeking scenarios. We believe the ability of an LLM to attribute the text that it generates is likely to be crucial in this setting. We formulate and study Attributed QA as a key first step in the development of attributed LLMs. We propose a reproducible evaluation framework for the task and benchmark a broad set of architectures. We take human annotations as a gold standard and show that a correlated automatic metric is suitable for development. Our experimental work gives concrete answers to two key questions (How to measure attribution? and How well do current state-of-the-art methods perform on attribution?), and gives some hints as to how to address a third (How to build LLMs with attribution?).

Authors (22)
  1. Bernd Bohnet (21 papers)
  2. Vinh Q. Tran (19 papers)
  3. Pat Verga (16 papers)
  4. Roee Aharoni (35 papers)
  5. Daniel Andor (14 papers)
  6. Livio Baldini Soares (18 papers)
  7. Massimiliano Ciaramita (15 papers)
  8. Jacob Eisenstein (73 papers)
  9. Kuzman Ganchev (13 papers)
  10. Jonathan Herzig (34 papers)
  11. Kai Hui (27 papers)
  12. Tom Kwiatkowski (21 papers)
  13. Ji Ma (72 papers)
  14. Jianmo Ni (31 papers)
  15. Lierni Sestorain Saralegui (2 papers)
  16. Tal Schuster (33 papers)
  17. William W. Cohen (79 papers)
  18. Michael Collins (46 papers)
  19. Dipanjan Das (42 papers)
  20. Donald Metzler (49 papers)
Citations (56)