Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ParaQA: A Question Answering Dataset with Paraphrase Responses for Single-Turn Conversation (2103.07771v1)

Published 13 Mar 2021 in cs.CL

Abstract: This paper presents ParaQA, a question answering (QA) dataset with multiple paraphrased responses for single-turn conversation over knowledge graphs (KG). The dataset was created using a semi-automated framework for generating diverse paraphrasing of the answers using techniques such as back-translation. The existing datasets for conversational question answering over KGs (single-turn/multi-turn) focus on question paraphrasing and provide only up to one answer verbalization. However, ParaQA contains 5000 question-answer pairs with a minimum of two and a maximum of eight unique paraphrased responses for each question. We complement the dataset with baseline models and illustrate the advantage of having multiple paraphrased answers through commonly used metrics such as BLEU and METEOR. The ParaQA dataset is publicly available on a persistent URI for broader usage and adaptation in the research community.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Endri Kacupaj (10 papers)
  2. Barshana Banerjee (1 paper)
  3. Kuldeep Singh (50 papers)
  4. Jens Lehmann (80 papers)
Citations (17)

Summary

We haven't generated a summary for this paper yet.