Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fantastic Questions and Where to Find Them: FairytaleQA -- An Authentic Dataset for Narrative Comprehension (2203.13947v1)

Published 26 Mar 2022 in cs.CL

Abstract: Question answering (QA) is a fundamental means to facilitate assessment and training of narrative comprehension skills for both machines and young children, yet there is scarcity of high-quality QA datasets carefully designed to serve this purpose. In particular, existing datasets rarely distinguish fine-grained reading skills, such as the understanding of varying narrative elements. Drawing on the reading education research, we introduce FairytaleQA, a dataset focusing on narrative comprehension of kindergarten to eighth-grade students. Generated by educational experts based on an evidence-based theoretical framework, FairytaleQA consists of 10,580 explicit and implicit questions derived from 278 children-friendly stories, covering seven types of narrative elements or relations. Our dataset is valuable in two folds: First, we ran existing QA models on our dataset and confirmed that this annotation helps assess models' fine-grained learning skills. Second, the dataset supports question generation (QG) task in the education domain. Through benchmarking with QG models, we show that the QG model trained on FairytaleQA is capable of asking high-quality and more diverse questions.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (18)
  1. Ying Xu (81 papers)
  2. Dakuo Wang (87 papers)
  3. Mo Yu (117 papers)
  4. Daniel Ritchie (50 papers)
  5. Bingsheng Yao (49 papers)
  6. Tongshuang Wu (53 papers)
  7. Zheng Zhang (488 papers)
  8. Toby Jia-Jun Li (57 papers)
  9. Nora Bradford (1 paper)
  10. Branda Sun (1 paper)
  11. Tran Bao Hoang (1 paper)
  12. Yisi Sang (13 papers)
  13. Yufang Hou (49 papers)
  14. Xiaojuan Ma (74 papers)
  15. Diyi Yang (151 papers)
  16. Nanyun Peng (205 papers)
  17. Zhou Yu (206 papers)
  18. Mark Warschauer (4 papers)
Citations (88)

Summary

We haven't generated a summary for this paper yet.