Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DRCD: a Chinese Machine Reading Comprehension Dataset (1806.00920v3)

Published 4 Jun 2018 in cs.CL

Abstract: In this paper, we introduce DRCD (Delta Reading Comprehension Dataset), an open domain traditional Chinese machine reading comprehension (MRC) dataset. This dataset aimed to be a standard Chinese machine reading comprehension dataset, which can be a source dataset in transfer learning. The dataset contains 10,014 paragraphs from 2,108 Wikipedia articles and 30,000+ questions generated by annotators. We build a baseline model that achieves an F1 score of 89.59%. F1 score of Human performance is 93.30%.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Chih Chieh Shao (1 paper)
  2. Trois Liu (1 paper)
  3. Yuting Lai (1 paper)
  4. Yiying Tseng (1 paper)
  5. Sam Tsai (11 papers)
Citations (121)