
Conversational Machine Reading Comprehension for Vietnamese Healthcare Texts (2105.01542v6)

Published 4 May 2021 in cs.CL

Abstract: Machine reading comprehension (MRC) is a sub-field of natural language processing that aims to help computers understand unstructured texts and answer questions about them. In practice, conversation is an essential way to communicate and transfer information. To help machines understand conversational texts, we present UIT-ViCoQA, a new corpus for conversational machine reading comprehension in Vietnamese. The corpus consists of 10,000 questions with answers over 2,000 conversations about health news articles. We then evaluate several baseline approaches for conversational machine comprehension on the UIT-ViCoQA corpus. The best model obtains an F1 score of 45.27%, which is 30.91 points behind human performance (76.18%), indicating that there is ample room for improvement. Our dataset is available at our website, http://nlp.uit.edu.vn/datasets/, for research purposes.

Authors (6)
  1. Son T. Luu (26 papers)
  2. Mao Nguyen Bui (1 paper)
  3. Loi Duc Nguyen (1 paper)
  4. Khiem Vinh Tran (7 papers)
  5. Kiet Van Nguyen (74 papers)
  6. Ngan Luu-Thuy Nguyen (56 papers)
Citations (8)