KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations (2403.01469v3)

Published 3 Mar 2024 in cs.CL

Abstract: We present KorMedMCQA, the first Korean Medical Multiple-Choice Question Answering benchmark, derived from professional healthcare licensing examinations conducted in Korea between 2012 and 2024. The dataset contains 7,469 questions from examinations for doctor, nurse, pharmacist, and dentist, covering a wide range of medical disciplines. We evaluate the performance of 59 LLMs, spanning proprietary and open-source models, multilingual and Korean-specialized models, and those fine-tuned for clinical applications. Our results show that applying Chain of Thought (CoT) reasoning can enhance the model performance by up to 4.5% compared to direct answering approaches. We also investigate whether MedQA, one of the most widely used medical benchmarks derived from the U.S. Medical Licensing Examination, can serve as a reliable proxy for evaluating model performance in other regions-in this case, Korea. Our correlation analysis between model scores on KorMedMCQA and MedQA reveals that these two benchmarks align no better than benchmarks from entirely different domains (e.g., MedQA and MMLU-Pro). This finding underscores the substantial linguistic and clinical differences between Korean and U.S. medical contexts, reinforcing the need for region-specific medical QA benchmarks. To support ongoing research in Korean healthcare AI, we publicly release the KorMedMCQA via Huggingface.

PDF HTML Abstract

Summarize Bookmark Chat (Pro)

References (21)

Authors (10)

Sunjun Kweon (7 papers)
Byungjin Choi (1 paper)
Minkyu Kim (51 papers)
Rae Woong Park (2 papers)
Edward Choi (90 papers)
Gyouk Chu (2 papers)
Junyeong Song (1 paper)
Daeun Hyeon (1 paper)
Sujin Gan (1 paper)
Jueon Kim (1 paper)

Citations (4)

View on Semantic Scholar

KorMedMCQA: Multi-Choice Question Answering Benchmark for Korean Healthcare Professional Licensing Examinations (2403.01469v3)

Related Papers