Rethinking Label Smoothing on Multi-hop Question Answering (2212.09512v3)

Published 19 Dec 2022 in cs.CL

Abstract: Multi-Hop Question Answering (MHQA) is a significant area in question answering, requiring multiple reasoning components, including document retrieval, supporting sentence prediction, and answer span extraction. In this work, we analyze the primary factors limiting the performance of multi-hop reasoning and introduce label smoothing into the MHQA task. This is aimed at enhancing the generalization capabilities of MHQA systems and mitigating overfitting to answer spans and reasoning paths in the training set. We propose a novel label smoothing technique, F1 Smoothing, which incorporates uncertainty into the learning process and is specifically tailored for Machine Reading Comprehension (MRC) tasks. Inspired by the principles of curriculum learning, we introduce the Linear Decay Label Smoothing Algorithm (LDLA), which progressively reduces uncertainty throughout the training process. Experiments on the HotpotQA dataset demonstrate the effectiveness of our methods in enhancing performance and generalizability in multi-hop reasoning, achieving new state-of-the-art results on the leaderboard.
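The two ingredients the abstract names, label smoothing of the target distribution and a linearly decaying smoothing factor, can be illustrated with a minimal sketch. The schedule and the `eps0` value below are illustrative assumptions, not the paper's exact hyperparameters, and the paper's F1 Smoothing additionally reweights span candidates, which this plain uniform-smoothing sketch does not implement:

```python
import math

def smoothing_factor(epoch, total_epochs, eps0=0.1):
    """Linear decay of the smoothing factor from eps0 down to 0,
    in the spirit of LDLA (illustrative schedule)."""
    return eps0 * (1.0 - epoch / total_epochs)

def smoothed_cross_entropy(log_probs, target_idx, eps):
    """Cross-entropy against a label-smoothed target: the gold label
    receives probability 1 - eps, and the mass eps is spread uniformly
    over all classes."""
    n = len(log_probs)
    loss = 0.0
    for i, lp in enumerate(log_probs):
        q = (1.0 - eps) if i == target_idx else 0.0
        q += eps / n
        loss -= q * lp
    return loss

# Example: model assigns probabilities [0.7, 0.2, 0.1]; gold class is 0.
log_probs = [math.log(0.7), math.log(0.2), math.log(0.1)]
eps = smoothing_factor(epoch=0, total_epochs=10)   # 0.1 at the start
loss_smoothed = smoothed_cross_entropy(log_probs, 0, eps)
loss_hard = smoothed_cross_entropy(log_probs, 0, 0.0)  # plain cross-entropy
```

With `eps = 0` the loss reduces to standard cross-entropy, and as training progresses `smoothing_factor` drives `eps` to 0, recovering hard targets in the final epochs.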

Authors (9)
  1. Zhangyue Yin (27 papers)
  2. Yuxin Wang (132 papers)
  3. Xiannian Hu (2 papers)
  4. Yiguang Wu (1 paper)
  5. Hang Yan (86 papers)
  6. Xinyu Zhang (296 papers)
  7. Zhao Cao (36 papers)
  8. Xuanjing Huang (287 papers)
  9. Xipeng Qiu (257 papers)
Citations (9)