MASKER: Masked Keyword Regularization for Reliable Text Classification (2012.09392v1)

Published 17 Dec 2020 in cs.LG and cs.CL

Abstract: Pre-trained LLMs have achieved state-of-the-art accuracies on various text classification tasks, e.g., sentiment analysis, natural language inference, and semantic textual similarity. However, the reliability of the fine-tuned text classifiers is an often overlooked performance criterion. For instance, one may desire a model that can detect out-of-distribution (OOD) samples (drawn far from the training distribution) or be robust against domain shifts. We claim that one central obstacle to reliability is the over-reliance of the model on a limited number of keywords, instead of looking at the whole context. In particular, we find that (a) OOD samples often contain in-distribution keywords, while (b) cross-domain samples may not always contain keywords; over-relying on the keywords can be problematic for both cases. In light of this observation, we propose a simple yet effective fine-tuning method, coined masked keyword regularization (MASKER), that facilitates context-based prediction. MASKER regularizes the model to reconstruct the keywords from the rest of the words and make low-confidence predictions without enough context. When applied to various pre-trained LLMs (e.g., BERT, RoBERTa, and ALBERT), we demonstrate that MASKER improves OOD detection and cross-domain generalization without degrading classification accuracy. Code is available at https://github.com/alinlab/MASKER.
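
The two regularizers described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation (see the linked repository for that); it assumes the keyword set is already given, and the specific loss forms used here (cross-entropy for keyword reconstruction, KL divergence from a uniform distribution for low-confidence prediction) are illustrative choices that the abstract does not specify. All function names are hypothetical.

```python
import math

MASK = "[MASK]"  # placeholder token; the actual mask token depends on the tokenizer


def mask_keywords(tokens, keywords):
    """Hide the keywords so they must be reconstructed from the remaining words."""
    masked = [MASK if tok in keywords else tok for tok in tokens]
    targets = {i: tok for i, tok in enumerate(tokens) if tok in keywords}
    return masked, targets


def mask_context(tokens, keywords):
    """Hide everything except the keywords, leaving an input without enough context."""
    return [tok if tok in keywords else MASK for tok in tokens]


def reconstruction_loss(predicted, targets):
    """Mean cross-entropy of the true keyword at each masked position.

    `predicted` maps position -> {word: probability}; in practice these
    probabilities would come from the model (assumed here, not computed).
    """
    if not targets:
        return 0.0
    total = sum(-math.log(predicted[i][word]) for i, word in targets.items())
    return total / len(targets)


def kl_from_uniform(class_probs):
    """KL(uniform || p): zero when the prediction is uniform (least confident).

    Assumes every probability is strictly positive.
    """
    k = len(class_probs)
    return sum((1.0 / k) * math.log((1.0 / k) / p) for p in class_probs)


# Toy usage with a hand-picked keyword set.
tokens = ["the", "movie", "was", "great"]
keywords = {"great"}

keyword_masked, targets = mask_keywords(tokens, keywords)
context_masked = mask_context(tokens, keywords)

# Stand-in model outputs (made up for illustration).
recon = reconstruction_loss({3: {"great": 0.5, "bad": 0.5}}, targets)
confident_penalty = kl_from_uniform([0.9, 0.1])
uncertain_penalty = kl_from_uniform([0.5, 0.5])
```

In a fine-tuning loop, terms like these would be added to the usual classification loss with weighting coefficients; the penalty on the context-masked input is larger when the classifier stays confident from keywords alone.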

Authors (5)

Seung Jun Moon
Sangwoo Mo
Kimin Lee
Jaeho Lee
Jinwoo Shin

Citations (35)

GitHub

GitHub - alinlab/MASKER: MASKER: Masked Keyword Regularization for Reliable Text Classification (AAAI 2021) (51 stars)
