A Comparative Study of Pre-trained Encoders for Low-Resource Named Entity Recognition (2204.04980v1)

Published 11 Apr 2022 in cs.CL

Abstract: Pre-trained language models (PLM) are effective components of few-shot named entity recognition (NER) approaches when augmented with continued pre-training on task-specific out-of-domain data or fine-tuning on in-domain data. However, their performance in low-resource scenarios, where such data is not available, remains an open question. We introduce an encoder evaluation framework, and use it to systematically compare the performance of state-of-the-art pre-trained representations on the task of low-resource NER. We analyze a wide range of encoders pre-trained with different strategies, model architectures, intermediate-task fine-tuning, and contrastive learning. Our experimental results across ten benchmark NER datasets in English and German show that encoder performance varies significantly, suggesting that the choice of encoder for a specific low-resource scenario needs to be carefully evaluated.
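
The abstract describes comparing pre-trained encoder representations on low-resource NER. The snippet below is a minimal, hypothetical illustration of that idea, not the paper's actual evaluation framework: it pools first-subword vectors from frozen stand-in encoders (bert-base-cased and roberta-base, chosen only as examples) and probes them with a logistic-regression token tagger on a toy two-sentence dataset. All encoder names, the toy data, and the probing classifier are assumptions made for illustration.

```python
# Hypothetical sketch: probe frozen encoders for low-resource NER.
# Encoders, toy data, and the probe below are illustrative only,
# not the setup used in the paper.
import torch
from transformers import AutoTokenizer, AutoModel
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score

# Tiny toy corpus with word-level BIO labels (illustrative).
train = [(["Berlin", "is", "in", "Germany"], ["B-LOC", "O", "O", "B-LOC"])]
test = [(["Paris", "lies", "in", "France"], ["B-LOC", "O", "O", "B-LOC"])]
labels = ["O", "B-LOC"]


def word_embeddings(encoder_name, sentences):
    """Return one frozen-encoder vector per word (first-subword pooling)."""
    tok = AutoTokenizer.from_pretrained(encoder_name)
    model = AutoModel.from_pretrained(encoder_name).eval()
    feats = []
    for words, _ in sentences:
        enc = tok(words, is_split_into_words=True, return_tensors="pt")
        with torch.no_grad():
            hidden = model(**enc).last_hidden_state[0]
        seen = set()
        for pos, wid in enumerate(enc.word_ids()):
            if wid is not None and wid not in seen:
                seen.add(wid)
                feats.append(hidden[pos].numpy())
    return feats


def evaluate(encoder_name):
    """Fit a linear probe on the few labeled tokens and score on the test set."""
    X_tr = word_embeddings(encoder_name, train)
    X_te = word_embeddings(encoder_name, test)
    y_tr = [labels.index(t) for _, tags in train for t in tags]
    y_te = [labels.index(t) for _, tags in test for t in tags]
    clf = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
    return f1_score(y_te, clf.predict(X_te), average="micro")


for name in ["bert-base-cased", "roberta-base"]:  # candidate encoders (examples)
    print(name, evaluate(name))
```

In a probing setup like this, the encoder stays frozen and only the lightweight classifier sees the handful of labeled examples, so differences in the resulting scores reflect the quality of the pre-trained representations rather than task-specific fine-tuning.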

Authors (5)
  1. Yuxuan Chen (80 papers)
  2. Jonas Mikkelsen (1 paper)
  3. Arne Binder (4 papers)
  4. Christoph Alt (16 papers)
  5. Leonhard Hennig (25 papers)
Citations (1)
