Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition (2105.14980v2)

Published 31 May 2021 in cs.CL, cs.HC, and cs.LG

Abstract: Crowdsourcing is regarded as one prospective solution for effective supervised learning, aiming to build large-scale annotated training data by crowd workers. Previous studies focus on reducing the influences from the noises of the crowdsourced annotations for supervised models. We take a different point in this work, regarding all crowdsourced annotations as gold-standard with respect to the individual annotators. In this way, we find that crowdsourcing could be highly similar to domain adaptation, and then the recent advances of cross-domain methods can be almost directly applied to crowdsourcing. Here we take named entity recognition (NER) as a study case, suggesting an annotator-aware representation learning model that inspired by the domain adaptation methods which attempt to capture effective domain-aware features. We investigate both unsupervised and supervised crowdsourcing learning, assuming that no or only small-scale expert annotations are available. Experimental results on a benchmark crowdsourced NER dataset show that our method is highly effective, leading to a new state-of-the-art performance. In addition, under the supervised setting, we can achieve impressive performance gains with only a very small scale of expert annotations.

Citations (20)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

GitHub

GitHub - izhx/CLasDA: [ACL 21] Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition (6 stars)

Crowdsourcing Learning as Domain Adaptation: A Case Study on Named Entity Recognition (2105.14980v2)

Summary

Related Papers

GitHub