Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multilinguals at SemEval-2022 Task 11: Complex NER in Semantically Ambiguous Settings for Low Resource Languages (2207.06882v1)

Published 14 Jul 2022 in cs.CL, cs.AI, and cs.LG

Abstract: We leverage pre-trained LLMs to solve the task of complex NER for two low-resource languages: Chinese and Spanish. We use the technique of Whole Word Masking(WWM) to boost the performance of masked LLMing objective on large and unsupervised corpora. We experiment with multiple neural network architectures, incorporating CRF, BiLSTMs, and Linear Classifiers on top of a fine-tuned BERT layer. All our models outperform the baseline by a significant margin and our best performing model obtains a competitive position on the evaluation leaderboard for the blind test set.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Amit Pandey (5 papers)
  2. Swayatta Daw (2 papers)
  3. Narendra Babu Unnam (2 papers)
  4. Vikram Pudi (11 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.