Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DEXTER: Deep Encoding of External Knowledge for Named Entity Recognition in Virtual Assistants (2108.06633v1)

Published 15 Aug 2021 in cs.CL, cs.AI, and cs.LG

Abstract: Named entity recognition (NER) is usually developed and tested on text from well-written sources. However, in intelligent voice assistants, where NER is an important component, input to NER may be noisy because of user or speech recognition error. In applications, entity labels may change frequently, and non-textual properties like topicality or popularity may be needed to choose among alternatives. We describe a NER system intended to address these problems. We test and train this system on a proprietary user-derived dataset. We compare with a baseline text-only NER system; the baseline enhanced with external gazetteers; and the baseline enhanced with the search and indirect labelling techniques we describe below. The final configuration gives around 6% reduction in NER error rate. We also show that this technique improves related tasks, such as semantic parsing, with an improvement of up to 5% in error rate.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Deepak Muralidharan (3 papers)
  2. Joel Ruben Antony Moniz (23 papers)
  3. Weicheng Zhang (4 papers)
  4. Stephen Pulman (14 papers)
  5. Lin Li (330 papers)
  6. Megan Barnes (3 papers)
  7. Jingjing Pan (5 papers)
  8. Jason Williams (27 papers)
  9. Alex Acero (1 paper)
Citations (2)

Summary

We haven't generated a summary for this paper yet.