Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Context-aware Stand-alone Neural Spelling Correction (2011.06642v1)

Published 12 Nov 2020 in cs.CL

Abstract: Existing natural language processing systems are vulnerable to noisy inputs resulting from misspellings. On the contrary, humans can easily infer the corresponding correct words from their misspellings and surrounding context. Inspired by this, we address the stand-alone spelling correction problem, which only corrects the spelling of each token without additional token insertion or deletion, by utilizing both spelling information and global context representations. We present a simple yet powerful solution that jointly detects and corrects misspellings as a sequence labeling task by fine-turning a pre-trained LLM. Our solution outperforms the previous state-of-the-art result by 12.8% absolute F0.5 score.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Xiangci Li (16 papers)
  2. Hairong Liu (26 papers)
  3. Liang Huang (108 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.