Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking (2105.12306v1)

Published 26 May 2021 in cs.CL

Abstract: Chinese Spell Checking (CSC) aims to detect and correct erroneous characters for user-generated text in the Chinese language. Most of the Chinese spelling errors are misused semantically, phonetically or graphically similar characters. Previous attempts noticed this phenomenon and try to use the similarity for this task. However, these methods use either heuristics or handcrafted confusion sets to predict the correct character. In this paper, we propose a Chinese spell checker called ReaLiSe, by directly leveraging the multimodal information of the Chinese characters. The ReaLiSe model tackles the CSC task by (1) capturing the semantic, phonetic and graphic information of the input characters, and (2) selectively mixing the information in these modalities to predict the correct output. Experiments on the SIGHAN benchmarks show that the proposed model outperforms strong baselines by a large margin.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Heng-Da Xu (4 papers)
  2. Zhongli Li (11 papers)
  3. Qingyu Zhou (28 papers)
  4. Chao Li (429 papers)
  5. Zizhen Wang (5 papers)
  6. Yunbo Cao (43 papers)
  7. Heyan Huang (107 papers)
  8. Xian-Ling Mao (76 papers)
Citations (85)