Neural Grammatical Error Correction with Finite State Transducers (1903.10625v2)

Published 25 Mar 2019 in cs.CL

Abstract: Grammatical error correction (GEC) is one of the areas in natural language processing in which purely neural models have not yet superseded more traditional symbolic models. Hybrid systems combining phrase-based statistical machine translation (SMT) and neural sequence models are currently among the most effective approaches to GEC. However, both SMT and neural sequence-to-sequence models require large amounts of annotated data. Language model based GEC (LM-GEC) is a promising alternative which does not rely on annotated training data. We show how to improve LM-GEC by applying modelling techniques based on finite state transducers. We report further gains by rescoring with neural language models. We show that our methods developed for LM-GEC can also be used with SMT systems if annotated training data is available. Our best system outperforms the best published result on the CoNLL-2014 test set, and achieves far better relative improvements over the SMT baselines than previous hybrid systems.

Authors (3)

Felix Stahlberg
Christopher Bryant
Bill Byrne

Citations (26)
