A Chat About Boring Problems: Studying GPT-based text normalization (2309.13426v2)

Published 23 Sep 2023 in cs.CL and cs.AI

Abstract: Text normalization - the conversion of text from written to spoken form - is traditionally assumed to be an ill-formed task for language models. In this work, we argue otherwise. We empirically show the capacity of large language models (LLMs) for text normalization in few-shot scenarios. Combining self-consistency reasoning with linguistically-informed prompt engineering, we find LLM-based text normalization to achieve error rates around 40% lower than top normalization systems. Further, upon error analysis, we note key limitations in the conventional design of text normalization tasks. We create a new taxonomy of text normalization errors and apply it to results from GPT-3.5-Turbo and GPT-4.0. Through this new framework, we can identify strengths and weaknesses of GPT-based TN, opening opportunities for future work.
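
As a rough illustration of the approach the abstract describes (few-shot prompting combined with self-consistency), the sketch below samples several candidate normalizations and takes a majority vote. The `call_llm` stand-in, the few-shot examples, and the sampling parameters are illustrative assumptions, not the paper's actual prompt or settings.

```python
# Minimal sketch of few-shot text normalization with self-consistency
# (majority vote over sampled generations). `call_llm` is a hypothetical
# stand-in for whatever chat-completion client you actually use.
from collections import Counter
from typing import Callable

FEW_SHOT_PROMPT = """Convert the written-form sentence to its spoken form.

Written: The meeting is on May 12, 2021 at 3 PM.
Spoken: The meeting is on may twelfth twenty twenty one at three p m.

Written: It costs $5.50.
Spoken: It costs five dollars fifty cents.

Written: {sentence}
Spoken:"""


def normalize_with_self_consistency(
    sentence: str,
    call_llm: Callable[[str, float], str],  # (prompt, temperature) -> completion
    num_samples: int = 5,
    temperature: float = 0.7,
) -> str:
    """Sample several normalizations and return the most frequent one."""
    prompt = FEW_SHOT_PROMPT.format(sentence=sentence)
    candidates = [call_llm(prompt, temperature).strip() for _ in range(num_samples)]
    # Self-consistency: the answer produced most often is taken as final.
    most_common, _count = Counter(candidates).most_common(1)[0]
    return most_common


if __name__ == "__main__":
    # Toy stand-in for an LLM call so the sketch runs end to end.
    def fake_llm(prompt: str, temperature: float) -> str:
        return "It weighs two point five kilograms."

    print(normalize_with_self_consistency("It weighs 2.5 kg.", fake_llm))
```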

Authors (6)
  1. Yang Zhang (1129 papers)
  2. Travis M. Bartley (3 papers)
  3. Mariana Graterol-Fuenmayor (1 paper)
  4. Vitaly Lavrukhin (32 papers)
  5. Evelina Bakhturina (21 papers)
  6. Boris Ginsburg (111 papers)
Citations (2)
