2000 character limit reached
Do Language Models Understand Measurements? (2210.12694v1)
Published 23 Oct 2022 in cs.CL
Abstract: Recent success of pre-trained LLMs (PLMs) has stimulated interest in their ability to understand and work with numbers. Yet, the numerical reasoning over measurements has not been formally studied despite their importance. In this study, we show that PLMs lack the capability required for reasoning over measurements. Furthermore, we find that a LLM trained on a measurement-rich corpus shows better performance on understanding measurements. We propose a simple embedding strategy to better distinguish between numbers and units, which leads to a significant improvement in the probing tasks.