Laying Anchors: Semantically Priming Numerals in Language Modeling (2404.01536v2)
Abstract: Off-the-shelf pre-trained LLMs have become the de facto standard in NLP pipelines for a multitude of downstream tasks. However, the inability of these models to properly encode numerals limits their performance on tasks requiring numeric comprehension. We introduce strategies to semantically prime numerals in any corpus by generating anchors governed by the distribution of numerals in that corpus, thereby enabling mathematically grounded representations of these numeral tokens. We establish the superiority of our proposed techniques through evaluation on a range of numeracy tasks for both in-domain (seen) and out-of-domain (unseen) numerals. Further, we expand our empirical evaluations to numerals ranging from 1 to 10 billion, a significantly broader range than in previous studies of this nature, and we demonstrate significant improvements in the mathematical grounding of our learned embeddings.
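The abstract's core idea is to derive anchor values from the distribution of numerals in a corpus. The paper's exact anchoring procedure is not reproduced here; as a hedged stand-in, the sketch below extracts numerals from a toy corpus, fits a minimal one-dimensional Gaussian mixture (via EM) to their log10 values, and maps the component means back to the linear scale as candidate anchors. All function names (`extract_numerals`, `fit_gmm_1d`) and the choice of a log-scale mixture are illustrative assumptions, not the authors' implementation.

```python
import math
import random
import re

def extract_numerals(corpus):
    """Collect all positive numeric tokens from a list of sentences."""
    nums = []
    for sent in corpus:
        nums += [float(t) for t in re.findall(r"\d+(?:\.\d+)?", sent)]
    return [n for n in nums if n > 0]

def fit_gmm_1d(xs, k=2, iters=50, seed=0):
    """Minimal EM for a 1-D Gaussian mixture; returns sorted component means."""
    rng = random.Random(seed)
    mu = rng.sample(xs, k)          # initialize means from the data
    var = [1.0] * k
    pi = [1.0 / k] * k
    for _ in range(iters):
        # E-step: per-point responsibilities under each component
        resp = []
        for x in xs:
            w = [pi[j] * math.exp(-((x - mu[j]) ** 2) / (2 * var[j]))
                 / math.sqrt(2 * math.pi * var[j]) for j in range(k)]
            s = sum(w) or 1e-12
            resp.append([wj / s for wj in w])
        # M-step: re-estimate means, variances, and mixing weights
        for j in range(k):
            nj = sum(r[j] for r in resp) or 1e-12
            mu[j] = sum(r[j] * x for r, x in zip(resp, xs)) / nj
            var[j] = max(sum(r[j] * (x - mu[j]) ** 2
                             for r, x in zip(resp, xs)) / nj, 1e-6)
            pi[j] = nj / len(xs)
    return sorted(mu)

# Toy corpus: numerals cluster into "small counts" and "large magnitudes"
corpus = ["the company sold 120 units", "revenue hit 4500000 dollars",
          "a team of 8 engineers", "about 13 percent growth",
          "over 2000000 users joined"]
logs = [math.log10(n) for n in extract_numerals(corpus)]
anchors = [round(10 ** m) for m in fit_gmm_1d(logs, k=2)]
```

The log-scale choice echoes the logarithmic mental number line discussed in the paper's cognitive-science references; in this sketch the resulting anchors track the two magnitude clusters present in the toy corpus.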
Authors: Mandar Sharma, Rutuja Murlidhar Taware, Pravesh Koirala, Nikhil Muralidhar, Naren Ramakrishnan