Lower Bounds on the Expressivity of Recurrent Neural Language Models (2405.19222v2)

Published 29 May 2024 in cs.CL

Abstract: The recent successes and spread of large neural language models (LMs) call for a thorough understanding of their computational ability. Describing their computational abilities through LMs' \emph{representational capacity} is a lively area of research. However, investigation into the representational capacity of neural LMs has predominantly focused on their ability to \emph{recognize} formal languages. For example, recurrent neural networks (RNNs) with Heaviside activations are tightly linked to regular languages, i.e., languages defined by finite-state automata (FSAs). Such results, however, fall short of describing the capabilities of RNN \emph{language models} (LMs), which are definitionally \emph{distributions} over strings. We take a fresh look at the representational capacity of RNN LMs by connecting them to \emph{probabilistic} FSAs and demonstrate that RNN LMs with linearly bounded precision can express arbitrary regular LMs.
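As a hedged illustration of the probabilistic FSAs the abstract connects RNN LMs to, the sketch below shows a tiny PFSA viewed as a distribution over strings: each state's outgoing transition weights plus its end-of-string probability sum to 1, so the automaton is locally normalized. The automaton, alphabet, and weights here are invented for the example and are not taken from the paper.

```python
# A minimal probabilistic finite-state automaton (PFSA) over {a, b}.
# transitions[state][symbol] = (next_state, probability)
transitions = {
    0: {"a": (1, 0.5), "b": (0, 0.3)},
    1: {"a": (1, 0.4), "b": (0, 0.2)},
}
# Probability of ending the string in each state.
final = {0: 0.2, 1: 0.4}

def string_prob(s, start=0):
    """Probability the PFSA assigns to the complete string s."""
    state, p = start, 1.0
    for sym in s:
        state, w = transitions[state][sym]
        p *= w
    return p * final[state]

# Local normalization: weights out of each state (plus stopping) sum to 1.
for q in transitions:
    total = sum(w for _, w in transitions[q].values()) + final[q]
    assert abs(total - 1.0) < 1e-9
```

For example, `string_prob("ab")` is 0.5 (a from state 0) times 0.2 (b from state 1) times 0.2 (stop in state 0), i.e. 0.02. The paper's construction, roughly, encodes such automata in RNN hidden states so that the RNN LM's next-token distributions reproduce the PFSA's transition weights.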

Authors (4)
  1. Anej Svete (20 papers)
  2. Franz Nowak (8 papers)
  3. Anisha Mohamed Sahabdeen (1 paper)
  4. Ryan Cotterell (226 papers)