Transformers in the loop: Polarity in neural models of language (2109.03926v2)
Abstract: Representation of linguistic phenomena in computational LLMs is typically assessed against the predictions of existing linguistic theories of these phenomena. Using the notion of polarity as a case study, we show that this is not always the most adequate set-up. We probe polarity via so-called 'negative polarity items' (in particular, English 'any') in two pre-trained Transformer-based models (BERT and GPT-2). We show that - at least for polarity - metrics derived from LLMs are more consistent with data from psycholinguistic experiments than linguistic theory predictions. Establishing this allows us to more adequately evaluate the performance of LLMs and also to use LLMs to discover new insights into natural language grammar beyond existing linguistic theories. This work contributes to establishing closer ties between psycholinguistic experiments and experiments with LLMs.
- Lisa Bylinina (7 papers)
- Alexey Tikhonov (35 papers)