Attribution Analysis of Grammatical Dependencies in LSTMs (2005.00062v1)

Published 30 Apr 2020 in cs.CL and cs.NE

Abstract: LSTM language models have been shown to capture syntax-sensitive grammatical dependencies such as subject-verb agreement with a high degree of accuracy (Linzen et al., 2016, inter alia). However, questions remain regarding whether they do so using spurious correlations, or whether they are truly able to match verbs with their subjects. This paper argues for the latter hypothesis. Using layer-wise relevance propagation (Bach et al., 2015), a technique that quantifies the contributions of input features to model behavior, we show that LSTM performance on number agreement is directly correlated with the model's ability to distinguish subjects from other nouns. Our results suggest that LSTM language models are able to infer robust representations of syntactic dependencies.
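
The abstract references two concrete procedures: the number-agreement evaluation of Linzen et al. (2016) and layer-wise relevance propagation as the attribution method. A faithful LRP pass through an LSTM requires custom relevance rules for the multiplicative gates, so the sketch below illustrates the general setup with gradient-times-input as a simpler stand-in attribution, not the paper's actual method; the toy vocabulary, model, and example sentence are all hypothetical.

```python
# Minimal sketch of token-level attribution on the number-agreement task.
# The paper uses layer-wise relevance propagation (Bach et al., 2015);
# gradient-times-input is substituted here for simplicity. The vocabulary,
# model, and sentence below are illustrative, not from the paper.
import torch
import torch.nn as nn

VOCAB = {"the": 0, "keys": 1, "to": 2, "cabinet": 3, "are": 4, "is": 5}

class ToyLSTMLanguageModel(nn.Module):
    def __init__(self, vocab_size, dim=32):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, dim)
        self.lstm = nn.LSTM(dim, dim, batch_first=True)
        self.out = nn.Linear(dim, vocab_size)

    def forward(self, ids):
        emb = self.embed(ids)        # (batch, time, dim)
        h, _ = self.lstm(emb)
        return self.out(h), emb      # next-token logits and input embeddings

torch.manual_seed(0)
model = ToyLSTMLanguageModel(len(VOCAB))

# Prefix "the keys to the cabinet": the subject "keys" is plural, while the
# intervening attractor "cabinet" is singular, as in Linzen et al. (2016).
words = ["the", "keys", "to", "the", "cabinet"]
ids = torch.tensor([[VOCAB[w] for w in words]])

logits, emb = model(ids)
emb.retain_grad()  # keep gradients on the non-leaf embedding tensor

# Agreement score: logit of the correct plural verb minus the singular one,
# read off at the final position of the prefix.
score = logits[0, -1, VOCAB["are"]] - logits[0, -1, VOCAB["is"]]
score.backward()

# Per-token attribution, summed over the embedding dimension. A positive
# value means the token pushed the model toward the correct plural verb.
attributions = (emb.grad * emb).sum(dim=-1)[0]
for word, a in zip(words, attributions):
    print(f"{word:>8s} {a.item():+.4f}")
```

On a trained model, the paper's central finding would correspond to the subject "keys" receiving markedly more positive attribution for the plural verb than the attractor "cabinet"; with the untrained toy model above the printed numbers are meaningless and serve only to exercise the pipeline.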

Authors (1)
  1. Yiding Hao
Citations (3)
