Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Investigating BERT's Knowledge of Language: Five Analysis Methods with NPIs (1909.02597v2)

Published 5 Sep 2019 in cs.CL

Abstract: Though state-of-the-art sentence representation models can perform tasks requiring significant knowledge of grammar, it is an open question how best to evaluate their grammatical knowledge. We explore five experimental methods inspired by prior work evaluating pretrained sentence representation models. We use a single linguistic phenomenon, negative polarity item (NPI) licensing in English, as a case study for our experiments. NPIs like "any" are grammatical only if they appear in a licensing environment like negation ("Sue doesn't have any cats" vs. "Sue has any cats"). This phenomenon is challenging because of the variety of NPI licensing environments that exist. We introduce an artificially generated dataset that manipulates key features of NPI licensing for the experiments. We find that BERT has significant knowledge of these features, but its success varies widely across different experimental methods. We conclude that a variety of methods is necessary to reveal all relevant aspects of a model's grammatical knowledge in a given domain.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (16)
  1. Alex Warstadt (35 papers)
  2. Yu Cao (129 papers)
  3. Ioana Grosu (1 paper)
  4. Wei Peng (165 papers)
  5. Hagen Blix (4 papers)
  6. Yining Nie (1 paper)
  7. Anna Alsop (1 paper)
  8. Shikha Bordia (6 papers)
  9. Haokun Liu (26 papers)
  10. Alicia Parrish (31 papers)
  11. Sheng-Fu Wang (6 papers)
  12. Jason Phang (40 papers)
  13. Anhad Mohananey (6 papers)
  14. Phu Mon Htut (18 papers)
  15. Samuel R. Bowman (103 papers)
  16. Paloma Jeretič (2 papers)
Citations (120)

Summary

We haven't generated a summary for this paper yet.