Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Interventional Probing in High Dimensions: An NLI Case Study (2304.10346v1)

Published 20 Apr 2023 in cs.CL

Abstract: Probing strategies have been shown to detect the presence of various linguistic features in LLMs; in particular, semantic features intermediate to the "natural logic" fragment of the Natural Language Inference task (NLI). In the case of natural logic, the relation between the intermediate features and the entailment label is explicitly known: as such, this provides a ripe setting for interventional studies on the NLI models' representations, allowing for stronger causal conjectures and a deeper critical analysis of interventional probing methods. In this work, we carry out new and existing representation-level interventions to investigate the effect of these semantic features on NLI classification: we perform amnesic probing (which removes features as directed by learned linear probes) and introduce the mnestic probing variation (which forgets all dimensions except the probe-selected ones). Furthermore, we delve into the limitations of these methods and outline some pitfalls have been obscuring the effectivity of interventional probing studies.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Julia Rozanova (11 papers)
  2. Marco Valentino (46 papers)
  3. Lucas Cordeiro (30 papers)
  4. Andre Freitas (52 papers)
Citations (6)