Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Local Structure Matters Most: Perturbation Study in NLU (2107.13955v2)

Published 29 Jul 2021 in cs.CL and cs.AI

Abstract: Recent research analyzing the sensitivity of natural language understanding models to word-order perturbations has shown that neural models are surprisingly insensitive to the order of words. In this paper, we investigate this phenomenon by developing order-altering perturbations on the order of words, subwords, and characters to analyze their effect on neural models' performance on language understanding tasks. We experiment with measuring the impact of perturbations to the local neighborhood of characters and global position of characters in the perturbed texts and observe that perturbation functions found in prior literature only affect the global ordering while the local ordering remains relatively unperturbed. We empirically show that neural models, invariant of their inductive biases, pretraining scheme, or the choice of tokenization, mostly rely on the local structure of text to build understanding and make limited use of the global structure.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Prasanna Parthasarathi (23 papers)
  2. Amal Zouaq (15 papers)
  3. Sarath Chandar (93 papers)
  4. Louis Clouatre (2 papers)
Citations (13)

Summary

We haven't generated a summary for this paper yet.