The Locality and Symmetry of Positional Encodings (2310.12864v1)

Published 19 Oct 2023 in cs.CL

Abstract: Positional Encodings (PEs) are used to inject word-order information into transformer-based language models. While they can significantly enhance the quality of sentence representations, their specific contribution to language models is not fully understood, especially given recent findings that various positional encodings are insensitive to word order. In this work, we conduct a systematic study of positional encodings in Bidirectional Masked Language Models (BERT-style), which complements existing work in three aspects: (1) We uncover the core function of PEs by identifying two common properties, Locality and Symmetry; (2) We show that the two properties are closely correlated with the performance of downstream tasks; (3) We quantify the weakness of current PEs by introducing two new probing tasks, on which current PEs perform poorly. We believe that these results are the basis for developing better PEs for transformer-based language models. The code is available at https://github.com/tigerchen52/locality_symmetry
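
The two properties named in the abstract, Locality and Symmetry, can be made concrete with a small sketch. The snippet below is not the paper's metric (the exact definitions and experiments live in the linked repository); it is a minimal illustration, assuming standard sinusoidal encodings and simple dot-product proxies. The helpers sinusoidal_pe, locality_score, and symmetry_score are hypothetical names introduced here: the first builds the classic Vaswani et al. encodings, the second asks whether similarity decays with position distance, and the third asks whether positions k steps ahead and k steps behind an anchor look alike.

import numpy as np

def sinusoidal_pe(n_pos, d_model):
    # Standard sinusoidal positional encodings (Vaswani et al., 2017).
    pos = np.arange(n_pos)[:, None]
    dim = np.arange(d_model)[None, :]
    angle = pos / np.power(10000, (2 * (dim // 2)) / d_model)
    pe = np.zeros((n_pos, d_model))
    pe[:, 0::2] = np.sin(angle[:, 0::2])
    pe[:, 1::2] = np.cos(angle[:, 1::2])
    return pe

def locality_score(pe):
    # Illustrative locality proxy: negated correlation between position
    # distance |i - j| and dot-product similarity PE(i) . PE(j).
    # Positive values mean nearby positions are more similar than distant ones.
    sim = pe @ pe.T
    ii, jj = np.triu_indices(len(pe), k=1)
    dist = np.abs(ii - jj)
    return -np.corrcoef(dist, sim[ii, jj])[0, 1]

def symmetry_score(pe, max_offset=32):
    # Illustrative symmetry proxy: 1 minus the mean relative gap between
    # similarities at offsets +k and -k around the same anchor position.
    sim = pe @ pe.T
    n = len(pe)
    gaps = []
    for i in range(max_offset, n - max_offset):
        fwd = sim[i, i + 1 : i + 1 + max_offset]   # offsets +1 .. +max_offset
        bwd = sim[i, i - max_offset : i][::-1]     # offsets -1 .. -max_offset
        gaps.append(np.abs(fwd - bwd).mean() / (np.abs(sim[i]).mean() + 1e-8))
    return 1.0 - float(np.mean(gaps))

pe = sinusoidal_pe(n_pos=128, d_model=64)
print(f"locality ~ {locality_score(pe):.3f}, symmetry ~ {symmetry_score(pe):.3f}")

For sinusoidal encodings the symmetry proxy is essentially 1, since the dot product depends only on |i - j|, and the locality proxy is positive because similarity tends to decay with distance. Learned absolute position embeddings, as used in BERT-style models, carry no such guarantee by construction, which is why the paper measures these properties empirically across pretrained models.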

Authors (3)
  1. Lihu Chen (12 papers)
  2. Gaël Varoquaux (87 papers)
  3. Fabian M. Suchanek (12 papers)

