LieRE: Generalizing Rotary Position Encodings (2406.10322v2)

Published 14 Jun 2024 in cs.CV and cs.LG

Abstract: While Rotary Position Embeddings (RoPE) for LLMs have become widely adopted, their application for other modalities has been slower. Here, we introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs. We evaluate the performance of LieRE on 2D and 3D image classification tasks and observe that LieRE leads to marked relative improvements in performance (up to 9.7% for 2D and up to 25.5% for 3D), training efficiency (3.5x reduction), data efficiency (30%) compared to the baselines of DeiT III, RoPE-Mixed and Vision-Llama. https://github.com/Stanford-AIMI/LieRE

PDF HTML Abstract

Summarize PDF Markdown Bookmark Chat (Pro)

Authors (5)

Sophie Ostmeier (11 papers)
Brian Axelrod (11 papers)
Michael E. Moseley (1 paper)
Akshay Chaudhari (34 papers)
Curtis Langlotz (24 papers)

Tweets

YouTube

Show All Videos

LieRE: Generalizing Rotary Position Encodings (2406.10322v2)

Related Papers

Tweets

YouTube