Do Transformer Models Show Similar Attention Patterns to Task-Specific Human Gaze? (2205.10226v1)

Published 25 Apr 2022 in cs.CL and cs.LG

Abstract: Learned self-attention functions in state-of-the-art NLP models often correlate with human attention. We investigate whether self-attention in large-scale pre-trained language models is as predictive of human eye fixation patterns during task-reading as classical cognitive models of human attention. We compare attention functions across two task-specific reading datasets for sentiment analysis and relation extraction. We find the predictiveness of large-scale pre-trained self-attention for human attention depends on 'what is in the tail', e.g., the syntactic nature of rare contexts. Further, we observe that task-specific fine-tuning does not increase the correlation with human task-specific reading. Through an input reduction experiment we give complementary insights on the sparsity and fidelity trade-off, showing that lower-entropy attention vectors are more faithful.
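
The core comparison in the abstract, correlating model attention with human fixations and measuring attention entropy, can be illustrated with a minimal sketch. This is not the authors' pipeline: the model choice (bert-base-uncased), the example sentence, and the fixation_durations values are all hypothetical placeholders, and real eye-tracking data would require careful alignment between word-pieces and words.

    import numpy as np
    import torch
    from scipy.stats import spearmanr
    from transformers import AutoModel, AutoTokenizer

    # Hypothetical example: compare per-token self-attention with
    # word-level human fixation durations (illustrative only).
    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = AutoModel.from_pretrained("bert-base-uncased", output_attentions=True)

    sentence = "The movie was surprisingly good despite its slow start"
    # Illustrative fixation durations (ms) per word; real values would come
    # from an eye-tracking corpus aligned to the model's tokenization.
    fixation_durations = np.array([120, 310, 150, 400, 380, 290, 140, 260, 330], dtype=float)

    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)

    # Average attention received by each token across heads and query
    # positions in the last layer (one of several possible aggregations).
    last_layer = outputs.attentions[-1][0]                      # (heads, seq, seq)
    attention_per_token = last_layer.mean(dim=(0, 1)).numpy()   # (seq,)

    # Drop special tokens and truncate so lengths match the word-level gaze
    # vector; proper word-piece-to-word alignment is more involved.
    attention_words = attention_per_token[1:-1][: len(fixation_durations)]

    rho, p = spearmanr(attention_words, fixation_durations[: len(attention_words)])
    print(f"Spearman correlation with gaze: {rho:.3f} (p={p:.3f})")

    # Entropy of the re-normalised attention vector: lower entropy means
    # sparser attention over tokens.
    p_attn = attention_words / attention_words.sum()
    entropy = -(p_attn * np.log(p_attn + 1e-12)).sum()
    print(f"Attention entropy: {entropy:.3f}")

A rank correlation such as Spearman's rho is a natural choice here because fixation durations and attention weights live on different scales; only their relative ordering over tokens is compared.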

Authors (4)
  1. Stephanie Brandl (14 papers)
  2. Oliver Eberle (14 papers)
  3. Jonas Pilot (1 paper)
  4. Anders Søgaard (121 papers)
Citations (27)