IDS at SemEval-2020 Task 10: Does Pre-trained Language Model Know What to Emphasize? (2007.12390v1)
Published 24 Jul 2020 in cs.CL
Abstract: We propose a novel method that determines which words in written text deserve emphasis in visual media, relying only on the self-attention distributions of pre-trained language models (PLMs). With extensive experiments and analyses, we show that 1) our zero-shot approach is superior to a reasonable baseline that adopts TF-IDF, and that 2) several attention heads in PLMs are specialized for emphasis selection, confirming that PLMs are capable of recognizing important words in sentences.
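To illustrate the core idea, here is a minimal sketch of zero-shot emphasis scoring from self-attention, using the HuggingFace Transformers library. The abstract does not specify the exact pipeline; the model choice (`bert-base-uncased`) and the aggregation (averaging the attention each token receives over all layers, heads, and query positions) are illustrative assumptions, not the authors' method, which instead identifies individual heads specialized for emphasis selection.

```python
# Minimal sketch: rank tokens by the self-attention they receive from a
# pre-trained model, zero-shot. The model ("bert-base-uncased") and the
# aggregation (mean over all layers, heads, and query positions) are
# illustrative assumptions, not the paper's exact procedure.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased", output_attentions=True)
model.eval()

def emphasis_scores(sentence):
    inputs = tokenizer(sentence, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**inputs)
    # outputs.attentions: one tensor per layer, each (batch, heads, query, key).
    attn = torch.stack(outputs.attentions)  # (layers, 1, heads, seq, seq)
    # Attention *received* by each token: average over layers, batch,
    # heads, and query positions, leaving the key dimension.
    received = attn.mean(dim=(0, 1, 2, 3))  # (seq,)
    tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
    # Special tokens such as [CLS]/[SEP] absorb much attention; drop them.
    pairs = [(t, s) for t, s in zip(tokens, received.tolist())
             if t not in tokenizer.all_special_tokens]
    return sorted(pairs, key=lambda p: p[1], reverse=True)

print(emphasis_scores("grow through what you go through"))
```

In practice, subword pieces would also need to be merged back into word-level scores before emphasis labels are assigned.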