A Survey on Long Text Modeling with Transformers (2302.14502v1)

Published 28 Feb 2023 in cs.CL

Abstract: Modeling long texts has long been an essential technique in NLP. With the ever-growing number of long documents, it is important to develop effective methods that can process and analyze such texts. However, long texts pose significant research challenges for existing text models, as they exhibit more complex semantics and special characteristics. In this paper, we provide an overview of recent advances in long text modeling based on Transformer models. First, we introduce the formal definition of long text modeling. Then, as the core content, we discuss how to process long input to satisfy the length limitation and how to design improved Transformer architectures that effectively extend the maximum context length. Following this, we discuss how to adapt Transformer models to capture the special characteristics of long texts. Finally, we describe four typical applications involving long text modeling and conclude the paper with a discussion of future directions. Our survey intends to provide researchers with a synthesis of, and pointers to, related work on long text modeling.
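To make the "process long input to satisfy the length limitation" step concrete, below is a minimal sketch of fixed-length chunking with overlap, one standard input-processing strategy for fitting long documents into a Transformer with a bounded context window. The function and parameter names (`chunk_tokens`, `max_len`, `stride`) are illustrative assumptions, not taken from the paper.

```python
# Sliding-window chunking: split a long token sequence into
# overlapping windows so each window fits a fixed-length model.
# All names here are illustrative, not from the survey itself.

def chunk_tokens(tokens, max_len=512, stride=384):
    """Split `tokens` into windows of at most `max_len` tokens.
    Consecutive windows overlap by max_len - stride tokens, so
    context at chunk boundaries is not lost."""
    if stride <= 0 or stride > max_len:
        raise ValueError("stride must be in (0, max_len]")
    chunks = []
    for start in range(0, len(tokens), stride):
        chunks.append(tokens[start:start + max_len])
        if start + max_len >= len(tokens):
            break  # last window already covers the tail
    return chunks

# Example: a 1,200-token document becomes three overlapping windows.
doc = list(range(1200))
windows = chunk_tokens(doc)
print([(w[0], w[-1]) for w in windows])  # [(0, 511), (384, 895), (768, 1199)]
```

Each window is then encoded independently (or with cross-chunk aggregation), trading full-document attention for tractable per-chunk computation; the overlap size controls how much boundary context is preserved.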

Authors (4)
  1. Zican Dong (12 papers)
  2. Tianyi Tang (30 papers)
  3. Lunyi Li (1 paper)
  4. Wayne Xin Zhao (196 papers)
Citations (43)
