Effectiveness of chunkization for LLM summarization
Determine whether the chunkization approach—splitting a long document into smaller segments, summarizing each segment with a large language model such as GPT-3.5-turbo or GPT-4, and aggregating the outputs—performs equally well as single-pass summarization of the entire document by a large language model with a sufficiently large context window.
References
However, it is not clear whether this approach works equally well for summarization tasks.
— A Scoping Review of ChatGPT Research in Accounting and Finance
(2412.05731 - Dong et al., 7 Dec 2024) in Appendix: Technical Guide — Context Window