‘Keep it Together’: Enforcing Cohesion in Extractive Summaries by Simulating Human Memory (2402.10643v1)

Published 16 Feb 2024 in cs.CL and cs.AI

Abstract: Extractive summaries are usually presented as lists of sentences with no expected cohesion between them. In this paper, we aim to enforce cohesion whilst controlling for informativeness and redundancy in summaries, in cases where the input exhibits high redundancy. The pipeline controls for redundancy in long inputs as it is consumed, and balances informativeness and cohesion during sentence selection. Our sentence selector simulates human memory to keep track of topics --modeled as lexical chains--, enforcing cohesive ties between noun phrases. Across a variety of domains, our experiments revealed that it is possible to extract highly cohesive summaries that nevertheless read as informative to humans as summaries extracted by only accounting for informativeness or redundancy. The extracted summaries exhibit smooth topic transitions between sentences as signaled by lexical chains, with chains spanning adjacent or near-adjacent sentences.


Summary

  • The paper presents a novel two-stage extractive summarization method that simulates human memory to balance redundancy, informativeness, and cohesion.
  • The methodology pairs a cascaded encoder for redundancy control with the KvD-Select mechanism, which builds cohesive chains of noun phrases and connects over 20% more sentences through cohesive ties.
  • Experiments across diverse domains show over 15% more noun phrases tied into cohesive chains than a greedy baseline, with human evaluation confirming the summaries read as cohesive without losing informativeness.

Enforcing Cohesion in Extractive Summaries by Simulating Human Memory

Introduction

Recent advances in extractive summarization have focused primarily on optimizing informativeness and minimizing redundancy. This paper introduces a novel approach that adds a third dimension to this optimization problem: cohesion. The authors argue that improving the readability and coherence of summaries requires modeling and enforcing cohesive ties between sentences. They achieve this through a two-stage pipeline that controls redundancy in the input and balances informativeness against cohesion during summary extraction.

Methodology

The proposed methodology introduces a cascaded encoder that handles long texts while minimizing redundancy, preventing the summarization model from being exposed to repetitive information. The sentence selector then simulates human memory, drawing on the Kintsch and van Dijk (KvD) Micro-Macro theory of text comprehension, to build and maintain cohesive chains of noun phrases, so that the sentences selected for the summary remain lexically cohesive.
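To make the first stage concrete, the following is a minimal sketch of block-wise redundancy filtering under stated assumptions: sentence similarity is approximated with token-set overlap rather than the learned representations a cascaded encoder would use, and the block size and similarity threshold are illustrative values, not the paper's settings.

```python
# Minimal sketch of block-wise redundancy control over a long input.
# Illustration only: similarity is token-set Jaccard overlap, and
# block_size / threshold are hypothetical values.

def jaccard(a: set, b: set) -> float:
    """Token-set overlap used as a cheap stand-in for semantic similarity."""
    return len(a & b) / len(a | b) if (a | b) else 0.0

def filter_redundant(sentences: list[str], block_size: int = 50,
                     threshold: float = 0.6) -> list[str]:
    """Consume the input block by block, keeping only sentences that are
    not too similar to anything already kept."""
    kept, kept_tokens = [], []
    for start in range(0, len(sentences), block_size):
        for sent in sentences[start:start + block_size]:
            tokens = set(sent.lower().split())
            if all(jaccard(tokens, seen) < threshold for seen in kept_tokens):
                kept.append(sent)
                kept_tokens.append(tokens)
    return kept
```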

The cascaded encoder processes long documents efficiently by splitting them into blocks and handling them sequentially, minimizing semantic similarity among the passages it retains. The sentence selection mechanism, KvD-Select, then incrementally builds cohesive chains of noun phrases while balancing cohesion against each sentence's informativeness. This mirrors human cognition by keeping the most salient topics (or chains) active in working memory, thereby fostering a cohesive summary.
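The selection loop can be sketched as follows. This is an illustrative reading of the memory-based idea rather than the paper's exact KvD-Select procedure: the Sentence and WorkingMemory structures, the memory capacity, and the weighting factor lam are assumptions, and noun phrases are taken as pre-extracted strings.

```python
# A sketch of memory-based sentence selection in the spirit of KvD-Select:
# a small "working memory" of lexical chains (sets of noun-phrase strings)
# stays active, and each step picks the sentence that best trades off
# informativeness against cohesion with those chains.

from dataclasses import dataclass, field

@dataclass
class Sentence:
    text: str
    info: float                 # informativeness score from an upstream model
    noun_phrases: set[str]      # noun phrases detected in the sentence

@dataclass
class WorkingMemory:
    capacity: int = 5           # how many chains stay active (assumed value)
    chains: list[set[str]] = field(default_factory=list)

    def cohesion(self, s: Sentence) -> float:
        """Fraction of the sentence's noun phrases tying into active chains."""
        if not s.noun_phrases:
            return 0.0
        tied = sum(1 for np in s.noun_phrases
                   if any(np in chain for chain in self.chains))
        return tied / len(s.noun_phrases)

    def update(self, s: Sentence) -> None:
        """Merge the sentence's noun phrases into chains; keep only the
        most recently reinforced chains (a crude proxy for salience)."""
        for np in s.noun_phrases:
            for chain in self.chains:
                if np in chain:
                    chain.add(np)
                    break
            else:
                self.chains.append({np})
        self.chains = self.chains[-self.capacity:]

def select_summary(candidates: list[Sentence], length: int,
                   lam: float = 0.7) -> list[Sentence]:
    """Greedy selection balancing informativeness and cohesion."""
    memory, summary, pool = WorkingMemory(), [], list(candidates)
    while pool and len(summary) < length:
        best = max(pool, key=lambda s: lam * s.info + (1 - lam) * memory.cohesion(s))
        summary.append(best)
        memory.update(best)
        pool.remove(best)
    return summary
```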

Experimental Results

The methodology was tested across several domains, including newswire, scientific articles, patents, and government reports. The results show a significant reduction in summary redundancy, with the sentence selector maintaining high informativeness while substantially improving cohesion: compared to a baseline greedy selector, over 15% more noun phrases and over 20% more sentences are connected through cohesive ties. Human evaluation further confirmed that summaries generated with the proposed method are perceived as more cohesive without compromising informativeness.
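One simple way to approximate the kind of cohesion signal reported here is to measure how many adjacent sentence pairs in a summary share lexical material. The heuristic below substitutes lower-cased content words for real noun-phrase or lexical-chain detection, so it is only a rough stand-in for the paper's evaluation protocol.

```python
# Rough sketch of a cohesion signal: the share of adjacent sentence pairs
# in a summary linked by at least one shared content word. The stopword
# list and tokenization are deliberately trivial.

STOPWORDS = {"the", "a", "an", "of", "in", "on", "and", "to", "is", "are"}

def content_words(sentence: str) -> set[str]:
    return {w.strip(".,;:").lower() for w in sentence.split()} - STOPWORDS

def adjacent_cohesion(summary: list[str]) -> float:
    """Fraction of adjacent sentence pairs sharing at least one content word."""
    if len(summary) < 2:
        return 1.0
    linked = sum(1 for a, b in zip(summary, summary[1:])
                 if content_words(a) & content_words(b))
    return linked / (len(summary) - 1)
```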

The paper situates its contributions among prior work on text summarization that has addressed cohesion to varying extents. Earlier approaches model cohesion by tracking named entities, following topic flow, or applying discourse theories. Unlike those models, this paper quantifies summary properties separately and models cohesion explicitly during sentence selection, allowing more precise control over both informativeness and cohesion.

Implications and Future Directions

The presented work demonstrates that controlling for cohesion, alongside informativeness and redundancy, can yield summaries that are not only compact but also readable and coherent. This has practical implications for the development of summarization tools for domains where accuracy and readability are critical, such as scientific literature and legal documents. The paper also opens up avenues for future research, including exploring post-editing techniques to further enhance coherence and integrating the method with generative models to reduce hallucinations and improve stylistic consistency.

Conclusion

This paper introduces a novel approach to extractive text summarization that prioritizes cohesion, alongside informativeness and redundancy. By simulating human memory processes, the proposed methodology is able to produce summaries that are both informative and cohesive. The findings highlight the importance of considering cohesion as a critical factor in summary quality and suggest potential paths for integrating these insights into more advanced summarization technologies.
