Exploring Key Point Analysis with Pairwise Generation and Graph Partitioning (2404.11384v1)
Abstract: Key Point Analysis (KPA), the summarization of multiple arguments into a concise collection of key points, continues to be a significant and unresolved issue within the field of argument mining. Existing models adapt a two-stage pipeline of clustering arguments or generating key points for argument clusters. This approach rely on semantic similarity instead of measuring the existence of shared key points among arguments. Additionally, it only models the intra-cluster relationship among arguments, disregarding the inter-cluster relationship between arguments that do not share key points. To address these limitations, we propose a novel approach for KPA with pairwise generation and graph partitioning. Our objective is to train a generative model that can simultaneously provide a score indicating the presence of shared key point between a pair of arguments and generate the shared key point. Subsequently, to map generated redundant key points to a concise set of key points, we proceed to construct an arguments graph by considering the arguments as vertices, the generated key points as edges, and the scores as edge weights. We then propose a graph partitioning algorithm to partition all arguments sharing the same key points to the same subgraph. Notably, our experimental findings demonstrate that our proposed model surpasses previous models when evaluated on both the ArgKP and QAM datasets.
- Key point analysis via contrastive learning and extractive argument summarization. In Proceedings of the 8th Workshop on Argument Mining, pages 184–189, Punta Cana, Dominican Republic. Association for Computational Linguistics.
- From arguments to key points: Towards automatic argument summarization. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 4029–4039, Online. Association for Computational Linguistics.
- Every bite is an experience: Key Point Analysis of business reviews. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 3376–3386, Online. Association for Computational Linguistics.
- Quantitative argument summarization and beyond: Cross-domain key point analysis. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 39–49, Online. Association for Computational Linguistics.
- Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 2008:P10008.
- Recent advances in graph partitioning. In Algorithm Engineering.
- From key points to key point hierarchy: Structured and expressive opinion summarization. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 912–928, Toronto, Canada. Association for Computational Linguistics.
- Finding community structure in very large networks. Physical review. E, Statistical, nonlinear, and soft matter physics, 70 6 Pt 2:066111.
- Gennaro Cordasco and Luisa Gargano. 2010. Community detection via semi-synchronous label propagation algorithms. In 2010 IEEE International Workshop on: Business Applications of Social Network Analysis (BASNA), pages 1–8.
- Nicolas Dugué and Anthony Perez. 2015. Directed louvain : maximizing modularity in directed networks.
- Overview of the 2021 key point analysis shared task. In Proceedings of the 8th Workshop on Argument Mining, pages 154–164, Punta Cana, Dominican Republic. Association for Computational Linguistics.
- SimCSE: Simple contrastive learning of sentence embeddings. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 6894–6910, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
- AQE: Argument quadruplet extraction via a quad-tagging augmented generative approach. In Findings of the Association for Computational Linguistics: ACL 2023, pages 932–946, Toronto, Canada. Association for Computational Linguistics.
- Team enigma at ArgMining-EMNLP 2021: Leveraging pre-trained language models for key point matching. In Proceedings of the 8th Workshop on Argument Mining, pages 200–205, Punta Cana, Dominican Republic. Association for Computational Linguistics.
- An efficient heuristic procedure for partitioning graphs. The Bell System Technical Journal, 49(2):291–307.
- Supervised contrastive learning. In Advances in Neural Information Processing Systems, volume 33, pages 18661–18673. Curran Associates, Inc.
- Partitioning graphs into balanced components. In Proceedings of the Twentieth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’09, page 942–949, USA. Society for Industrial and Applied Mathematics.
- Do you hear the people sing? key point analysis via iterative clustering and abstractive summarisation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 14064–14080, Toronto, Canada. Association for Computational Linguistics.
- Chin-Yew Lin. 2004. ROUGE: A package for automatic evaluation of summaries. In Text Summarization Branches Out, pages 74–81, Barcelona, Spain. Association for Computational Linguistics.
- Roberta: A robustly optimized bert pretraining approach. ArXiv, abs/1907.11692.
- Stuart P. Lloyd. 1982. Least squares quantization in pcm. IEEE Trans. Inf. Theory, 28:129–136.
- Conditional set generation using seq2seq models. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 4874–4896, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
- Inderjeet Mani and Eric Bloedorn. 1997. Multi-document summarization by graph search and matching. In AAAI/IAAI.
- Daniel Margo and Margo Seltzer. 2015. A scalable distributed graph partitioner. Proc. VLDB Endow., 8(12):1478–1489.
- The pagerank citation ranking : Bringing order to the web. In The Web Conference.
- Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E, 76:036106.
- Bleurt: Learning robust metrics for text generation. In Proceedings of ACL.
- From louvain to leiden: guaranteeing well-connected communities. Scientific Reports, 9.
- Large language models enable few-shot clustering. ArXiv, abs/2307.00524.
- C-pack: Packaged resources to advance general chinese embedding. ArXiv, abs/2309.07597.
- PRIMERA: Pyramid-based masked sentence pre-training for multi-document summarization. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 5245–5263, Dublin, Ireland. Association for Computational Linguistics.
- Pegasus: Pre-training with extracted gap-sentences for abstractive summarization. In Proceedings of the 37th International Conference on Machine Learning, ICML’20. JMLR.org.
- Clusterllm: Large language models as a guide for text clustering. ArXiv, abs/2305.14871.