Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

LBMT team at VLSP2022-Abmusu: Hybrid method with text correlation and generative models for Vietnamese multi-document summarization (2304.05205v1)

Published 11 Apr 2023 in cs.CL

Abstract: Multi-document summarization is challenging because the summaries should not only describe the most important information from all documents but also provide a coherent interpretation of the documents. This paper proposes a method for multi-document summarization based on cluster similarity. In the extractive method we use hybrid model based on a modified version of the PageRank algorithm and a text correlation considerations mechanism. After generating summaries by selecting the most important sentences from each cluster, we apply BARTpho and ViT5 to construct the abstractive models. Both extractive and abstractive approaches were considered in this study. The proposed method achieves competitive results in VLSP 2022 competition.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Tan-Minh Nguyen (7 papers)
  2. Thai-Binh Nguyen (9 papers)
  3. Hoang-Trung Nguyen (4 papers)
  4. Hai-Long Nguyen (6 papers)
  5. Tam Doan Thanh (2 papers)
  6. Ha-Thanh Nguyen (33 papers)
  7. Thi-Hai-Yen Vuong (13 papers)

Summary

We haven't generated a summary for this paper yet.