Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Distilling Opinions at Scale: Incremental Opinion Summarization using XL-OPSUMM (2406.10886v1)

Published 16 Jun 2024 in cs.CL and cs.LG

Abstract: Opinion summarization in e-commerce encapsulates the collective views of numerous users about a product based on their reviews. Typically, a product on an e-commerce platform has thousands of reviews, each review comprising around 10-15 words. While LLMs have shown proficiency in summarization tasks, they struggle to handle such a large volume of reviews due to context limitations. To mitigate, we propose a scalable framework called Xl-OpSumm that generates summaries incrementally. However, the existing test set, AMASUM has only 560 reviews per product on average. Due to the lack of a test set with thousands of reviews, we created a new test set called Xl-Flipkart by gathering data from the Flipkart website and generating summaries using GPT-4. Through various automatic evaluations and extensive analysis, we evaluated the framework's efficiency on two datasets, AMASUM and Xl-Flipkart. Experimental results show that our framework, Xl-OpSumm powered by Llama-3-8B-8k, achieves an average ROUGE-1 F1 gain of 4.38% and a ROUGE-L F1 gain of 3.70% over the next best-performing model.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (11)
  1. Sri Raghava Muddu (1 paper)
  2. Rupasai Rangaraju (4 papers)
  3. Tejpalsingh Siledar (5 papers)
  4. Swaroop Nath (5 papers)
  5. Pushpak Bhattacharyya (153 papers)
  6. Swaprava Nath (26 papers)
  7. Suman Banerjee (66 papers)
  8. Amey Patil (5 papers)
  9. Muthusamy Chelliah (8 papers)
  10. Sudhanshu Shekhar Singh (4 papers)
  11. Nikesh Garera (13 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.