Presence Factor-Oriented Blog Summarization (1302.7131v1)
Abstract: The research that has been carried out on blogs focused on blog posts only, ignoring the title of the blog page. Also, in summarization only a set of representative sentences are extracted. Some analysis has been done and it has been found that the blog post contains the content that is likely to be related to the topic of the blog post. Thus, proposed system of summarization makes use of title contained in a blog page. The approach makes use of the Presence factor that indicates the presence of each term of the title in each sentence of the blog post. This is a key feature because it considers those sentences as more relevant for summarization that contain each of the term present in the title. The system has been implemented and evaluated experimentally. The system has shown promising results.