Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Convolutional Hierarchical Attention Network for Query-Focused Video Summarization (2002.03740v3)

Published 31 Jan 2020 in cs.CV

Abstract: Previous approaches for video summarization mainly concentrate on finding the most diverse and representative visual contents as video summary without considering the user's preference. This paper addresses the task of query-focused video summarization, which takes user's query and a long video as inputs and aims to generate a query-focused video summary. In this paper, we consider the task as a problem of computing similarity between video shots and query. To this end, we propose a method, named Convolutional Hierarchical Attention Network (CHAN), which consists of two parts: feature encoding network and query-relevance computing module. In the encoding network, we employ a convolutional network with local self-attention mechanism and query-aware global attention mechanism to learns visual information of each shot. The encoded features will be sent to query-relevance computing module to generate queryfocused video summary. Extensive experiments on the benchmark dataset demonstrate the competitive performance and show the effectiveness of our approach.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Shuwen Xiao (2 papers)
  2. Zhou Zhao (219 papers)
  3. Zijian Zhang (125 papers)
  4. Xiaohui Yan (9 papers)
  5. Min Yang (239 papers)
Citations (45)

Summary

We haven't generated a summary for this paper yet.