Summarizing First-Person Videos from Third Persons' Points of Views (1711.08922v2)

Published 24 Nov 2017 in cs.CV

Abstract: Video highlighting, or summarization, is an active topic in computer vision that benefits a variety of applications such as viewing, searching, and storage. However, most existing studies rely on training data from third-person videos, and the resulting models do not easily generalize to first-person videos. With the goal of deriving an effective model to summarize first-person videos, we propose a novel deep neural network architecture for describing and discriminating vital spatiotemporal information across videos with different points of view. Our proposed model is realized in a semi-supervised setting, in which fully annotated third-person videos, unlabeled first-person videos, and a small number of annotated first-person videos are presented during training. In our experiments, we present qualitative and quantitative evaluations on both benchmark datasets and our own collected first-person video datasets.

Authors (3)
  1. Hsuan-I Ho (5 papers)
  2. Wei-Chen Chiu (54 papers)
  3. Yu-Chiang Frank Wang (88 papers)
Citations (28)
