DeVAn: Dense Video Annotation for Video-Language Models (2310.05060v2)

Published 8 Oct 2023 in cs.CV and cs.AI

Abstract: We present a novel human-annotated dataset for evaluating the ability of visual-LLMs to generate both short and long descriptions for real-world video clips, termed DeVAn (Dense Video Annotation). The dataset contains 8.5K YouTube video clips of 20-60 seconds in duration and covers a wide range of topics and interests. Each video clip is independently annotated by 5 human annotators, producing both captions (1 sentence) and summaries (3-10 sentences). Given any video selected from the dataset and its corresponding ASR information, we evaluate visual-LLMs on either caption or summary generation that is grounded in both the visual and auditory content of the video. Models are also evaluated on caption- and summary-based retrieval tasks, where the summary-based retrieval task requires the identification of a target video given excerpts of a given summary. Given the novel nature of the paragraph-length video summarization task, we compared different existing evaluation metrics and their alignment with human preferences and found that model-based evaluation metrics provide more semantically-oriented and human-aligned evaluation. Finally, we benchmarked a wide range of current video-LLMs on DeVAn, and we aim for DeVAn to serve as a useful evaluation set in the age of LLMs and complex multi-modal tasks. Code is available at https://github.com/TK-21st/DeVAn.

Authors (8)
  1. Tingkai Liu (9 papers)
  2. Yunzhe Tao (20 papers)
  3. Haogeng Liu (8 papers)
  4. Qihang Fan (13 papers)
  5. Ding Zhou (10 papers)
  6. Huaibo Huang (58 papers)
  7. Ran He (172 papers)
  8. Hongxia Yang (130 papers)
Citations (3)