Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

VDMA: Video Question Answering with Dynamically Generated Multi-Agents (2407.03610v1)

Published 4 Jul 2024 in cs.CV

Abstract: This technical report provides a detailed description of our approach to the EgoSchema Challenge 2024. The EgoSchema Challenge aims to identify the most appropriate responses to questions regarding a given video clip. In this paper, we propose Video Question Answering with Dynamically Generated Multi-Agents (VDMA). This method is a complementary approach to existing response generation systems by employing a multi-agent system with dynamically generated expert agents. This method aims to provide the most accurate and contextually appropriate responses. This report details the stages of our approach, the tools employed, and the results of our experiments.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Noriyuki Kugo (4 papers)
  2. Tatsuya Ishibashi (3 papers)
  3. Kosuke Ono (2 papers)
  4. Yuji Sato (4 papers)

Summary

We haven't generated a summary for this paper yet.