
Audio Visual Scene-Aware Dialog (AVSD) Challenge at DSTC7 (1806.00525v1)

Published 1 Jun 2018 in cs.CL and cs.CV

Abstract: Scene-aware dialog systems will be able to have conversations with users about the objects and events around them. Progress on such systems can be made by integrating state-of-the-art technologies from multiple research areas, including end-to-end dialog systems, visual dialog, and video description. We introduce the Audio Visual Scene-Aware Dialog (AVSD) challenge and dataset. In this challenge, which is one track of the 7th Dialog System Technology Challenges (DSTC7) workshop, the task is to build a system that generates responses in a dialog about an input video.

Authors (11)
  1. Huda Alamri
  2. Vincent Cartillier
  3. Raphael Gontijo Lopes
  4. Abhishek Das
  5. Jue Wang
  6. Irfan Essa
  7. Dhruv Batra
  8. Devi Parikh
  9. Anoop Cherian
  10. Tim K. Marks
  11. Chiori Hori