Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review (2207.00782v1)

Published 2 Jul 2022 in cs.AI and cs.HC

Abstract: The intelligent dialogue system, aiming at communicating with humans harmoniously with natural language, is brilliant for promoting the advancement of human-machine interaction in the era of artificial intelligence. With the gradually complex human-computer interaction requirements (e.g., multimodal inputs, time sensitivity), it is difficult for traditional text-based dialogue system to meet the demands for more vivid and convenient interaction. Consequently, Visual Context Augmented Dialogue System (VAD), which has the potential to communicate with humans by perceiving and understanding multimodal information (i.e., visual context in images or videos, textual dialogue history), has become a predominant research paradigm. Benefiting from the consistency and complementarity between visual and textual context, VAD possesses the potential to generate engaging and context-aware responses. For depicting the development of VAD, we first characterize the concepts and unique features of VAD, and then present its generic system architecture to illustrate the system workflow. Subsequently, several research challenges and representative works are detailed investigated, followed by the summary of authoritative benchmarks. We conclude this paper by putting forward some open issues and promising research trends for VAD, e.g., the cognitive mechanisms of human-machine dialogue under cross-modal dialogue context, and knowledge-enhanced cross-modal semantic interaction.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Hao Wang (1120 papers)
  2. Bin Guo (150 papers)
  3. Yating Zeng (1 paper)
  4. Yasan Ding (10 papers)
  5. Chen Qiu (43 papers)
  6. Ying Zhang (389 papers)
  7. Lina Yao (194 papers)
  8. Zhiwen Yu (78 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.