Role of natural language supervision in processing nuanced multimodal social signals
Determine the role and effectiveness of natural language supervision in processing nuanced multimodal social signals for Social-AI systems, including whether language can reliably guide social perception and behavior generation across modalities.
References
While scientists are studying the capacity of language to scaffold visual understanding, audio understanding, and virtual agent motion generation, the role of natural language supervision for processing nuanced multimodal social signals remains an open question.
— Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions
(2404.11023 - Mathur et al., 17 Apr 2024) in Section 4, Subsection (C2) Nuanced Signals, C2 Opportunities and Open Questions