Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future (2403.14659v1)

Published 28 Feb 2024 in cs.CY, cs.AI, and cs.CL

Abstract: As NLP systems become increasingly integrated into human social life, these technologies will need to increasingly rely on social intelligence. Although there are many valuable datasets that benchmark isolated dimensions of social intelligence, there does not yet exist any body of work to join these threads into a cohesive subfield in which researchers can quickly identify research gaps and future directions. Towards this goal, we build a Social AI Data Infrastructure, which consists of a comprehensive social AI taxonomy and a data library of 480 NLP datasets. Our infrastructure allows us to analyze existing dataset efforts, and also evaluate LLMs' performance in different social intelligence aspects. Our analyses demonstrate its utility in enabling a thorough understanding of current data landscape and providing a holistic perspective on potential directions for future dataset development. We show there is a need for multifaceted datasets, increased diversity in language and culture, more long-tailed social situations, and more interactive data in future social intelligence data efforts.

References (115)

Authors (4)

Minzhi Li (8 papers)
Weiyan Shi (42 papers)
Caleb Ziems (22 papers)
Diyi Yang (151 papers)

Citations (6)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/shi_weiyan/status/1823187073295176057

https://twitter.com/EllaMinzhiLi/status/1823187232448028875

https://twitter.com/gastronomy/status/1772113486165750090

HackerNews

Social Intelligence Data Infrastructure (1 point, 0 comments)

Social Intelligence Data Infrastructure: Structuring the Present and Navigating the Future (2403.14659v1)

Summary

Related Papers

Tweets

HackerNews