SOTOPIA-$\pi$: Interactive Learning of Socially Intelligent Language Agents (2403.08715v3)
Abstract: Humans learn social skills through both imitation and social interaction, yet this social learning process is largely understudied in existing research on building language agents. Motivated by this gap, we propose SOTOPIA-$\pi$, an interactive learning method that improves the social intelligence of language agents. The method combines behavior cloning and self-reinforcement training on social interaction data filtered according to LLM ratings. We show that this training allows a 7B LLM to reach the social goal completion ability of an expert model (a GPT-4-based agent), while improving the safety of language agents and maintaining general QA ability on the MMLU benchmark. We also find that this training paradigm uncovers difficulties in LLM-based evaluation of social intelligence: LLM-based evaluators overestimate the abilities of language agents trained specifically for social interaction.
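The abstract describes a two-part recipe: generate social interaction episodes (from a GPT-4-based expert agent and from the trained agent itself), have an LLM evaluator rate how well each episode achieves its social goal, keep only the highly rated episodes, and fine-tune the 7B agent on them (behavior cloning for expert data, self-reinforcement for the agent's own data). The minimal Python sketch below illustrates that loop under those assumptions; `Episode`, `rate_episode`, `finetune_on`, and `RATING_THRESHOLD` are hypothetical placeholders for illustration, not the paper's actual implementation.

```python
# Schematic sketch of rating-filtered interactive learning (assumed names throughout).
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Episode:
    """One multi-turn social interaction collected for a given social goal."""
    turns: List[str]
    source: str  # "expert" (e.g. a GPT-4-based agent) or "self" (the trained agent)


RATING_THRESHOLD = 7.0  # hypothetical cutoff on a 0-10 goal-completion scale


def rate_episode(episode: Episode) -> float:
    """Placeholder for an LLM evaluator scoring goal completion (0-10)."""
    return 8.0  # stub: a real implementation would prompt an evaluator model


def finetune_on(episodes: List[Episode]) -> None:
    """Placeholder for supervised fine-tuning of the 7B policy on kept episodes."""
    print(f"fine-tuning on {len(episodes)} filtered episodes")


def training_round(episodes: List[Episode],
                   rater: Callable[[Episode], float] = rate_episode) -> None:
    # Filter by LLM rating, then imitate: expert episodes give behavior cloning,
    # the agent's own episodes give self-reinforcement; both are fine-tuned on here.
    kept = [e for e in episodes if rater(e) >= RATING_THRESHOLD]
    finetune_on(kept)


if __name__ == "__main__":
    demo = [
        Episode(turns=["A: Hi", "B: Hello"], source="expert"),
        Episode(turns=["A: Can we negotiate the price?", "B: Sure"], source="self"),
    ]
    training_round(demo)
```

In this sketch, behavior cloning and self-reinforcement differ only in where the episodes come from; once the evaluator has filtered them by rating, both reduce to the same supervised fine-tuning step.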
- Ruiyi Wang (11 papers)
- Haofei Yu (17 papers)
- Wenxin Zhang (27 papers)
- Zhengyang Qi (6 papers)
- Maarten Sap (86 papers)
- Graham Neubig (342 papers)
- Yonatan Bisk (91 papers)
- Hao Zhu (212 papers)