Efficient-Empathy: Towards Efficient and Effective Selection of Empathy Data (2407.01937v2)

Published 2 Jul 2024 in cs.CL

Abstract: In recent years, with the rapid advancements in LLMs, achieving excellent empathetic response capability has become a crucial prerequisite. Consequently, managing and understanding large-scale empathetic datasets has gained increasing importance. However, empathetic data are typically used for training without any quality selection, leading to inefficient data usage and wasted computational resources. Additionally, training on raw data can result in low performance in empathetic dialogues. In this work, we present Efficient-Empathy, a sensibility and rationality score-based data selection algorithm that automatically selects sensibility and rationality data while discarding low-quality data. With only the sensibility data (59% of the full dataset), our trained sensibility model efficiently achieves state-of-the-art (SoTA) performance. Furthermore, the sensibility model demonstrates SoTA performance across multiple data selection hyperparameters, showcasing the robustness of our method. By integrating sensibility and rationality data with a MoE structure, we achieve even higher performance, demonstrating the effectiveness of our Efficient-Empathy algorithm.
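
To make the selection step described in the abstract concrete, below is a minimal Python sketch of score-based data partitioning. It assumes each dialogue sample has already been assigned sensibility and rationality scores (e.g., by an LLM judge); the field names, thresholds, and routing rule are hypothetical illustrations, not values or logic taken from the paper.

```python
# Minimal sketch of score-based empathy-data selection.
# Assumption: each sample carries precomputed sensibility and
# rationality scores in [0, 1]; thresholds below are illustrative.
from dataclasses import dataclass


@dataclass
class Sample:
    dialogue: str
    sensibility: float  # affective/emotional quality score
    rationality: float  # cognitive/reasoning quality score


def select(samples, s_thresh=0.7, r_thresh=0.7):
    """Partition samples into sensibility data, rationality data,
    and discarded low-quality data."""
    sensibility_data, rationality_data, discarded = [], [], []
    for s in samples:
        if s.sensibility >= s_thresh:
            sensibility_data.append(s)
        elif s.rationality >= r_thresh:
            rationality_data.append(s)
        else:
            # Low on both axes: drop before training to save compute.
            discarded.append(s)
    return sensibility_data, rationality_data, discarded
```

On this reading, the reported 59% figure would correspond to the fraction of the full dataset whose sensibility score clears the threshold, with the sensibility and rationality subsets then usable to train separate experts combined in a MoE model.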

Authors (7)
  1. Linzhuang Sun (18 papers)
  2. Hao Liang (137 papers)
  3. Jingxuan Wei (21 papers)
  4. Linkun Sun (2 papers)
  5. Bihui Yu (16 papers)
  6. Bin Cui (165 papers)
  7. Wentao Zhang (261 papers)
Citations (1)
