Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

XPersona: Evaluating Multilingual Personalized Chatbot (2003.07568v2)

Published 17 Mar 2020 in cs.CL

Abstract: Personalized dialogue systems are an essential step toward better human-machine interaction. Existing personalized dialogue agents rely on properly designed conversational datasets, which are mostly monolingual (e.g., English), which greatly limits the usage of conversational agents in other languages. In this paper, we propose a multi-lingual extension of Persona-Chat, namely XPersona. Our dataset includes persona conversations in six different languages other than English for building and evaluating multilingual personalized agents. We experiment with both multilingual and cross-lingual trained baselines, and evaluate them against monolingual and translation-pipeline models using both automatic and human evaluation. Experimental results show that the multilingual trained models outperform the translation-pipeline and that they are on par with the monolingual models, with the advantage of having a single model across multiple languages. On the other hand, the state-of-the-art cross-lingual trained models achieve inferior performance to the other models, showing that cross-lingual conversation modeling is a challenging task. We hope that our dataset and baselines will accelerate research in multilingual dialogue systems.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Zhaojiang Lin (45 papers)
  2. Zihan Liu (102 papers)
  3. Genta Indra Winata (94 papers)
  4. Samuel Cahyawijaya (75 papers)
  5. Andrea Madotto (64 papers)
  6. Yejin Bang (25 papers)
  7. Etsuko Ishii (18 papers)
  8. Pascale Fung (150 papers)
Citations (57)