
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems (2402.13374v1)

Published 20 Feb 2024 in cs.CL

Abstract: In the realm of dialogue systems, user simulation techniques have emerged as a game-changer, redefining the evaluation and enhancement of task-oriented dialogue (TOD) systems. These methods are crucial for replicating real user interactions, enabling applications like synthetic data augmentation, error detection, and robust evaluation. However, existing approaches often rely on rigid rule-based methods or on annotated data. This paper introduces DAUS, a Domain-Aware User Simulator. Leveraging LLMs, we fine-tune DAUS on real examples of task-oriented dialogues. Results on two relevant benchmarks showcase significant improvements in terms of user goal fulfillment. Notably, we have observed that fine-tuning enhances the simulator's coherence with user goals, effectively mitigating hallucinations -- a major source of inconsistencies in simulator responses.

Authors (8)
  1. Silvia Terragni (8 papers)
  2. Victor Guimarães (7 papers)
  3. Nghia Khau (3 papers)
  4. Bruna Guedes (3 papers)
  5. Modestas Filipavicius (4 papers)
  6. André Ferreira Manso (1 paper)
  7. Roland Mathis (8 papers)
  8. Ivan Sekulić (12 papers)
Citations (1)