Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Datasets (2210.12687v1)

Published 23 Oct 2022 in cs.CL and cs.AI

Abstract: To build open-domain chatbots that are able to use diverse communicative skills, we propose a novel framework BotsTalk, where multiple agents grounded to the specific target skills participate in a conversation to automatically annotate multi-skill dialogues. We further present Blended Skill BotsTalk (BSBT), a large-scale multi-skill dialogue dataset comprising 300K conversations. Through extensive experiments, we demonstrate that our dataset can be effective for multi-skill dialogue systems which require an understanding of skill blending as well as skill grounding. Our code and data are available at https://github.com/convei-lab/BotsTalk.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Minju Kim (12 papers)
  2. Chaehyeong Kim (3 papers)
  3. Yongho Song (5 papers)
  4. Seung-won Hwang (59 papers)
  5. Jinyoung Yeo (46 papers)
Citations (12)

Summary

We haven't generated a summary for this paper yet.