Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ChatGPT as Data Augmentation for Compositional Generalization: A Case Study in Open Intent Detection (2308.13517v1)

Published 25 Aug 2023 in cs.CL and cs.AI

Abstract: Open intent detection, a crucial aspect of natural language understanding, involves the identification of previously unseen intents in user-generated text. Despite the progress made in this field, challenges persist in handling new combinations of language components, which is essential for compositional generalization. In this paper, we present a case study exploring the use of ChatGPT as a data augmentation technique to enhance compositional generalization in open intent detection tasks. We begin by discussing the limitations of existing benchmarks in evaluating this problem, highlighting the need for constructing datasets for addressing compositional generalization in open intent detection tasks. By incorporating synthetic data generated by ChatGPT into the training process, we demonstrate that our approach can effectively improve model performance. Rigorous evaluation of multiple benchmarks reveals that our method outperforms existing techniques and significantly enhances open intent detection capabilities. Our findings underscore the potential of LLMs like ChatGPT for data augmentation in natural language understanding tasks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Yihao Fang (9 papers)
  2. Xianzhi Li (38 papers)
  3. Stephen W. Thomas (3 papers)
  4. Xiaodan Zhu (94 papers)
Citations (10)