2000 character limit reached
ZeroShotDataAug: Generating and Augmenting Training Data with ChatGPT
Published 27 Apr 2023 in cs.AI | (2304.14334v1)
Abstract: In this paper, we investigate the use of data obtained from prompting a large generative LLM, ChatGPT, to generate synthetic training data with the aim of augmenting data in low resource scenarios. We show that with appropriate task-specific ChatGPT prompts, we outperform the most popular existing approaches for such data augmentation. Furthermore, we investigate methodologies for evaluating the similarity of the augmented data generated from ChatGPT with the aim of validating and assessing the quality of the data generated.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.