Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DevGPT: Studying Developer-ChatGPT Conversations (2309.03914v2)

Published 31 Aug 2023 in cs.SE

Abstract: This paper introduces DevGPT, a dataset curated to explore how software developers interact with ChatGPT, a prominent LLM. The dataset encompasses 29,778 prompts and responses from ChatGPT, including 19,106 code snippets, and is linked to corresponding software development artifacts such as source code, commits, issues, pull requests, discussions, and Hacker News threads. This comprehensive dataset is derived from shared ChatGPT conversations collected from GitHub and Hacker News, providing a rich resource for understanding the dynamics of developer interactions with ChatGPT, the nature of their inquiries, and the impact of these interactions on their work. DevGPT enables the study of developer queries, the effectiveness of ChatGPT in code generation and problem solving, and the broader implications of AI-assisted programming. By providing this dataset, the paper paves the way for novel research avenues in software engineering, particularly in understanding and improving the use of LLMs like ChatGPT by developers.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Tao Xiao (23 papers)
  2. Christoph Treude (137 papers)
  3. Hideaki Hata (48 papers)
  4. Kenichi Matsumoto (73 papers)
Citations (17)
X Twitter Logo Streamline Icon: https://streamlinehq.com