DevGPT: Studying Developer-ChatGPT Conversations

08/31/2023
by Tao Xiao, et al.

The emergence of large language models (LLMs) such as ChatGPT has disrupted the landscape of software development. Many studies are investigating, among other topics, the quality of responses generated by ChatGPT, the efficacy of various prompting techniques, and its comparative performance in programming contests. Yet we know very little about how ChatGPT is actually used by software developers. What questions do developers present to ChatGPT? What are the dynamics of these interactions? What is the backdrop against which these conversations are held, and how do the conversations feed back into the artifacts of their work? To close this gap, we introduce DevGPT, a curated dataset that encompasses 17,913 prompts and ChatGPT's responses, including 11,751 code snippets, coupled with the corresponding software development artifacts (source code, commits, issues, pull requests, discussions, and Hacker News threads) to enable the analysis of the context and implications of these developer interactions with ChatGPT.

research
07/18/2022

Software Artifact Mining in Software Engineering Conferences: A Meta-Analysis

Background: Software development results in the production of various ty...
research
11/16/2020

The Software Heritage Graph Dataset: Large-scale Analysis of Public Software Development History

Software Heritage is the largest existing public archive of software sou...
research
02/14/2023

The Programmer's Assistant: Conversational Interaction with a Large Language Model for Software Development

Large language models (LLMs) have recently been applied in software engi...
research
04/11/2023

APISENS- Sentiment Scoring Tool for APIs with Crowd-Knowledge

Utilizing pre-existing software artifacts, such as libraries and Applica...
research
01/17/2020

Eye of the Mind: Image Processing for Social Coding

Developers are increasingly sharing images in social coding environments...
research
02/26/2022

Automated Identification of Toxic Code Reviews: How Far Can We Go?

Toxic conversations during software development interactions may have se...
research
10/15/2021

Nalanda: A Socio-Technical Graph for Building Software Analytics Tools at Enterprise Scale

Software development is information-dense knowledge work that requires c...
