Large Language Models for Software Engineering: A Systematic Literature Review

08/21/2023
by Xinyi Hou, et al.

Large Language Models (LLMs) have significantly impacted numerous domains, including Software Engineering (SE). Many recent publications have explored LLMs applied to various SE tasks. Nevertheless, a comprehensive understanding of the application, effects, and possible limitations of LLMs in SE is still in its early stages. To bridge this gap, we conducted a systematic literature review on LLMs for SE (LLM4SE), with a particular focus on understanding how LLMs can be exploited to optimize processes and outcomes. We collected and analyzed 229 research papers from 2017 to 2023 to answer four key research questions (RQs). In RQ1, we categorize the different LLMs that have been employed in SE tasks, characterizing their distinctive features and uses. In RQ2, we analyze the methods used in data collection, preprocessing, and application, highlighting the role of well-curated datasets in successful LLM4SE implementation. RQ3 investigates the strategies employed to optimize and evaluate the performance of LLMs in SE. Finally, RQ4 examines the specific SE tasks where LLMs have shown success to date, illustrating their practical contributions to the field. From the answers to these RQs, we discuss the current state of the art and trends, identify gaps in existing research, and flag promising areas for future study.


