A Survey on Pretrained Language Models for Neural Code Intelligence

12/20/2022
by   Yichen Xu, et al.
0

As the complexity of modern software continues to escalate, software engineering has become an increasingly daunting and error-prone endeavor. In recent years, the field of Neural Code Intelligence (NCI) has emerged as a promising solution, leveraging the power of deep learning techniques to tackle analytical tasks on source code with the goal of improving programming efficiency and minimizing human errors within the software industry. Pretrained language models have become a dominant force in NCI research, consistently delivering state-of-the-art results across a wide range of tasks, including code summarization, generation, and translation. In this paper, we present a comprehensive survey of the NCI domain, including a thorough review of pretraining techniques, tasks, datasets, and model architectures. We hope this paper will serve as a bridge between the natural language and programming language communities, offering insights for future research in this rapidly evolving field.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2023

CodeTF: One-stop Transformer Library for State-of-the-art Code LLM

Code intelligence plays a key role in transforming modern software engin...
research
05/24/2022

Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models of Source Code

Recent years have seen the successful application of deep learning to so...
research
05/03/2022

A Survey of Deep Learning Models for Structural Code Understanding

In recent years, the rise of deep learning and automation requirements i...
research
05/12/2021

Improving Code Autocompletion with Transfer Learning

Software language models have achieved promising results predicting code...
research
07/11/2019

Warfarin dose estimation on multiple datasets with automated hyperparameter optimisation and a novel software framework

Warfarin is an effective preventative treatment for arterial and venous ...
research
01/10/2022

Better Modeling the Programming World with Code Concept Graphs-augmented Multi-modal Learning

The progress made in code modeling has been tremendous in recent years t...
research
05/15/2023

Improving ChatGPT Prompt for Code Generation

Automated code generation can be a powerful technique for software devel...

Please sign up or login with your details

Forgot password? Click here to reset