Retrieval Augmented Code Generation and Summarization

08/26/2021
by   Md Rizwan Parvez, et al.
0

Software developers write a lot of source code and documentation during software development. Intrinsically, developers often recall parts of source code or code summaries that they had written in the past while implementing software or documenting them. To mimic developers' code or summary generation behavior, we propose a retrieval augmented framework, REDCODER, that retrieves relevant code or summaries from a retrieval database and provides them as a supplement to code generation or summarization models. REDCODER has a couple of uniqueness. First, it extends the state-of-the-art dense retrieval technique to search for relevant code or summaries. Second, it can work with retrieval databases that include unimodal (only code or natural language description) or bimodal instances (code-description pairs). We conduct experiments and extensive analysis on two benchmark datasets of code generation and summarization in Java and Python, and the promising results endorse the effectiveness of our proposed retrieval augmented framework.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2018

Supporting software documentation with source code summarization

Source code summarization is a process of generating summaries that desc...
research
06/15/2022

An Extractive-and-Abstractive Framework for Source Code Summarization

(Source) Code summarization aims to automatically generate summaries/com...
research
05/29/2021

CoDesc: A Large Code-Description Parallel Dataset

Translation between natural language and source code can help software d...
research
10/21/2022

Low-Resources Project-Specific Code Summarization

Code summarization generates brief natural language descriptions of sour...
research
03/08/2019

Developer Reading Behavior While Summarizing Java Methods: Size and Context Matters

An eye-tracking study of 18 developers reading and summarizing Java meth...
research
08/12/2022

Towards Code Summarization of APIs Using NLP Techniques

Each programming language comes with official documentation to guide dev...
research
01/27/2021

In-IDE Code Generation from Natural Language: Promise and Challenges

A great part of software development involves conceptualizing or communi...

Please sign up or login with your details

Forgot password? Click here to reset